Grok Imagine Video has secured the #1 position on the Arena Image-to-Video leaderboard, according to a post shared on X. The ranking puts xAI's model ahead of a crowded field of generative video systems that convert static images into animated clips. The result also continues a broader trend of Grok models rising in public evaluations, a pattern seen earlier when Grok 4.2.0 Beta topped Search Arena, edging out larger AI models.
Leaderboard data shows the grok-imagine-video-720p model leading with a score of 1406 (±7), backed by more than 49,000 user votes. Arena's evaluation framework relies on direct user comparisons and community voting, so its scores reflect how real users judge the outputs side by side.
Google's Veo 3.1 Trails Behind xAI in Close Competition
Several versions of Google's Veo 3.1 sit immediately behind Grok in the rankings. The veo-3.1-audio-1080p variant scores 1402, followed by veo-3.1-audio at 1395 and veo-3.1-fast-audio-1080p at 1383. The margins are narrow, signaling just how competitive the image-to-video space has become over a short period.
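To put those gaps in perspective, here is a rough sketch of what they could mean in head-to-head votes. It assumes Arena scores behave like Elo-style ratings on a 400-point logistic scale, which the source does not confirm; the function name `expected_win_rate` and the `scale` parameter are illustrative only.

```python
# Assumption: scores follow an Elo-style logistic model with a 400-point scale.
# The article does not state how Arena converts votes into scores.

def expected_win_rate(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Expected share of head-to-head votes for model A over model B."""
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


leader_score = 1406  # grok-imagine-video-720p, as reported in the article
rival_scores = {
    "veo-3.1-audio-1080p": 1402,
    "veo-3.1-audio": 1395,
    "veo-3.1-fast-audio-1080p": 1383,
}

for name, score in rival_scores.items():
    share = expected_win_rate(leader_score, score)
    print(f"grok-imagine-video-720p vs {name}: {share:.1%}")
```

Under that assumption, the leader would be expected to win only slightly more than half of its direct matchups against each Veo 3.1 variant, which is consistent with the narrow margins described above.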
The lower-resolution grok-imagine-video-480p also places within the top five, giving xAI two spots in the upper tier at once. Other models in the mix include vidu-q3-pro, wan2.5-i2v-preview, and kling-v3-pro, reflecting how many teams now treat image-to-video as a core capability.
Multimodal AI Advances Reshape Video Generation Benchmarks
Grok's leaderboard performance follows a string of competitive showings across different Arena categories. Grok Imagine previously broke into the top three on the Video Editing Arena leaderboard, suggesting xAI is building a consistent presence across multiple video and image benchmarks rather than peaking in a single category.
The broader context matters too. As generative video models improve, adjacent parts of the AI ecosystem are changing quickly as well. Research into memory and retrieval efficiency, explored in "AI Memory Evolution: 10x Efficiency Gains as RAG Systems Become Obsolete," reflects how quickly foundational AI infrastructure is being rethought. Together, these shifts point to an industry accelerating across every layer, from generation quality to the underlying systems that power it.
Usman Salis