⬤ xAI's latest model is making waves after fresh Arena Expert Leaderboard rankings dropped online. Grok 4.1 (Thinking) now holds the #1 spot, leaving behind top systems from Anthropic, OpenAI, and Google. The result is being seen as a major shift in the AI race, with Grok pushing past Claude Sonnet, GPT models, and Gemini in expert evaluations.
⬤ The leaderboard shows Grok 4.1 leading with 1510 points (Preliminary). Right behind it sits Claude Sonnet 4.5 Thinking (32K) at 1509, Claude Sonnet 4.5 at 1487, Claude Opus 4.1 Thinking (16K) at 1482, and Gemini 2.5 Pro at 1468. Lower positions include Qwen3 Max Preview, GPT-5-High, and other proprietary and open systems. The ranking puts Grok ahead of a packed field of advanced AI models from established research teams.
⬤ Online reactions highlighted how surprising the competitive flip was, especially given that some competing labs work with much bigger budgets. The discussion focused on Elon Musk's involvement and framed Grok's jump as a David-versus-Goliath moment against well-funded AI giants. The updated standings sparked wider conversation about xAI's speed of progress and where Grok 4.1 now stands against industry heavyweights.
⬤ Musk also shared that "Grok 4.1 just released" and told users to expect "a significant increase in speed and quality." Pairing a fresh release with a top benchmark ranking makes this a standout moment in the AI development race. With Grok 4.1 now sitting first on a closely watched leaderboard, the update could shift how companies measure model strength, reshape expectations for what's coming next, and fuel ongoing debates about AI capabilities across the industry.
Sergey Diakov
Sergey Diakov