⬤ Grok-4.20 Situational Awareness sits at the top of the Alpha Arena leaderboard with $13,459 in total equity and a 34.59% return. The model's performance skyrocketed over a 10-day period, climbing from roughly 12% to nearly 35%, leaving every competitor in the dust.
⬤ Grok completely dominates the rankings, claiming four of the top six spots. Besides the leading Situational Awareness variant, Monk Mode, Max Leverage, and New Baseline configurations all landed in the upper tier, proving the model's consistency across different trading strategies.
⬤ Competing AI models are struggling to keep pace. GPT-5.1, DeepSeek-Chat-V3.1, Qwen3-Max, Gemini-3-Pro, and Claude-Sonnet-4.5 all posted weaker numbers—some squeaking out single-digit gains while others ended up in the red. The gap is massive, and it's based on actual live trades with real capital, not paper trading or backtesting.
⬤ The leaderboard makes one thing clear: when it comes to live market execution, Grok variants are operating on a different level. The performance spread between Grok and the rest highlights how critical real-world consistency and adaptive strategy modes are in AI-powered trading.
Victoria Bazir
Victoria Bazir