⬤ OpenAI just extended its lead in AI benchmarking. GPT-5.1 now sits at the top of the Artificial Analysis Intelligence Index with a score of 70 points under high reasoning effort—two points ahead of GPT-5's 68. The index chart shows GPT-5.1 leading a packed field of competing large language models.
⬤ GPT-5.1 is a minor version bump over GPT-5, with the biggest improvement showing up on TerminalBench—a test focused on agentic coding and terminal use. GPT-5.1 jumped 12 percentage points on that benchmark, which pushed its overall index score up by two points. Worth noting: when using minimal reasoning, GPT-5.1 showed no intelligence gains compared to GPT-5.
⬤ OpenAI also tweaked how the model interacts. GPT-5.1 has a warmer default personality, better tone controls, and follows system prompts and custom instructions more closely. On the efficiency side, GPT-5.1 used 81 million output tokens to complete the Intelligence Index tests—down from 85 million for GPT-5. That brought the total test cost to $859 versus $913 for GPT-5, even though pricing remains the same.
⬤ For anyone tracking the AI race, GPT-5.1's performance offers a clear comparison point. The higher score, targeted coding improvements, and stable pricing show OpenAI is refining its flagship model without raising costs—key factors for competitiveness and future enterprise adoption.
Saad Ullah
Saad Ullah