
Share
Grok 4 emerges as the new champion in AI benchmarks, outperforming rivals like OpenAI o3 and Google Gemini with superior reasoning skills and performance across coding and math tasks.
xAI has released Grok 4, the latest iteration of their reasoning-focused large language model (LLM), and it's making waves. After running a comprehensive suite of benchmarks, we can confirm that Grok 4 is now the leading AI model, surpassing competitors like OpenAI o3, Google Gemini 2.5 Pro, Anthropic Claude 4 Opus, and DeepSeek R1 0528.
Grok 4 is designed to "think" before providing answers, a feature that sets it apart from other LLMs. This reasoning capability is evident in its strong performance across various benchmarks:

Grok 4 is currently available via the xAI API. The version deployed on X/Twitter may differ slightly due to additional instructions and logic that can affect style and behavior. We also anticipate Grok 4 being available through Microsoft Azure AI Foundry, following the footsteps of Grok 3 and Grok 3 mini.
Grok 4 marks a significant milestone for xAI, placing them at the forefront of the AI landscape. Its strong performance in reasoning, coding, and mathematical tasks, combined with competitive pricing, makes it an attractive choice for developers and researchers alike. As we continue to explore its capabilities, it will be interesting to see how Grok 4 evolves and influences future advancements in AI.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
10 July 2025
88 articles
Related Articles
Related Articles
More Stories