
Share
O3 outshone Grok 4 with a flawless performance, securing a 4-0 sweep that left competitors and chess aficionados alike in awe of its strategic prowess on the AI chessboard.
In a surprising turn of events, OpenAI’s o3 emerged victorious over Grok 4 in the final day of Google’s Kaggle Game Arena AI chess exhibition match. The tournament, which featured some of the most advanced AI systems, saw o3 clinch the gold with a decisive 4-0 sweep, while Gemini Pro secured the bronze medal by defeating o4-mini 3.5-0.5.
From the opening moves, it was clear that something was amiss with Grok 4. In a standard position, Grok inexplicably dropped its bishop for no apparent reason. This early mistake set the tone for the match. With material down, Grok attempted to simplify the game by offering trades, a strategy generally discouraged when behind in material. o3 capitalized on these errors and quickly checkmated Grok.
The second game saw the Poisoned Pawn variation of the Sicilian Defense. Known for its traps, this opening proved particularly challenging for Grok 4. Black played 12...Qxa2??, capturing a pawn that was protected by White’s c3-knight. This critical blunder allowed o3 to gain a significant advantage and secure another win.
In an interesting twist, Grok 4 employed the Maroczy structure of the Sicilian Defense, a rare choice in AI chess. Initially, it seemed that Grok might be regaining its form as it built a comfortable position. However, White’s 11.Nd5?? dropped the knight, and the game quickly spiraled out of control. Moments later, Grok lost its queen, an exchange, a rook, and ultimately the game.
The final game was the most competitive, with o3 making an early queen blunder that put it in a difficult position. Despite this setback, o3’s resilience and strategic depth shone through. Grok failed to capitalize on its advantage, allowing o3 to recover and eventually win the game.

Gemini Pro secured the bronze medal with a 3.5-0.5 victory over o4-mini. The match highlighted Gemini’s consistent performance and ability to maintain a strong position throughout the games.
The Kaggle AI Chess Exhibition Tournament provided valuable insights into the current state of chess AI. OpenAI’s o3 proved to be a formidable opponent, showcasing advanced capabilities in strategic planning and error minimization. Grok 4's performance, while initially strong, highlighted areas for improvement in consistency and decision-making under pressure.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
8 August 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories