
Share
GPT-5 faces its toughest challenge yet by parsing EVTX logs in Zig, showcasing the AI’s ability to tackle complex, real-world coding tasks that span thousands of lines.
Over the past few days, I've been putting GPT-5 through its paces by benchmarking it on a real-world programming task. The goal? To write a parser for the EVTX format (a binary format used for Windows Event Logs) in Zig. This is a medium-sized problem that requires a few thousand lines of code and has a well-defined spec, making it an excellent test case for evaluating AI-generated code.
I chose this task for several reasons:
I maintain an EVTX parser in Rust called evtx, which is still quite popular. When I originally wrote it, I used the spec from the Windows XML Event Log (EVTX) documentation.
The task for GPT-5 was straightforward:
For an added challenge, I initially withheld access to the actual Rust code to see how well GPT-5 could interpret the spec on its own. Eventually, I allowed it to peek at the Rust code to refine its solution.
I chose Zig for a few reasons:

GPT-5 delivered impressive results:
Here are some key points:
I've attempted this task with other models before, including Sonnet 3.5. Here's how GPT-5 compares:
GPT-5's ability to write and optimize a complex binary parser in Zig is a testament to its advanced code-generation capabilities. This real-world benchmark shows that GPT-5 can handle medium-sized programming tasks with high accuracy and performance.
If you're interested in the final result, the parser is
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
19 August 2025
88 articles
Related Articles
Related Articles
More Stories