GPT‑5.1‑Codex‑Max just got added to the OpenAI API. I benchmarked it the moment it became available. Here's what I found.
+ +Where It Lands
+ +The Takeaway
+Max scores one point better than regular Codex. That's something. But it's still worse than Gemini 3 Pro, Claude Opus 4.5, and DeepSeek v3.2. It's only on par with Claude Sonnet 4.5.
+Current Lynchmark Ranking
+What's coming: The rumors say OpenAI's upcoming model (codenamed "Garlic") arrives next week. The pressure is on. The anticipation is building. I'll benchmark it the moment it drops.
+ +— Lynchmark
+