Lynchmark Newsletter

gpt‑5.1‑codex‑max just got added to API after having been added to Codex 2 weeks ago. I benchmarked it. Here's what I found.

GPT‑5.1‑Codex‑Max

Just released to API

8/11 C‑

versus

GPT‑5.1‑Codex

Previous version

7/11 D

Where It Lands

Gemini 3 Pro

Claude Opus 4.5

DeepSeek v3.2

GPT‑5.1‑Codex‑Max

Claude Sonnet 4.5

GPT‑5.1‑Codex

The Takeaway

Max scores one point better than regular Codex. That's something. But it's still worse than Gemini 3 Pro, Claude Opus 4.5, and DeepSeek v3.2. It's only on par with Claude Sonnet 4.5.

Current Lynchmark Ranking

Google Gemini 3 Pro (Temperature: 0.35)

Anthropic Claude Opus 4.5

DeepSeek‑v3.2

GPT‑5.1‑Codex‑Max (new)

Claude Sonnet 4.5

The reality check: Even with this release, OpenAI is still far behind. This shows exactly why they declared "code red." The gap is real. They're not closing it fast enough.

What's coming: The rumors say OpenAI's upcoming model (codenamed "Garlic") arrives next week. The pressure is on. The anticipation is building. I'll benchmark it the moment it drops.

— Lynchmark