From 3264242b136b1e7ce1daee70151ecb7132e49bbd Mon Sep 17 00:00:00 2001
From: "github-actions[bot]"
Date: Mon, 13 Oct 2025 12:37:53 +0000
Subject: [PATCH] Docs: Update benchmark results

---
 README.md | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 6855515..1d8d73c 100644
--- a/README.md
+++ b/README.md
@@ -19,6 +19,11 @@ openai/gpt-5-codex
 The table below shows the pass/fail status for each model on each test.
-*No results yet. Run the benchmark workflow to generate them.*
+| Model | 1_dijkstra | 2_convex_hull | 3_lis | 4_determinant |
+| --- | --- | --- | --- | --- |
+| google/gemini-2.5-pro | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail |
+| anthropic/claude-sonnet-4.5 | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail |
+| openai/gpt-5-codex | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail |
+