Docs: Update benchmark results

2026-07-18 13:55:46 +00:00 · 2025-10-13 12:58:11 +00:00
parent 4d3146e916
commit 1d4e1d84ac
4 changed files with 278 additions and 5 deletions
--- a/10
+++ b/10
@@ -27,9 +27,9 @@ openai/gpt-5-codex
 The table below shows the pass/fail status for each model on each test.

 <!-- RESULTS_START -->
-    Model                       | 1_dijkstra | 2_convex_hull | 3_lis   | 4_determinant
-    ----------------------------|------------|---------------|---------|---------------
-    google/gemini-2.5-pro       | ❌ Fail    | ❌ Fail       | ❌ Fail | ❌ Fail
-    anthropic/claude-sonnet-4.5 | ❌ Fail    | ❌ Fail       | ❌ Fail | ❌ Fail
-    openai/gpt-5-codex          | ❌ Fail    | ❌ Fail       | ❌ Fail | ❌ Fail
+    Model                       | 1_dijkstra | 2_convex_hull | 3_lis     | 4_determinant
+    --------------------------- | ---------- | ------------- | --------- | -------------
+    google/gemini-2.5-pro       | ❌ Fail     | ⚪ Not Run     | ⚪ Not Run | ⚪ Not Run    
+    anthropic/claude-sonnet-4.5 | ❌ Fail     | ⚪ Not Run     | ⚪ Not Run | ⚪ Not Run    
+    openai/gpt-5-codex          | ❌ Fail     | ⚪ Not Run     | ⚪ Not Run | ⚪ Not Run    
 <!-- RESULTS_END -->