Docs: Update benchmark results

This commit is contained in:
github-actions[bot]
2025-10-13 12:58:11 +00:00
parent 4d3146e916
commit 1d4e1d84ac
4 changed files with 278 additions and 5 deletions

10
README
View File

@@ -27,9 +27,9 @@ openai/gpt-5-codex
The table below shows the pass/fail status for each model on each test.
<!-- RESULTS_START -->
Model | 1_dijkstra | 2_convex_hull | 3_lis | 4_determinant
----------------------------|------------|---------------|---------|---------------
google/gemini-2.5-pro | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail
anthropic/claude-sonnet-4.5 | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail
openai/gpt-5-codex | ❌ Fail | ❌ Fail | ❌ Fail | ❌ Fail
Model | 1_dijkstra | 2_convex_hull | 3_lis | 4_determinant
--------------------------- | ---------- | ------------- | --------- | -------------
google/gemini-2.5-pro | ❌ Fail | ⚪ Not Run | ⚪ Not Run | ⚪ Not Run
anthropic/claude-sonnet-4.5 | ❌ Fail | ⚪ Not Run | ⚪ Not Run | ⚪ Not Run
openai/gpt-5-codex | ❌ Fail | ⚪ Not Run | ⚪ Not Run | ⚪ Not Run
<!-- RESULTS_END -->