Feat: Create initial README for benchmark

2026-07-20 23:05:44 +00:00 · 2025-10-13 05:28:00 -07:00
parent d4cf599991
commit 959db28c25
1 changed files with 24 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,24 @@
 # LLM Algorithmic Benchmark
 This repository contains a suite of difficult algorithmic tests to benchmark the code generation capabilities of various Large Language Models.
 The tests are run automatically via GitHub Actions, and the results are updated in this README.
 ## Models Under Test
 The following models are included in the benchmark run.
 <!-- MODELS_START -->
 google/gemini-2.5-pro
 anthropic/claude-sonnet-4.5
 openai/gpt-5-codex
 <!-- MODELS_END -->
 ## Benchmark Results
 The table below shows the pass/fail status for each model on each test.
 <!-- RESULTS_START -->
 *No results yet. Run the benchmark workflow to generate them.*
 <!-- RESULTS_END -->