Refactor: Point to live results page in README

2025-10-13 10:33:35 -07:00
parent b5ae50e0fe
commit 1ea8cb5852

README.md Normal file

@@ -0,0 +1,29 @@
# LLM Algorithmic Benchmark
This repository contains a suite of difficult algorithmic tests to benchmark the code generation capabilities of various Large Language Models.
The tests run automatically via GitHub Actions, and the results are published to a live results page (linked below).
## Configuration
RUN_PERCENTAGE sets the percentage of tests executed in each benchmark run; a value of 100 runs the full suite.
<!-- CONFIG_START -->
RUN_PERCENTAGE: 25
<!-- CONFIG_END -->
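As a rough illustration of how a CI step might read this value back out of the README, here is a minimal Python sketch. The function name, file path, and validation are illustrative assumptions, not the repository's actual tooling:

```python
import re
from pathlib import Path

def read_run_percentage(readme_path: str = "README.md") -> int:
    """Extract RUN_PERCENTAGE from the CONFIG block in the README.

    Hypothetical helper: assumes the CONFIG_START/CONFIG_END markers
    wrap a single 'RUN_PERCENTAGE: <int>' line, as in this README.
    """
    text = Path(readme_path).read_text(encoding="utf-8")
    # Grab everything between the CONFIG_START and CONFIG_END markers.
    block = re.search(r"<!-- CONFIG_START -->(.*?)<!-- CONFIG_END -->",
                      text, re.DOTALL)
    if block is None:
        raise ValueError("CONFIG block not found in README")
    match = re.search(r"RUN_PERCENTAGE:\s*(\d+)", block.group(1))
    if match is None:
        raise ValueError("RUN_PERCENTAGE not set in CONFIG block")
    percentage = int(match.group(1))
    if not 0 < percentage <= 100:
        raise ValueError(f"RUN_PERCENTAGE must be in (0, 100], got {percentage}")
    return percentage
```

With the value above, such a helper would return 25, and a runner could sample a quarter of the test suite accordingly.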
## Models Under Test
The following models are included in the benchmark run.
<!-- MODELS_START -->
google/gemini-2.5-pro
anthropic/claude-sonnet-4.5
openai/gpt-5-codex
<!-- MODELS_END -->
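The model list can be parsed the same way. A minimal sketch under the same assumptions (each non-empty line between the markers is one provider/model identifier; the helper name is hypothetical):

```python
import re
from pathlib import Path

def read_models(readme_path: str = "README.md") -> list[str]:
    """Extract model identifiers listed between the MODELS markers.

    Hypothetical helper: assumes one 'provider/model' identifier per line,
    as in this README's MODELS block.
    """
    text = Path(readme_path).read_text(encoding="utf-8")
    block = re.search(r"<!-- MODELS_START -->(.*?)<!-- MODELS_END -->",
                      text, re.DOTALL)
    if block is None:
        raise ValueError("MODELS block not found in README")
    # Each non-empty line inside the block is one model under test.
    return [line.strip() for line in block.group(1).splitlines() if line.strip()]
```

For this README, the sketch would return the three identifiers listed above, which a runner could then iterate over when dispatching benchmark jobs.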
## Benchmark Results
Live benchmark results, including pass/fail status and code generation time, are available on our [results page](https://multipleof4.github.io/benchmark/).
The results are updated automatically via GitHub Actions.