Mirror of https://github.com/multipleof4/lynchmark.git (synced 2026-01-14 08:37:56 +00:00)
Refactor: Point to live results page in README
README.md (new file, 29 lines)

@@ -0,0 +1,29 @@

# LLM Algorithmic Benchmark
This repository contains a suite of difficult algorithmic tests to benchmark the code generation capabilities of various Large Language Models.
The tests are run automatically via GitHub Actions, and the results are published to the live results page linked below.
## Configuration
Set the percentage of tests to run during the benchmark; a value of 100 runs the full suite. The runner reads the value from the marker block below (see the parsing sketch after the block).
<!-- CONFIG_START -->
RUN_PERCENTAGE: 25
<!-- CONFIG_END -->
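
The runner that consumes this value is not shown in this README, so here is a minimal sketch, assuming a Python script reads the percentage from the marker block above. The function names and the deterministic selection strategy are illustrative, not the repository's actual pipeline:

```python
import re
from pathlib import Path

def read_run_percentage(readme_path: str = "README.md") -> int:
    """Extract RUN_PERCENTAGE from the CONFIG marker block in the README."""
    text = Path(readme_path).read_text(encoding="utf-8")
    block = re.search(r"<!-- CONFIG_START -->(.*?)<!-- CONFIG_END -->", text, re.DOTALL)
    if block is None:
        raise ValueError("CONFIG block not found in README")
    value = re.search(r"RUN_PERCENTAGE:\s*(\d+)", block.group(1))
    if value is None:
        raise ValueError("RUN_PERCENTAGE not set in CONFIG block")
    return int(value.group(1))

def select_tests(tests: list[str], percentage: int) -> list[str]:
    """Deterministically select the first N% of tests, sorted for stable runs."""
    count = max(1, round(len(tests) * percentage / 100))  # always run at least one test
    return sorted(tests)[:count]
```

With `RUN_PERCENTAGE: 25` and 40 tests, this sketch would run the first 10 tests in sorted order.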
## Models Under Test
The following models are included in the benchmark run; the runner reads them from the marker block below (see the sketch after the block).
<!-- MODELS_START -->
google/gemini-2.5-pro
anthropic/claude-sonnet-4.5
openai/gpt-5-codex
<!-- MODELS_END -->
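
As with the config block, the model list can be recovered from the README with a small parser. Again, this is a hedged sketch rather than the repository's actual code; only the marker names come from the block above:

```python
import re
from pathlib import Path

def read_models(readme_path: str = "README.md") -> list[str]:
    """Extract the model identifiers listed between the MODELS markers."""
    text = Path(readme_path).read_text(encoding="utf-8")
    block = re.search(r"<!-- MODELS_START -->(.*?)<!-- MODELS_END -->", text, re.DOTALL)
    if block is None:
        raise ValueError("MODELS block not found in README")
    # One "provider/model" identifier per non-empty line, e.g. "openai/gpt-5-codex".
    return [line.strip() for line in block.group(1).splitlines() if line.strip()]
```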
## Benchmark Results
Live benchmark results, including pass/fail status and code generation time, are available on our [results page](https://multipleof4.github.io/benchmark/).
The results are updated automatically via GitHub Actions.