mirror of
https://github.com/multipleof4/lynchmark.git
synced 2026-01-14 08:37:56 +00:00
Delete README.md
This commit is contained in:
29
README.md
29
README.md
@@ -1,29 +0,0 @@
|
|||||||
# LLM Algorithmic Benchmark
|
|
||||||
|
|
||||||
This repository contains a suite of difficult algorithmic tests to benchmark the code generation capabilities of various Large Language Models.
|
|
||||||
|
|
||||||
The tests are run automatically via GitHub Actions, and the results are updated in this README.
|
|
||||||
|
|
||||||
## Configuration
|
|
||||||
|
|
||||||
Set the percentage of tests to run during the benchmark. 100% runs all tests.
|
|
||||||
|
|
||||||
<!-- CONFIG_START -->
|
|
||||||
RUN_PERCENTAGE: 25
|
|
||||||
<!-- CONFIG_END -->
|
|
||||||
|
|
||||||
## Models Under Test
|
|
||||||
|
|
||||||
The following models are included in the benchmark run.
|
|
||||||
|
|
||||||
<!-- MODELS_START -->
|
|
||||||
google/gemini-2.5-pro
|
|
||||||
anthropic/claude-sonnet-4.5
|
|
||||||
openai/gpt-5-codex
|
|
||||||
<!-- MODELS_END -->
|
|
||||||
|
|
||||||
## Benchmark Results
|
|
||||||
|
|
||||||
Live benchmark results, including pass/fail status and code generation time, are available on our [results page](https://multipleof4.github.io/benchmark/).
|
|
||||||
|
|
||||||
The results are updated automatically via GitHub Actions.
|
|
||||||
Reference in New Issue
Block a user