Mirror of https://github.com/multipleof4/lynchmark.git, synced 2026-01-14 00:27:55 +00:00
7b5aad47d1712d531cbf15ce21b16b9bb16bf4c9
LLM Algorithmic Benchmark
This repository contains a suite of difficult algorithmic tests to benchmark the code generation capabilities of various Large Language Models.
The tests are run automatically via GitHub Actions, and the results are updated in this README.
Configuration
Set the percentage of tests to run during the benchmark. 100% runs all tests.
RUN_PERCENTAGE: 25
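As a rough illustration of how a setting like this could be consumed, the sketch below reads the value out of a block of text and picks a random subset of tests of the matching size. The helper names (`parseRunPercentage`, `selectTests`), the fallback to 100 when the setting is missing, the rounding up, and the test names are all assumptions made for this example. The repository's actual runner may work differently.

```javascript
// Hypothetical helper: read "RUN_PERCENTAGE: <n>" out of some text.
function parseRunPercentage(text) {
  const match = text.match(/RUN_PERCENTAGE:\s*(\d+(?:\.\d+)?)/);
  if (!match) return 100; // assumption: run everything when the setting is absent
  return Math.min(100, Math.max(0, Number(match[1]))); // clamp to 0..100
}

// Hypothetical helper: choose a random subset covering `percentage` of the tests.
// Uses a Fisher-Yates shuffle on a copy, so the input array is left untouched.
function selectTests(tests, percentage, random = Math.random) {
  const count = Math.ceil((tests.length * percentage) / 100); // assumption: round up
  const shuffled = [...tests];
  for (let i = shuffled.length - 1; i > 0; i--) {
    const j = Math.floor(random() * (i + 1));
    [shuffled[i], shuffled[j]] = [shuffled[j], shuffled[i]];
  }
  return shuffled.slice(0, count);
}

// Placeholder test names, for illustration only.
const tests = ["t1", "t2", "t3", "t4", "t5", "t6", "t7", "t8"];
const percentage = parseRunPercentage("RUN_PERCENTAGE: 25");
const selected = selectTests(tests, percentage);
console.log(percentage, selected.length); // prints "25 2"
```

With 8 tests and a value of 25, two tests are chosen. Which two varies from run to run, because the selection is random.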
Models Under Test
The following models are included in the benchmark run.
google/gemini-2.5-pro
anthropic/claude-sonnet-4.5
openai/gpt-5-codex
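Each identifier above follows a `provider/model` pattern. A minimal sketch of splitting the identifiers into their two parts, for example to group results by provider, might look like the following. The `splitModelId` helper is hypothetical and is not taken from the repository.

```javascript
// Model identifiers copied from the list above.
const modelIds = [
  "google/gemini-2.5-pro",
  "anthropic/claude-sonnet-4.5",
  "openai/gpt-5-codex",
];

// Hypothetical helper: split "provider/model" at the first slash.
function splitModelId(id) {
  const slash = id.indexOf("/");
  if (slash <= 0 || slash === id.length - 1) {
    throw new Error(`Malformed model id: ${id}`);
  }
  return { provider: id.slice(0, slash), model: id.slice(slash + 1) };
}

const parsedModels = modelIds.map(splitModelId);
console.log(parsedModels.map((m) => m.provider).join(", "));
// prints "google, anthropic, openai"
```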
Benchmark Results
Live benchmark results, including pass/fail status and code generation time, are available on our results page.
The results are updated automatically via GitHub Actions.
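To make the two reported quantities concrete, here is a minimal sketch of how a single pass/fail status and code generation time could be recorded. Everything in it is an assumption for illustration: `benchmarkOne`, the `generate` and `check` callbacks, and the shape of the result object are hypothetical, and the stub generator stands in for a real model call.

```javascript
// Hypothetical harness step: time one code generation, then check the output.
// `generate` is any async function returning source code as a string.
// `check` receives that source and returns (or resolves to) true on success.
async function benchmarkOne(generate, check) {
  const start = performance.now();
  let code;
  try {
    code = await generate();
  } catch (err) {
    // Generation itself failed: report a failure with the time spent so far.
    return { passed: false, generationMs: performance.now() - start, error: String(err) };
  }
  const generationMs = performance.now() - start; // time spent generating only

  try {
    const passed = Boolean(await check(code));
    return { passed, generationMs, error: null };
  } catch (err) {
    // A crash while checking the generated code counts as a failed test.
    return { passed: false, generationMs, error: String(err) };
  }
}

// Stub standing in for a model call (assumption: a real run would call an LLM API).
const stubGenerate = async () => "function add(a, b) { return a + b; }";

// Illustrative check: evaluate the generated source and test one input.
// A real harness would want to sandbox untrusted generated code.
const checkAdd = (code) => new Function(`${code}; return add(2, 3);`)() === 5;

const resultPromise = benchmarkOne(stubGenerate, checkAdd);
resultPromise.then((result) => console.log(result.passed, typeof result.generationMs));
```

Timing stops before the check runs, so the recorded figure reflects generation time alone, not the cost of validating the output.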
Languages: JavaScript 85.2%, HTML 14.8%