Set `RUN_PERCENTAGE` to the percentage of tests to run during the benchmark. A value of 100 runs all tests.

<!-- CONFIG_START -->
RUN_PERCENTAGE: 100
SHARED_PROMPT: "Provide production-ready and maintainable JavaScript code. Apply code golfing practices but don't put everything in a single line. No comments. Your code will execute in the browser."
<!-- CONFIG_END -->
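As an illustration of how a block like this can be read, here is a minimal sketch that extracts the `KEY: value` pairs between the config markers. The function name `parseConfigBlock` and the parsing rules (split on the first colon, strip surrounding double quotes, convert plain numbers) are assumptions for this example and may differ from what the benchmark actually does.

```javascript
// Hypothetical helper, not taken from the benchmark's source.
// Reads the KEY: value lines found between the CONFIG markers of a README string.
function parseConfigBlock(text) {
  // Markers are assembled from parts so this example never contains a literal marker.
  const start = `<!-- ${'CONFIG'}_START -->`;
  const end = `<!-- ${'CONFIG'}_END -->`;
  const from = text.indexOf(start);
  if (from === -1) return {};
  const to = text.indexOf(end, from + start.length);
  if (to === -1) return {};

  const config = {};
  for (const line of text.slice(from + start.length, to).split(/\r?\n/)) {
    // Assumption: the first colon separates the key from the value.
    const idx = line.indexOf(':');
    if (idx === -1) continue;
    const key = line.slice(0, idx).trim();
    if (!key) continue;
    let value = line.slice(idx + 1).trim();
    // Assumption: a value wrapped in double quotes is a plain string.
    if (value.length >= 2 && value.startsWith('"') && value.endsWith('"')) {
      value = value.slice(1, -1);
    }
    // Assumption: purely numeric values are meant as numbers.
    config[key] = /^\d+(\.\d+)?$/.test(value) ? Number(value) : value;
  }
  return config;
}
```

Calling `parseConfigBlock(readmeText)` on this file's contents would yield an object with a numeric `RUN_PERCENTAGE` and a string `SHARED_PROMPT`. Splitting on the first colon only keeps any colons inside the prompt text intact.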


The following models are included in the benchmark run.

<!-- MODELS_START -->
openai/gpt-5.1-codex
openai/gpt-5.1-chat
google/gemini-2.5-pro
anthropic/claude-sonnet-4.5 TEMP:0.7
<!-- MODELS_END -->
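The model list can be read the same way. The sketch below assumes one model per line, with the model identifier first and optional whitespace-separated options after it. Treating `TEMP:0.7` as a per-model sampling temperature is an assumption based on its name; the function name `parseModelsBlock` and the `temperature` field are likewise hypothetical.

```javascript
// Hypothetical helper, not taken from the benchmark's source.
// Reads one model per line from between the MODELS markers of a README string.
function parseModelsBlock(text) {
  // Markers are assembled from parts so this example never contains a literal marker.
  const start = `<!-- ${'MODELS'}_START -->`;
  const end = `<!-- ${'MODELS'}_END -->`;
  const from = text.indexOf(start);
  if (from === -1) return [];
  const to = text.indexOf(end, from + start.length);
  if (to === -1) return [];

  return text
    .slice(from + start.length, to)
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter(Boolean)
    .map((line) => {
      // Assumption: the first token is the model id, the rest are options.
      const [id, ...options] = line.split(/\s+/);
      const model = { id };
      for (const option of options) {
        // Assumption: TEMP:<number> sets a per-model temperature.
        const match = option.match(/^TEMP:(\d+(?:\.\d+)?)$/);
        if (match) model.temperature = Number(match[1]);
      }
      return model;
    });
}
```

Under these assumptions, a line without options produces an object with only an `id`, so a caller can fall back to whatever default temperature the provider uses.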