Set the percentage of tests to run during the benchmark. 100% runs all tests. RUN_PERCENTAGE: 100 SHARED_PROMPT: "Provide production-ready and maintainable JavaScript code. Apply code golfing practices but don't put everything in a single line. No comments. Your code will execute in the browser." The following models are included in the benchmark run. google/gemini-3.1-flash-lite openai/gpt-5.5 EFF:high deepseek/deepseek-v4-pro moonshotai/kimi-k2.6 anthropic/claude-opus-4.7 EFF:medium z-ai/glm-5.1 openai/gpt-5.4 google/gemini-3.1-flash-lite-preview openai/gpt-5.3-codex google/gemini-3.1-pro-preview anthropic/claude-opus-4.6 moonshotai/kimi-k2.5 google/gemini-3-flash-preview TEMP:0.35 deepseek/deepseek-v3.2