Commit Graph

53 Commits

Author SHA1 Message Date
github-actions[bot]
b7056ba038 Docs: Update benchmark for x-ai/grok-4.20-beta 2026-03-12 20:26:15 +00:00
github-actions[bot]
bd43a83707 Docs: Update benchmark for openrouter/hunter-alpha 2026-03-11 21:27:35 +00:00
github-actions[bot]
264be17557 Docs: Update benchmark for openai/gpt-5.4 2026-03-05 19:17:32 +00:00
github-actions[bot]
35f7fc8803 Refactor: Remove stale benchmark outputs 2026-03-05 07:51:26 +00:00
github-actions[bot]
47ccd20b71 Docs: Update benchmark for google/gemini-3.1-flash-lite-preview 2026-03-03 22:42:09 +00:00
github-actions[bot]
dce9c2a746 Docs: Update benchmark for openai/gpt-5.3-codex 2026-02-24 23:25:47 +00:00
github-actions[bot]
e913b42a5d Docs: Update benchmark for openai/gpt-5.3-codex 2026-02-24 21:34:38 +00:00
github-actions[bot]
0cd9a70a26 Docs: Update benchmark for openai/gpt-5.3-codex 2026-02-24 19:48:17 +00:00
github-actions[bot]
75b92c832f Docs: Update benchmark for google/gemini-3.1-pro-preview 2026-02-19 16:23:35 +00:00
github-actions[bot]
79886a435d Docs: Update benchmark for anthropic/claude-sonnet-4.6 2026-02-17 22:10:27 +00:00
github-actions[bot]
42bc15a261 Docs: Update benchmark for minimax/minimax-m2.5 2026-02-12 16:36:02 +00:00
github-actions[bot]
91aa46a299 Docs: Update benchmark for z-ai/glm-5 2026-02-12 00:32:22 +00:00
github-actions[bot]
936cb91590 Docs: Update benchmark for openrouter/aurora-alpha 2026-02-09 18:45:17 +00:00
github-actions[bot]
3385fbc925 Refactor: Remove stale benchmark outputs 2026-02-06 21:01:23 +00:00
github-actions[bot]
4be9446973 Docs: Update benchmark for openrouter/pony-alpha 2026-02-06 20:56:45 +00:00
github-actions[bot]
19600ca84b Docs: Update benchmark for anthropic/claude-opus-4.6 TEMP:0.4 2026-02-05 19:52:20 +00:00
github-actions[bot]
73a72a2b7e Docs: Update benchmark for anthropic/claude-opus-4.6 TEMP:0.7 2026-02-05 19:39:59 +00:00
github-actions[bot]
5b116b55af Docs: Update benchmark for anthropic/claude-opus-4.6 2026-02-05 19:33:26 +00:00
github-actions[bot]
0f6d112bfb Docs: Update benchmark for moonshotai/kimi-k2.5 2026-01-28 02:13:45 +00:00
github-actions[bot]
3983c8eb1a Docs: Update benchmark for minimax/minimax-m2.1 2025-12-23 02:41:05 +00:00
github-actions[bot]
401af17eb6 Docs: Update benchmark for z-ai/glm-4.7 2025-12-23 02:30:02 +00:00
github-actions[bot]
073da08edc Docs: Update benchmark for google/gemini-3-flash-preview TEMP:0.35 test 7 2025-12-17 17:18:10 +00:00
github-actions[bot]
0bea7a0d26 Docs: Update benchmark for google/gemini-3-flash-preview 2025-12-17 16:44:44 +00:00
github-actions[bot]
c394909ee1 Docs: Update benchmark for openai/gpt-5.2 EFF:xhigh 2025-12-11 21:43:10 +00:00
github-actions[bot]
901bde7c8a Docs: Update benchmark for openai/gpt-5.2 2025-12-11 18:42:54 +00:00
github-actions[bot]
f276121fb0 Docs: Update benchmark for openai/gpt-5.1-codex-max 2025-12-05 16:05:03 +00:00
github-actions[bot]
8ab1e91949 Docs: Update benchmark for minimax/minimax-m2 2025-12-02 15:05:25 +00:00
github-actions[bot]
0b47c76f63 Docs: Update benchmark for deepseek/deepseek-v3.2 2025-12-01 15:02:03 +00:00
github-actions[bot]
ba567f4017 Docs: Update benchmark results 2025-11-27 19:36:55 +00:00
9b245a178b Delete tests/7_scheduler/outputs_gemini directory 2025-11-26 18:02:37 -08:00
github-actions[bot]
a175f18319 Docs: Update benchmark for openrouter/bert-nebulon-alpha 2025-11-25 21:20:48 +00:00
61e7aaec2e Fix: Compare timestamps instead of strict ISO strings 2025-11-24 14:17:19 -08:00
github-actions[bot]
aea139fe4a Docs: Update benchmark for anthropic/claude-opus-4.5 TEMP:0.7 2025-11-24 21:54:56 +00:00
github-actions[bot]
932fcdce31 Docs: Update benchmark for x-ai/grok-4.1-fast 2025-11-20 01:45:32 +00:00
github-actions[bot]
5855cf8a6f Docs: Update benchmark results 2025-11-18 23:31:52 +00:00
github-actions[bot]
33b8150958 Docs: Update Gemini benchmark results 2025-11-18 22:04:41 +00:00
github-actions[bot]
76fb066932 Docs: Update Gemini benchmark results 2025-11-18 19:30:39 +00:00
b5080d18a0 Fix: Enforce UTC calculations in scheduler prompt 2025-11-18 09:49:13 -08:00
github-actions[bot]
afcfd09537 Docs: Update benchmark results 2025-11-18 17:37:06 +00:00
github-actions[bot]
24de0a1a87 Docs: Update benchmark for openrouter/sherlock-dash-alpha 2025-11-16 00:31:49 +00:00
github-actions[bot]
9450b6f936 Docs: Update benchmark for openrouter/sherlock-think-alpha 2025-11-16 00:31:00 +00:00
github-actions[bot]
0d5effb238 Docs: Update benchmark for moonshotai/kimi-k2-thinking 2025-11-15 00:13:33 +00:00
github-actions[bot]
9a64997884 Docs: Update benchmark results 2025-11-14 03:31:28 +00:00
08242437db Fix: Clarify scheduler slot expectations and make test more lenient 2025-11-13 17:27:49 -08:00
4214ff9e93 Revert: Update test.js 2025-11-13 16:50:18 -08:00
4f27efa895 Refactor: Export test case inputs for debug page 2025-11-13 16:45:36 -08:00
github-actions[bot]
f2ef5831a7 Docs: Update benchmark results 2025-11-13 21:50:29 +00:00
github-actions[bot]
a38ae2d0c5 Docs: Update benchmark results 2025-11-13 21:24:36 +00:00
8d8d4ed108 Revert: Update test.js 2025-11-13 13:05:15 -08:00
dfa960cb78 Feat: Return test result for output recording 2025-11-13 12:49:22 -08:00