Commit Graph

26 Commits

Author SHA1 Message Date
github-actions[bot]
932fcdce31 Docs: Update benchmark for x-ai/grok-4.1-fast 2025-11-20 01:45:32 +00:00
github-actions[bot]
5855cf8a6f Docs: Update benchmark results 2025-11-18 23:31:52 +00:00
github-actions[bot]
33b8150958 Docs: Update Gemini benchmark results 2025-11-18 22:04:41 +00:00
github-actions[bot]
76fb066932 Docs: Update Gemini benchmark results 2025-11-18 19:30:39 +00:00
github-actions[bot]
afcfd09537 Docs: Update benchmark results 2025-11-18 17:37:06 +00:00
github-actions[bot]
24de0a1a87 Docs: Update benchmark for openrouter/sherlock-dash-alpha 2025-11-16 00:31:49 +00:00
github-actions[bot]
9450b6f936 Docs: Update benchmark for openrouter/sherlock-think-alpha 2025-11-16 00:31:00 +00:00
github-actions[bot]
0d5effb238 Docs: Update benchmark for moonshotai/kimi-k2-thinking 2025-11-15 00:13:33 +00:00
github-actions[bot]
9a64997884 Docs: Update benchmark results 2025-11-14 03:31:28 +00:00
d33b11c8fd Revert: Update test.js 2025-11-13 16:49:27 -08:00
4b407b5f3d Refactor: Export test case inputs for debug page 2025-11-13 16:45:23 -08:00
github-actions[bot]
f2ef5831a7 Docs: Update benchmark results 2025-11-13 21:50:29 +00:00
github-actions[bot]
a38ae2d0c5 Docs: Update benchmark results 2025-11-13 21:24:36 +00:00
ddb18f5d70 Revert: Update test.js 2025-11-13 13:03:31 -08:00
e6fa9c76db Feat: Return test result for output recording 2025-11-13 12:49:11 -08:00
github-actions[bot]
1687dca49c Docs: Update benchmark results 2025-11-07 22:07:45 +00:00
6df4fca643 Fix: Correct LIS test assertion message 2025-11-07 13:51:48 -08:00
github-actions[bot]
d0bc3b95dd Docs: Update benchmark results 2025-11-07 21:32:49 +00:00
9fed40296c Update test.js 2025-10-14 05:26:34 -07:00
6395540454 Fix: Remove duplicate export from model output 2025-10-13 11:56:34 -07:00
github-actions[bot]
af1053eeb0 Docs: Update benchmark results 2025-10-13 18:37:08 +00:00
84f6eed585 Refactor: Use shared prompt from README config 2025-10-13 10:57:52 -07:00
25d46a0d8b Refactor: Make test harness browser-compatible 2025-10-13 10:24:42 -07:00
285cf114db Feat: Update prompt instructions for LLMs 2025-10-13 06:22:59 -07:00
fc7b5e22e4 Refactor: Convert test definition to ES Module 2025-10-13 06:05:28 -07:00
e53b4e2bc8 Feat: Reorganizing tests into subdirectories 2025-10-13 05:50:48 -07:00