45 Commits

Author SHA1 Message Date
github-actions[bot]
3983c8eb1a Docs: Update benchmark for minimax/minimax-m2.1 2025-12-23 02:41:05 +00:00
github-actions[bot]
401af17eb6 Docs: Update benchmark for z-ai/glm-4.7 2025-12-23 02:30:02 +00:00
github-actions[bot]
3c12fd855f Docs: Update benchmark for google/gemini-3-flash-preview TEMP:0.35 2025-12-17 16:55:05 +00:00
github-actions[bot]
0bea7a0d26 Docs: Update benchmark for google/gemini-3-flash-preview 2025-12-17 16:44:44 +00:00
github-actions[bot]
c394909ee1 Docs: Update benchmark for openai/gpt-5.2 EFF:xhigh 2025-12-11 21:43:10 +00:00
github-actions[bot]
901bde7c8a Docs: Update benchmark for openai/gpt-5.2 2025-12-11 18:42:54 +00:00
github-actions[bot]
f276121fb0 Docs: Update benchmark for openai/gpt-5.1-codex-max 2025-12-05 16:05:03 +00:00
github-actions[bot]
8ab1e91949 Docs: Update benchmark for minimax/minimax-m2 2025-12-02 15:05:25 +00:00
github-actions[bot]
0b47c76f63 Docs: Update benchmark for deepseek/deepseek-v3.2 2025-12-01 15:02:03 +00:00
github-actions[bot]
ba567f4017 Docs: Update benchmark results 2025-11-27 19:36:55 +00:00
ef2e4a9dd1 Delete tests/1_dijkstra/outputs_gemini directory 2025-11-26 18:01:34 -08:00
github-actions[bot]
a175f18319 Docs: Update benchmark for openrouter/bert-nebulon-alpha 2025-11-25 21:20:48 +00:00
github-actions[bot]
aea139fe4a Docs: Update benchmark for anthropic/claude-opus-4.5 TEMP:0.7 2025-11-24 21:54:56 +00:00
github-actions[bot]
932fcdce31 Docs: Update benchmark for x-ai/grok-4.1-fast 2025-11-20 01:45:32 +00:00
github-actions[bot]
5855cf8a6f Docs: Update benchmark results 2025-11-18 23:31:52 +00:00
github-actions[bot]
33b8150958 Docs: Update Gemini benchmark results 2025-11-18 22:04:41 +00:00
github-actions[bot]
76fb066932 Docs: Update Gemini benchmark results 2025-11-18 19:30:39 +00:00
github-actions[bot]
afcfd09537 Docs: Update benchmark results 2025-11-18 17:37:06 +00:00
github-actions[bot]
24de0a1a87 Docs: Update benchmark for openrouter/sherlock-dash-alpha 2025-11-16 00:31:49 +00:00
github-actions[bot]
9450b6f936 Docs: Update benchmark for openrouter/sherlock-think-alpha 2025-11-16 00:31:00 +00:00
github-actions[bot]
0d5effb238 Docs: Update benchmark for moonshotai/kimi-k2-thinking 2025-11-15 00:13:33 +00:00
github-actions[bot]
9a64997884 Docs: Update benchmark results 2025-11-14 03:31:28 +00:00
e6f5a570a8 Fix: Make graph truly bidirectional as stated 2025-11-13 17:29:00 -08:00
f2e9b766dc Revert: Update test.js 2025-11-13 16:48:59 -08:00
8830581edb Refactor: Export test case inputs for debug page 2025-11-13 16:45:18 -08:00
github-actions[bot]
f2ef5831a7 Docs: Update benchmark results 2025-11-13 21:50:29 +00:00
github-actions[bot]
a38ae2d0c5 Docs: Update benchmark results 2025-11-13 21:24:36 +00:00
2a70b34478 Revert: Update test.js 2025-11-13 13:02:46 -08:00
94e9f9db94 Feat: Return test result for output recording 2025-11-13 12:48:59 -08:00
github-actions[bot]
1687dca49c Docs: Update benchmark results 2025-11-07 22:07:45 +00:00
github-actions[bot]
d0bc3b95dd Docs: Update benchmark results 2025-11-07 21:32:49 +00:00
70d0bf27e6 Update test.js 2025-10-14 05:11:32 -07:00
github-actions[bot]
af1053eeb0 Docs: Update benchmark results 2025-10-13 18:37:08 +00:00
ac4d26a964 Refactor: Use shared prompt from README config 2025-10-13 10:57:46 -07:00
github-actions[bot]
def79ffc8a Docs: Update benchmark results 2025-10-13 17:40:23 +00:00
136cfaa309 Refactor: Make test harness browser-compatible 2025-10-13 10:24:36 -07:00
4ba46b035c Delete: Old generated file format 2025-10-13 10:24:33 -07:00
eb61775ecf Delete: Old generated file format 2025-10-13 10:24:30 -07:00
7c950bf7e9 Delete: Old generated file format 2025-10-13 10:24:25 -07:00
github-actions[bot]
3e9fd184d5 Docs: Update benchmark results 2025-10-13 13:28:41 +00:00
315813b3be Feat: Update prompt instructions for LLMs 2025-10-13 06:22:35 -07:00
github-actions[bot]
f2defd7b70 Docs: Update benchmark results 2025-10-13 13:07:35 +00:00
43619dd01c Refactor: Convert test definition to ES Module 2025-10-13 06:05:23 -07:00
github-actions[bot]
1d4e1d84ac Docs: Update benchmark results 2025-10-13 12:58:11 +00:00
7f55b7aa6e Feat: Reorganizing tests into subdirectories 2025-10-13 05:50:40 -07:00