|
|
cb478aa21a
|
Delete: Replacing with Transpiler test
|
2025-11-26 16:49:30 -08:00 |
|
github-actions[bot]
|
a175f18319
|
Docs: Update benchmark for openrouter/bert-nebulon-alpha
|
2025-11-25 21:20:48 +00:00 |
|
github-actions[bot]
|
aea139fe4a
|
Docs: Update benchmark for anthropic/claude-opus-4.5 TEMP:0.7
|
2025-11-24 21:54:56 +00:00 |
|
github-actions[bot]
|
932fcdce31
|
Docs: Update benchmark for x-ai/grok-4.1-fast
|
2025-11-20 01:45:32 +00:00 |
|
github-actions[bot]
|
5855cf8a6f
|
Docs: Update benchmark results
|
2025-11-18 23:31:52 +00:00 |
|
github-actions[bot]
|
33b8150958
|
Docs: Update Gemini benchmark results
|
2025-11-18 22:04:41 +00:00 |
|
github-actions[bot]
|
76fb066932
|
Docs: Update Gemini benchmark results
|
2025-11-18 19:30:39 +00:00 |
|
github-actions[bot]
|
afcfd09537
|
Docs: Update benchmark results
|
2025-11-18 17:37:06 +00:00 |
|
github-actions[bot]
|
24de0a1a87
|
Docs: Update benchmark for openrouter/sherlock-dash-alpha
|
2025-11-16 00:31:49 +00:00 |
|
github-actions[bot]
|
9450b6f936
|
Docs: Update benchmark for openrouter/sherlock-think-alpha
|
2025-11-16 00:31:00 +00:00 |
|
github-actions[bot]
|
0d5effb238
|
Docs: Update benchmark for moonshotai/kimi-k2-thinking
|
2025-11-15 00:13:33 +00:00 |
|
github-actions[bot]
|
9a64997884
|
Docs: Update benchmark results
|
2025-11-14 03:31:28 +00:00 |
|
|
|
9c33f7f591
|
Revert: Update test.js
|
2025-11-13 16:49:32 -08:00 |
|
|
|
7ed0b15f54
|
Refactor: Export test case inputs for debug page
|
2025-11-13 16:45:26 -08:00 |
|
github-actions[bot]
|
f2ef5831a7
|
Docs: Update benchmark results
|
2025-11-13 21:50:29 +00:00 |
|
github-actions[bot]
|
a38ae2d0c5
|
Docs: Update benchmark results
|
2025-11-13 21:24:36 +00:00 |
|
|
|
d811c29f99
|
Revert: Update test.js
|
2025-11-13 13:03:54 -08:00 |
|
|
|
268c81d873
|
Revert: Update anthropic_claude-sonnet-4.5.js
|
2025-11-13 13:03:47 -08:00 |
|
|
|
851c07845d
|
Feat: Return test result for output recording
|
2025-11-13 12:49:14 -08:00 |
|
github-actions[bot]
|
1687dca49c
|
Docs: Update benchmark results
|
2025-11-07 22:07:45 +00:00 |
|
github-actions[bot]
|
d0bc3b95dd
|
Docs: Update benchmark results
|
2025-11-07 21:32:49 +00:00 |
|
github-actions[bot]
|
af1053eeb0
|
Docs: Update benchmark results
|
2025-10-13 18:37:08 +00:00 |
|
|
|
d0917e8b3e
|
Refactor: Use shared prompt from README config
|
2025-10-13 10:57:55 -07:00 |
|
|
|
b8a8d2fa75
|
Refactor: Make test harness browser-compatible
|
2025-10-13 10:24:44 -07:00 |
|
|
|
7a0f9d6d82
|
Feat: Update prompt instructions for LLMs
|
2025-10-13 06:23:03 -07:00 |
|
|
|
7efae39296
|
Refactor: Convert test definition to ES Module
|
2025-10-13 06:05:31 -07:00 |
|
|
|
06d66aa1ad
|
Feat: Reorganizing tests into subdirectories
|
2025-10-13 05:50:51 -07:00 |
|