Commit Graph

221 Commits

Author SHA1 Message Date
9e7072eae2 Refactor: Move blog link to top, remove analysis section 2025-11-18 17:22:22 -08:00
12b58a2a7f Fix: Update canonical and OG URLs to lynchmark.com 2025-11-18 17:15:45 -08:00
f11a009ef8 Feat: Add blog section and update canonical URL 2025-11-18 17:14:04 -08:00
5796fb1da4 Update gemini-optimal-temperature.html 2025-11-18 16:26:51 -08:00
85d8870da1 Update gemini-optimal-temperature.html 2025-11-18 16:25:54 -08:00
646aa75c7a Feat: Add SEO-optimized analysis page for Gemini temperature findings 2025-11-18 16:18:54 -08:00
github-actions[bot] 5855cf8a6f Docs: Update benchmark results 2025-11-18 23:31:52 +00:00
341252fec1 Update README 2025-11-18 14:42:01 -08:00
3bb1fd3b49 Feat: Display both Average and Median optimal temperatures 2025-11-18 14:32:05 -08:00
04c4e6f393 Fix: Change optimal temp metric to average of highest scoring temps 2025-11-18 14:17:15 -08:00
github-actions[bot] 33b8150958 Docs: Update Gemini benchmark results 2025-11-18 22:04:41 +00:00
31a18dd4ee Update README 2025-11-18 12:13:18 -08:00
8aa2338b35 Feat: Calculate optimal temperature using weighted mean of correctness 2025-11-18 12:01:10 -08:00
a3c0e76a85 Feat: Calculate and display best performing temperature in gemini benchmark 2025-11-18 11:54:14 -08:00
github-actions[bot] 76fb066932 Docs: Update Gemini benchmark results 2025-11-18 19:30:39 +00:00
51a98c1e1b Fix: Import from outputs_gemini directory 2025-11-18 10:30:37 -08:00
53592ed9e3 Fix: Save Gemini tests to dedicated outputs_gemini directory 2025-11-18 10:30:33 -08:00
a9e3b6114a Feat: Add Gemini benchmark results page 2025-11-18 10:28:11 -08:00
7420f7d2cb Feat: Support Google API and Gemini mode 2025-11-18 10:28:01 -08:00
5decb550ab Feat: Add workflow for Gemini benchmark 2025-11-18 10:27:58 -08:00
e871eb0416 Update README 2025-11-18 10:14:32 -08:00
22faf89e9a Update README 2025-11-18 10:13:27 -08:00
d79ea01c2f Update README 2025-11-18 10:07:03 -08:00
7b97b484a3 Update README 2025-11-18 10:01:48 -08:00
36282d0a40 Delete debug.html 2025-11-18 09:50:14 -08:00
b5080d18a0 Fix: Enforce UTC calculations in scheduler prompt 2025-11-18 09:49:13 -08:00
github-actions[bot] afcfd09537 Docs: Update benchmark results 2025-11-18 17:37:06 +00:00
39d057f079 Update README 2025-11-18 08:50:09 -08:00
f238f40d48 Create gemini.html 2025-11-18 08:45:07 -08:00
7012269f53 Feat: Add geospatial analysis benchmark 2025-11-18 08:33:42 -08:00
ced91e61b5 Feat: Add scrypt-js benchmark test 2025-11-15 17:53:03 -08:00
github-actions[bot] 24de0a1a87 Docs: Update benchmark for openrouter/sherlock-dash-alpha 2025-11-16 00:31:49 +00:00
github-actions[bot] 9450b6f936 Docs: Update benchmark for openrouter/sherlock-think-alpha 2025-11-16 00:31:00 +00:00
fc98f13849 Update README 2025-11-15 16:24:54 -08:00
fd29f4653e Fix: add per-model grade summary 2025-11-14 16:17:25 -08:00
github-actions[bot] 0d5effb238 Docs: Update benchmark for moonshotai/kimi-k2-thinking 2025-11-15 00:13:33 +00:00
da087e9ca0 Feat: Add workflow for single model benchmarks 2025-11-14 16:01:08 -08:00
ab4f7671c0 Refactor: Allow benchmark runs for a single model 2025-11-14 16:01:04 -08:00
57f89cc881 Revert: Update index.html 2025-11-14 11:49:05 -08:00
3f399a20fb Update README 2025-11-14 11:46:08 -08:00
9d188647b1 Fix: add per-model grade summary 2025-11-13 19:49:15 -08:00
1b4541a603 Feat: Add summary grade row per model 2025-11-13 19:44:10 -08:00
github-actions[bot] 9a64997884 Docs: Update benchmark results 2025-11-14 03:31:28 +00:00
7052d4f4b5 Refactor: remove explicit CDN hint 2025-11-13 19:19:01 -08:00
1c9b2174d6 Revert: Update index.html 2025-11-13 18:56:21 -08:00
9932f76e57 Feat: add summaries trophies + table view 2025-11-13 18:46:02 -08:00
eb8cff6256 Fix: Clarify exact average calculation expectation 2025-11-13 17:29:02 -08:00
e6f5a570a8 Fix: Make graph truly bidirectional as stated 2025-11-13 17:29:00 -08:00
08242437db Fix: Clarify scheduler slot expectations and make test more lenient 2025-11-13 17:27:49 -08:00
09c5e5dc3c Fix: Correct expected value format in test 2025-11-13 17:18:29 -08:00