Kimi-K2.6/.eval_results/swe_bench_verified.yaml
Add evaluation results for HLE, GPQA, AIME, HMMT, SWE-Bench, and Terminal-Bench (#4)
d9cb81b
- dataset:
    id: SWE-bench/SWE-bench_Verified
    task_id: swe_bench_%_resolved
  value: 80.2
  date: '2026-04-20'
  source:
    url: https://huggingface.co/moonshotai/Kimi-K2.6
    name: Model Card
    user: SaylorTwift