KiloBench
AI Coding Model Benchmark Results
KiloBench is Kilo's proprietary benchmark suite and public results/leaderboard for AI coding models, running Terminal Bench 2.0 tasks through Kilo's actual agent harness to measure harness-specific completion rates and real production costs.
Recent stories
0 linked stories
No linked stories yet.