SlopCodeBench
Community driven benchmark for measuring code erosion under iterative specification refinement.
Community-driven benchmark for measuring code erosion under iterative specification refinement; evaluates coding agents across repeated checkpoints as requirements change.

Recent stories
0 linked stories
No linked stories yet.