The Open Platform for Cloud Coding Agents
Open-source, model-agnostic AI coding-agent platform for autonomous software development tasks. It runs locally or in the cloud, is driven through a CLI or an SDK, and supports self-hosted enterprise deployments.
OpenHands introduced EvoClaw, a benchmark that reconstructs milestone DAGs from repo history to test continuous software evolution instead of isolated tasks. The first results show agents can clear single tasks yet still collapse under regressions and technical debt over longer runs.
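To make the idea of testing continuous evolution over a milestone DAG concrete, here is a minimal sketch. It is not EvoClaw's actual harness or data format; the names `milestone_order`, `find_regressions`, `deps`, and `passing_after`, along with the example milestones, are hypothetical. It assumes each milestone lists the milestones it depends on, and that after the agent finishes a milestone we know which milestones' tests currently pass.

```python
from graphlib import TopologicalSorter


def milestone_order(deps):
    """Return milestones in an order that respects their dependencies.

    deps maps each milestone to the set of milestones it depends on
    (hypothetical format, chosen for this illustration).
    """
    return list(TopologicalSorter(deps).static_order())


def find_regressions(order, passing_after):
    """Map each milestone to the earlier milestones it broke.

    passing_after[m] is the set of milestones whose tests pass once the
    agent has finished milestone m. A regression is a milestone that was
    already completed but is no longer in that set.
    """
    regressions = {}
    done = []
    for m in order:
        broken = [p for p in done if p not in passing_after[m]]
        if broken:
            regressions[m] = broken
        done.append(m)
    return regressions


# Hypothetical three-milestone chain: parser -> cli -> plugins.
deps = {"parser": set(), "cli": {"parser"}, "plugins": {"cli"}}
order = milestone_order(deps)

# Assumed outcome: the agent completes every milestone in isolation,
# but its work on "plugins" breaks the earlier "parser" tests.
passing_after = {
    "parser": {"parser"},
    "cli": {"parser", "cli"},
    "plugins": {"cli", "plugins"},
}
regressions = find_regressions(order, passing_after)
```

In this made-up run every milestone passes its own tests, which is what an isolated-task benchmark would score, yet `regressions` records that `plugins` broke `parser`. That gap is the kind of failure a long-horizon evaluation is meant to surface.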
OpenHands published a skill-eval recipe with bounded tasks, deterministic verifiers, and no-skill baselines, then showed some skills speed agents up while others make them brittle. Teams shipping skill libraries should measure them per task and model before rollout.
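A per-task, per-model comparison against a no-skill baseline could be sketched as follows. This is an illustration under stated assumptions, not the published recipe: the function `skill_deltas`, the run-record layout, and the task and model names are hypothetical, and each run is assumed to have already been judged pass or fail by a deterministic verifier.

```python
from collections import defaultdict


def skill_deltas(runs):
    """Compare pass rates with and without a skill for each (task, model).

    runs is an iterable of (task, model, with_skill, passed) tuples, where
    with_skill=False marks a no-skill baseline run (hypothetical layout).
    Returns {(task, model): pass_rate_with_skill - pass_rate_baseline},
    skipping any pair that lacks runs in either arm.
    """
    # For each (task, model), keep [passes, total] for both arms.
    tally = defaultdict(lambda: {True: [0, 0], False: [0, 0]})
    for task, model, with_skill, passed in runs:
        cell = tally[(task, model)][with_skill]
        cell[0] += int(passed)
        cell[1] += 1

    deltas = {}
    for key, arms in tally.items():
        (skill_pass, skill_n), (base_pass, base_n) = arms[True], arms[False]
        if skill_n and base_n:
            deltas[key] = skill_pass / skill_n - base_pass / base_n
    return deltas


# Made-up verifier outcomes for one skill on two tasks and two models.
runs = [
    ("fix-lint", "model-a", True, True),
    ("fix-lint", "model-a", True, True),
    ("fix-lint", "model-a", False, True),
    ("fix-lint", "model-a", False, False),
    ("add-test", "model-b", True, False),
    ("add-test", "model-b", True, False),
    ("add-test", "model-b", False, True),
    ("add-test", "model-b", False, True),
]
deltas = skill_deltas(runs)
```

With these invented numbers the skill helps on one pair (a delta of 0.5) and hurts on the other (a delta of -1.0), which is why a single aggregate score can hide brittleness. A real evaluation would need many more runs per cell before trusting the sign of a delta.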