AppWorld
A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Open-source benchmark and execution environment for interactive coding agents, distributed as a Python package with a local CLI, task explorer, and notebook support.
Recent stories
0 linked stories
No linked stories yet.