CUA-World
Benchmark for computer-use agents.
A benchmark and evaluation platform for computer-use agents on real-world tasks.

Recent stories
0 linked stories
No linked stories yet.
Benchmark for computer-use agents.
A benchmark and evaluation platform for computer-use agents on real-world tasks.
