CUA-World
Benchmark for computer-use agents.
A Carnegie Mellon University software product for benchmarking or evaluating computer-use agents on realistic GUI tasks.

Recent stories
0 linked stories
No linked stories yet.
Benchmark for computer-use agents.
A Carnegie Mellon University software product for benchmarking or evaluating computer-use agents on realistic GUI tasks.
