AssistantBench
Benchmark for evaluating AI assistants
A benchmark for evaluating AI assistants on task-oriented workflows and text-based interactions.

