AssistantBench
Benchmark for AI assistant evaluation
Open benchmark software for evaluating assistant-style language tasks.

Recent stories
0 linked stories
No linked stories yet.
Benchmark for AI assistant evaluation
Open benchmark software for evaluating assistant-style language tasks.
