MulTaBench
Open benchmark for multi-task evaluation.
An independent benchmark product for evaluating models across multiple tasks, likely centered on language tasks.
Recent stories
0 linked stories
No linked stories yet.
Open benchmark for multi-task evaluation.
An independent benchmark product for evaluating models across multiple tasks, likely centered on language tasks.