KramaBench
Open-source benchmark for end-to-end data-science agents.
An open-source benchmark for end-to-end data-science agents that evaluate full data-lake-to-insight pipelines across multiple domains.
Recent stories
0 linked stories
No linked stories yet.