IFBench
A benchmark for precise instruction following.
IFBench is an open benchmark and evaluation codebase for testing precise instruction following by language models. It includes out-of-domain verifiable constraints, corresponding verification functions, optional multi-turn constraint-isolation evaluation data, released Hugging Face datasets, and scripts for running evaluations against model outputs.
Recent stories
0 linked stories
No linked stories yet.