Evals
Framework for evaluating and comparing model performance.
OpenAI's Evals is a framework for building, running, and comparing evaluations of model outputs and prompt/model changes.

Recent stories
0 linked stories
No linked stories yet.
Framework for evaluating and comparing model performance.
OpenAI's Evals is a framework for building, running, and comparing evaluations of model outputs and prompt/model changes.
