OpenAI's framework for evaluating models and prompts
A toolkit from OpenAI for creating, running, and analyzing evaluations of models and prompts.