Deployment Simulation
Using realistic conversation contexts to better estimate undesired model behavior before release.
Deployment Simulation is OpenAI's pre-deployment safety evaluation method/pipeline for simulating future model deployments by replaying de-identified prior conversations with a candidate model to estimate undesired behavior rates and surface new risks before release.
Recent stories
0 linked stories
No linked stories yet.