OpenAI's multimodal flagship model release
OpenAI's GPT-4o is a multimodal flagship model release announced on 2024-05-13, designed to reason across text, audio, vision, and video in real time.
First-party model page pricing; GPT-4o is described as a multimodal flagship model. The page also shows Batch API price as $2.50 input / $10.00 output per 1M tokens, but the standard API pricing recorded here is the main public pricing.
OpenAI’s first-party GPT-4o model page lists token pricing of $2.50 per 1M input tokens, $1.25 per 1M cached input tokens, and $10.00 per 1M output tokens. The same page identifies GPT-4o as a multimodal model with text and image inputs and text outputs.