OpenAI's multimodal flagship model release
OpenAI's GPT-4o is a multimodal flagship model release announced on 2024-05-13, designed to reason across text, audio, vision, and video in real time.
Standard API pricing shown on the GPT-4o model page; the same rates appear on the official pricing page. GPT-4o is described as accepting text and image inputs and producing text outputs.
OpenAI’s official GPT-4o model page lists standard text token pricing at $2.50 per 1M input tokens, $1.25 per 1M cached input tokens, and $10.00 per 1M output tokens.