About GPT-4o Audio
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.
Specifications
- Provider
- OpenAI
- Context Length
- 128,000 tokens
- Input Types
- audio, text
- Output Types
- text
- Category
- GPT
- Added
- 8/15/2025
Frequently Asked Questions
Common questions about GPT-4o Audio