Last updated: 16/04/2025

OpenAIOfficial Docs

GPT-4o mini Audio

gpt-4o-mini-audio-preview

active

GPT-4o mini Audio

A smaller version of the GPT-4o model, capable of handling audio inputs and outputs in addition to text. This makes it well-suited for applications involving speech recognition, text-to-speech, and audio-based interactions.

Supports a unknown token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.

Model Timeline

Launch Date

12/17/2024

Capabilities

Text

Input Pricing

$0.15/ MTok

Context: N/A tokens

Output Pricing

$0.60/ MTok

Video

Input Pricing

$0.0025/second

Audio

Input Pricing

$ 10.00 /minute

Generation Pricing

$20.00 /minute

Transcription

Transcription Pricing

$0.003/minute

Text-to-Speech

Text-to-Speech Pricing

$0.01/1k characters

Embeddings

Embeddings Pricing

$0.02/1k tokens

Fine-Tuning

Fine-Tuning Pricing

$3.00/MTok training

Flatten your repo for AI in seconds

Flatten repos. Prompt faster. One click → one GPT-ready file

Free Online & Desktop