Last updated: 16/04/2025

OpenAIOfficial Docs

GPT-4o

gpt-4o-2024-11-20

active

GPT-4o

Fast, intelligent, and flexible GPT model with a wide range of capabilities, including text, image, video, audio, transcription, and text-to-speech.

Supports a wide token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications. Supports tool use for advanced automation. Capable of generating structured output formats.

Model Timeline

Launch Date

11/20/2024

Capabilities

Text

Input Pricing

$2.50/ MTok

Context: N/A tokens

Output Pricing

$10.00/ MTok

Vision Capabilities

Image

Generation Pricing

Per image pricing not available

Resolutions:
  • 1024x1024: $0.04
  • 1024x1792: $0.08

Video

Input Pricing

$0.008/second

Audio

Input Pricing

$ 0.006 /minute

Generation Pricing

$0.01 /minute

Transcription

Transcription Pricing

$0.006/minute

Text-to-Speech

Text-to-Speech Pricing

$0.01/1k characters

Embeddings

Embeddings Pricing

$0.13/1k tokens

Fine-Tuning

Fine-Tuning Pricing

$25.00/MTok training

Additional Model Information

Tool Use

Yes

Structured Output

Yes

Reasoning

Yes

Flatten your repo for AI in seconds

Flatten repos. Prompt faster. One click → one GPT-ready file

Free Online & Desktop