Last updated: 17/03/2025

OpenAIOfficial Docs

o1

o1-preview

active

## o1 OpenAI's o1 model is a versatile AI assistant capable of handling a wide range of tasks, including text, image, video, audio, transcription, and text-to-speech. With a generous 128,000 token context window, this model is well-equipped to tackle complex queries and maintain coherent conversations. Supports a 128,000 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.

Additional Information

Notes

Pricing: $15.00 per 1M input tokens, $7.50 per 1M cached input tokens, $60.00 per 1M output tokens

Capabilities

Text

Input Pricing

$0.01/ KTok

Context: 128,000 tokens

Output Pricing

$0.07/ KTok

Max tokens: 4,096

Vision Capabilities

Max resolution: 2048x2048
Max images per prompt: 20

Image

Input Pricing

85 tokens/image

Transcription

Transcription Pricing

$0.006/minute

Text-to-Speech

Text-to-Speech Pricing

$0.01/1k characters

Embeddings

Embeddings Pricing

$0.13/1k tokens

Fine-Tuning

Fine-Tuning Pricing

$15.00/MTok training