Last updated: 2025-03-17

OpenAI

computer-use-preview

computer-use-preview-2025-03-11

active

## computer-use-preview

A versatile model offering a range of computer use capabilities. It accepts and produces text, image, video, and audio, and supports transcription and text-to-speech. The context window is 128,000 tokens, and the model supports fine-tuning for custom applications.

Model Timeline

Launch Date

2025-03-11

Capabilities

Text

Input Pricing

$0.01/1k tokens

Context: 128,000 tokens

Output Pricing

$0.03/1k tokens

Max tokens: 4,096
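
As a rough illustration of how the text rates and limits listed above combine, the sketch below estimates the cost of a single request. The helper name `estimate_text_cost` is hypothetical, and the check that input and output tokens share the 128,000-token context window is an assumption, not something the listing states.

```python
from decimal import Decimal

# Rates and limits copied from the listing above.
INPUT_USD_PER_1K = Decimal("0.01")
OUTPUT_USD_PER_1K = Decimal("0.03")
CONTEXT_WINDOW = 128_000
MAX_OUTPUT_TOKENS = 4_096


def estimate_text_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one text request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the listed max tokens")
    # Assumption: input and output tokens count against the same window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    billed = (Decimal(input_tokens) * INPUT_USD_PER_1K
              + Decimal(output_tokens) * OUTPUT_USD_PER_1K)
    return billed / 1000  # rates are quoted per 1,000 tokens


print(estimate_text_cost(10_000, 1_000))
```

With these rates, 10,000 input tokens and 1,000 output tokens come to 0.13 USD. `Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding noise.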

Vision Capabilities

Max resolution: 2048x2048
Max images per prompt: 10
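
A client could check a batch of images against these two limits before sending a prompt. The following is a minimal sketch: `check_images` is a hypothetical helper, and it assumes that "2048x2048" means neither side of an image may exceed 2048 pixels, which the listing does not spell out.

```python
# Limits copied from the listing above.
MAX_SIDE_PX = 2048
MAX_IMAGES_PER_PROMPT = 10


def check_images(sizes: list[tuple[int, int]]) -> list[str]:
    """Return a list of problems; an empty list means the batch fits."""
    problems = []
    if len(sizes) > MAX_IMAGES_PER_PROMPT:
        problems.append(
            f"{len(sizes)} images exceeds the limit of {MAX_IMAGES_PER_PROMPT}"
        )
    for index, (width, height) in enumerate(sizes):
        # Assumption: the resolution limit applies to each side separately.
        if width > MAX_SIDE_PX or height > MAX_SIDE_PX:
            problems.append(
                f"image {index} is {width}x{height}, "
                f"above {MAX_SIDE_PX}x{MAX_SIDE_PX}"
            )
    return problems


print(check_images([(1024, 768), (4096, 2160)]))
```

Returning a list of messages rather than raising on the first failure lets a caller report every oversized image in one pass.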

Video

Input Pricing

$0.0025/second

Transcription

Transcription Pricing

$0.006/minute

Text-to-Speech

Text-to-Speech Pricing

$0.01/1k characters

Embeddings

Embeddings Pricing

$0.02/1k tokens
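
The non-text rates above use different billing units (seconds, minutes, characters, tokens), so a combined estimate needs each quantity converted with its own rate. The sketch below shows one way to do that. The dictionary keys and the `estimate_usage_cost` helper are hypothetical names, and billing is assumed to be strictly linear, with no rounding up to whole minutes or blocks of 1,000.

```python
from decimal import Decimal

# USD per single unit, derived from the rates listed above.
USD_PER_UNIT = {
    "video_seconds": Decimal("0.0025"),
    "transcription_minutes": Decimal("0.006"),
    "tts_characters": Decimal("0.01") / 1000,    # quoted per 1k characters
    "embedding_tokens": Decimal("0.02") / 1000,  # quoted per 1k tokens
}


def estimate_usage_cost(usage: dict[str, int]) -> Decimal:
    """Sum the estimated USD cost across modalities (hypothetical helper)."""
    total = Decimal("0")
    for kind, quantity in usage.items():
        if kind not in USD_PER_UNIT:
            raise KeyError(f"no listed rate for {kind!r}")
        if quantity < 0:
            raise ValueError(f"{kind} quantity must be non-negative")
        # Assumption: cost scales linearly with the quantity used.
        total += Decimal(quantity) * USD_PER_UNIT[kind]
    return total


print(estimate_usage_cost({
    "video_seconds": 60,
    "transcription_minutes": 10,
    "tts_characters": 5_000,
    "embedding_tokens": 100_000,
}))
```

For the sample usage, the parts are 0.15 USD of video, 0.06 USD of transcription, 0.05 USD of text-to-speech, and 2.00 USD of embeddings, for a total of 2.26 USD.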