Flatten your repo for AI in seconds
Flatten repos. Prompt faster. One click → one GPT-ready file
Free Online & Desktop
whisper-large-v3
Whisper Large v3 is a powerful automatic speech recognition (ASR) model developed by OpenAI, capable of transcribing audio to text with impressive accuracy and speed.
Supports a 448 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.
Capable of generating structured output formats.
This model is well-suited for a wide range of speech-to-text applications, from meeting transcription to podcast captioning. With its large context window and multi-modal capabilities, Whisper Large v3 can provide reliable and efficient transcription services.
$0.11/ second
$0.11/ second
Per image pricing not available
$0.00025/second
$ 0.006 /minute
Not available
$0.00185/minute
$0.11/1k tokens
Flatten repos. Prompt faster. One click → one GPT-ready file
Free Online & Desktop