mistral-ocr-2503
## Mistral OCR
Mistral OCR is a document understanding API, introduced by Mistral as "the world's best document understanding API." It takes documents such as PDFs and images as input and returns the extracted text. The listing gives a 32768-token context window and indicates that fine-tuning is supported for custom applications.
3/1/2025
$0.70/MTok
$0.70/MTok
85 tokens/image
$0.0001/1k tokens
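
The figures above can be combined into a rough per-request cost estimate. The sketch below is illustrative only. It assumes the two $0.70/MTok entries are the input and output prices (the listing does not label them), that each image is billed as 85 input tokens, and that a request must fit in the 32768-token context window. The names `estimate_cost_usd` and `fits_context` are hypothetical helpers, not part of any Mistral SDK.

```python
# Figures copied from the listing; the input/output labels are an assumption.
INPUT_USD_PER_MTOK = 0.70
OUTPUT_USD_PER_MTOK = 0.70
TOKENS_PER_IMAGE = 85
CONTEXT_WINDOW_TOKENS = 32768


def input_token_count(text_tokens: int, images: int) -> int:
    """Total input tokens, counting each image at the listed flat rate."""
    return text_tokens + images * TOKENS_PER_IMAGE


def fits_context(text_tokens: int, images: int, output_tokens: int) -> bool:
    """Check whether input plus output stays within the listed context window."""
    total = input_token_count(text_tokens, images) + output_tokens
    return total <= CONTEXT_WINDOW_TOKENS


def estimate_cost_usd(text_tokens: int, images: int, output_tokens: int) -> float:
    """Estimate the request cost in USD from per-million-token prices."""
    input_tokens = input_token_count(text_tokens, images)
    return (
        input_tokens * INPUT_USD_PER_MTOK
        + output_tokens * OUTPUT_USD_PER_MTOK
    ) / 1_000_000


if __name__ == "__main__":
    # Example: 10 page images and 5,000 tokens of extracted text.
    print(f"{estimate_cost_usd(0, 10, 5000):.6f} USD")
```

Keeping the prices as named constants makes it easy to swap in the correct values if the unlabeled entries turn out to mean something else.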