Last updated: 16/04/2025

Mistral

Mistral OCR

mistral-ocr-latest

active

Introducing the world's best document understanding API, capable of extracting interleaved text and images from a wide range of document formats.

Supports a 32,768-token context window. Handles text, image, video, audio, transcription, and text-to-speech inputs and outputs. Supports fine-tuning for custom applications.

Capable of generating structured output formats.

Additional Information

Notes

Also known as mistral-ocr-2503. Context length: 32,768 tokens.

Model Timeline

Launch Date

3/1/2025

Last Updated

3/1/2025

Capabilities

Text

Input Pricing

$0.70/MTok

Context: 32,768 tokens

Output Pricing

$0.70/MTok

Max tokens: 4,096

Vision Capabilities

Max resolution: 4096x4096
Max images per prompt: 10

Image

Input Pricing

1000 tokens/image
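
The figures above can be combined into a rough per-request cost estimate. The sketch below is illustrative only: the function name and constants are hypothetical, and it assumes (the listing does not say so) that each image is billed as 1,000 input tokens at the text input rate and that only input tokens count against the 32,768-token context.

```python
from decimal import Decimal

# Values copied from the listing above.
INPUT_USD_PER_MTOK = Decimal("0.70")
OUTPUT_USD_PER_MTOK = Decimal("0.70")
TOKENS_PER_IMAGE = 1000
CONTEXT_TOKENS = 32768
MAX_OUTPUT_TOKENS = 4096
MAX_IMAGES_PER_PROMPT = 10


def estimate_cost_usd(text_tokens: int, images: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD (hypothetical helper).

    Assumption: images are converted to TOKENS_PER_IMAGE input tokens each
    and billed at the text input rate.
    """
    if images > MAX_IMAGES_PER_PROMPT:
        raise ValueError("too many images for a single prompt")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the listed max tokens")

    input_tokens = text_tokens + images * TOKENS_PER_IMAGE
    # Assumption: only input tokens are checked against the context window.
    if input_tokens > CONTEXT_TOKENS:
        raise ValueError("input exceeds the listed context window")

    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / Decimal(1_000_000)


# 2,000 text tokens + 3 images (3,000 tokens) in, 1,000 tokens out.
print(estimate_cost_usd(2000, 3, 1000))
```

Decimal is used instead of float so that small per-token amounts are not distorted by binary rounding.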

Embeddings

Embeddings Pricing

$0.0001/1k tokens

Additional Model Information

Tool Use

No

Structured Output

Yes

Reasoning

No
