Last updated: 16/04/2025

Google Vertex AI

Gemini 2.0 Flash Thinking Experimental 01-21

gemini-2.0-flash-thinking-exp-01-21

active

Gemini 2.0 Flash Thinking Experimental 01-21 is an experimental multimodal model that handles a wide range of inputs and outputs, including text, images, video, and audio. With a 1,048,576-token context window, it is designed for advanced language and content generation tasks.

Supports a 1,048,576-token context window. Handles text, image, video, audio, transcription, and text-to-speech inputs and outputs. Supports fine-tuning for custom applications.

Model Timeline

Launch Date

21/01/2025

Capabilities

Text

Input Pricing

$0.15/MTok

Context: 1,048,576 tokens

Output Pricing

$0.60/MTok

Max tokens: 1,048,576
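
As a rough illustration of how the per-MTok text rates above translate into a per-request cost, here is a minimal Python sketch. It assumes billing is strictly linear in token count, with no tiering, caching discounts, or separate charge for reasoning tokens; none of that is confirmed by this listing. The function name `estimate_text_cost` is hypothetical.

```python
# Rates copied from the listing above (USD per million tokens).
INPUT_USD_PER_MTOK = 0.15
OUTPUT_USD_PER_MTOK = 0.60
TOKENS_PER_MTOK = 1_000_000


def estimate_text_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_text_cost(200_000, 4_000):.4f}")
```

Under these assumptions output tokens cost four times as much as input tokens, so long replies dominate the bill sooner than long prompts do.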

Vision Capabilities

Max resolution: 1920x1080
Max images per prompt: 16

Image

Input Pricing

1290 tokens/image
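
The image figure above is quoted in tokens rather than dollars, so a conversion step is needed to budget for it. The sketch below assumes that image tokens count against the context window and are billed at the text input rate of $0.15/MTok; the listing does not state this explicitly, so treat it as an assumption. The helper names are hypothetical.

```python
# Figures copied from the listing above.
TOKENS_PER_IMAGE = 1290
MAX_IMAGES_PER_PROMPT = 16
# Assumption: image tokens are billed at the text input rate.
INPUT_USD_PER_MTOK = 0.15


def image_input_tokens(image_count: int) -> int:
    """Return the token footprint of the images in one prompt."""
    if not 0 <= image_count <= MAX_IMAGES_PER_PROMPT:
        raise ValueError(f"image_count must be between 0 and {MAX_IMAGES_PER_PROMPT}")
    return image_count * TOKENS_PER_IMAGE


def image_input_cost(image_count: int) -> float:
    """Estimate the USD cost of the images, under the billing assumption above."""
    return image_input_tokens(image_count) * INPUT_USD_PER_MTOK / 1_000_000


if __name__ == "__main__":
    n = MAX_IMAGES_PER_PROMPT
    print(image_input_tokens(n), "tokens,", f"${image_input_cost(n):.6f}")
```

Even at the listed maximum of 16 images, the image tokens take up only a small fraction of the 1,048,576-token context window.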

Video

Input Pricing

$0.05/second

Audio

Input Pricing

$1.50/minute

Generation Pricing

$6.00/minute

Transcription

Transcription Pricing

$3.00/minute
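
The video, audio, and transcription rates above mix per-second and per-minute units. The following sketch normalizes them to a per-second basis so they can be compared and applied to a clip of any length. It assumes cost is prorated linearly by the second; the actual billing granularity (for example, rounding up to whole minutes) is not stated in this listing. The dictionary keys and the function name are hypothetical.

```python
# Rates copied from the listing above, normalized to USD per second.
USD_PER_SECOND = {
    "video_input": 0.05,             # listed as $0.05/second
    "audio_input": 1.50 / 60,        # listed as $1.50/minute
    "audio_generation": 6.00 / 60,   # listed as $6.00/minute
    "transcription": 3.00 / 60,      # listed as $3.00/minute
}


def estimate_media_cost(kind: str, seconds: float) -> float:
    """Estimate USD cost for a clip, assuming linear per-second proration."""
    if kind not in USD_PER_SECOND:
        raise KeyError(f"unknown media kind: {kind!r}")
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    return USD_PER_SECOND[kind] * seconds


if __name__ == "__main__":
    # Compare all four rates on the same 90-second clip.
    for kind in USD_PER_SECOND:
        print(f"{kind}: ${estimate_media_cost(kind, 90):.2f}")
```

Normalizing this way shows that video input at $0.05/second works out to $3.00/minute, the same per-minute figure as the listed transcription rate.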

Embeddings

Embeddings Pricing

$0.000025/1k tokens

Fine-Tuning

Fine-Tuning Pricing

$3.00/MTok training

Additional Model Information

Tool Use

No

Structured Output

No

Reasoning

Yes
