Last updated: 16/04/2025

Google Vertex AIOfficial Docs

Gemini 2.0 Pro Experimental 02-05

gemini-2.0-pro-exp-02-05

active

Gemini 2.0 Pro Experimental 02-05

Gemini 2.0 Pro is an experimental, multimodal AI model capable of handling a wide range of inputs and outputs, including text, images, video, audio, transcription, and text-to-speech. With a large 1048576 token context window, this model is well-suited for complex, long-form tasks and supports fine-tuning, tool use, and structured output generation.

Supports a 1048576 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications. Supports tool use for advanced automation. Capable of generating structured output formats.

Model Timeline

Launch Date

3/25/2025

Capabilities

Text

Input Pricing

$1.25/ MTok

Context: 1,048,576 tokens
Long context: 200,000 + @ 2.5x

Output Pricing

$10.00/ MTok

Image

Input Pricing

1290 tokens/image

Resolutions:
  • 1024x1024: $1290.00

Video

Input Pricing

$0.08/second

Audio

Input Pricing

$ 0.50 /minute

Generation Pricing

$2.00 /minute

Transcription

Transcription Pricing

$1.50/minute

Embeddings

Embeddings Pricing

$0.000025/1k tokens

Additional Model Information

Tool Use

Yes

Structured Output

Yes

Reasoning

Yes

Flatten your repo for AI in seconds

Flatten repos. Prompt faster. One click → one GPT-ready file

Free Online & Desktop