Last updated: 16/04/2025

Google Vertex AI (Official Docs)

Gemini 1.5 Pro 002

gemini-1.5-pro-002

active

Gemini 1.5 Pro 002 is a stable, mid-size multimodal model that accepts text, image, video, and audio input and generates text output. Its context window of up to 2 million tokens lets it process long inputs in a single request.

Supports a 2,000,000-token context window. Accepts text, image, video, and audio inputs. Supports fine-tuning for custom applications. Supports tool use for automation. Can generate structured output.

Model Timeline

Launch Date

9/1/2024

Capabilities

Text

Input Pricing

$0.0003125/KTok

Context: 2,000,000 tokens
Long context: 128,000 + @ x

Output Pricing

$0.00125/KTok
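
The per-KTok rates above can be turned into a rough per-request estimate. The following is a minimal sketch, assuming "KTok" means 1,000 tokens and that the listed rates apply only up to the 128,000-token long-context threshold. The long-context rate is not given on this page, so the sketch declines to price prompts above that threshold. The function name `estimate_text_cost` is hypothetical.

```python
from decimal import Decimal

# Rates copied from the listing above (USD per 1,000 tokens).
INPUT_USD_PER_KTOK = Decimal("0.0003125")
OUTPUT_USD_PER_KTOK = Decimal("0.00125")

# Prompts above this size are billed at a long-context rate
# that this page does not state (assumption: base rate applies at or below it).
LONG_CONTEXT_THRESHOLD = 128_000


def estimate_text_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return an estimated USD cost for one text request at the base rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > LONG_CONTEXT_THRESHOLD:
        raise ValueError("long-context rate is not specified; cannot estimate")
    total_per_ktok = (
        Decimal(input_tokens) * INPUT_USD_PER_KTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_KTOK
    )
    return total_per_ktok / 1000


if __name__ == "__main__":
    # 100,000 prompt tokens and 2,000 generated tokens.
    print(estimate_text_cost(100_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding error.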

Vision Capabilities

Image

Input Pricing

$0.00032875/image

Video

Input Pricing

$0.00032875/second

Audio

Input Pricing

$0.001875/minute

Generation Pricing

Not available
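
The image, video, and audio input rates above combine additively for a mixed request. The following is a minimal sketch, assuming audio is billed pro rata by the second (the page only gives a per-minute rate) and that no long-context adjustment applies. The function name `estimate_media_cost` is hypothetical.

```python
from decimal import Decimal

# Rates copied from the listing above (USD).
IMAGE_USD_EACH = Decimal("0.00032875")
VIDEO_USD_PER_SECOND = Decimal("0.00032875")
AUDIO_USD_PER_MINUTE = Decimal("0.001875")


def estimate_media_cost(
    images: int = 0, video_seconds: int = 0, audio_seconds: int = 0
) -> Decimal:
    """Return an estimated USD input cost for the non-text parts of a request."""
    if min(images, video_seconds, audio_seconds) < 0:
        raise ValueError("quantities must be non-negative")
    # Assumption: audio is prorated, so 90 seconds counts as 1.5 minutes.
    audio_minutes = Decimal(audio_seconds) / 60
    return (
        images * IMAGE_USD_EACH
        + video_seconds * VIDEO_USD_PER_SECOND
        + audio_minutes * AUDIO_USD_PER_MINUTE
    )


if __name__ == "__main__":
    # Ten images, one minute of video, two minutes of audio.
    print(estimate_media_cost(images=10, video_seconds=60, audio_seconds=120))
```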

Fine-Tuning

Fine-Tuning Pricing

$80.00/MTok training
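
The training rate above can be used to budget a tuning job. The following is a minimal sketch, assuming "MTok" means 1,000,000 tokens and that billed training tokens equal the dataset token count multiplied by the number of epochs. That multiplication is an assumption, not something this page states. The function name `estimate_tuning_cost` is hypothetical.

```python
from decimal import Decimal

# Rate copied from the listing above (USD per 1,000,000 training tokens).
TRAINING_USD_PER_MTOK = Decimal("80.00")


def estimate_tuning_cost(dataset_tokens: int, epochs: int = 1) -> Decimal:
    """Return an estimated USD cost for one fine-tuning run."""
    if dataset_tokens < 0 or epochs < 1:
        raise ValueError("dataset_tokens must be >= 0 and epochs must be >= 1")
    # Assumption: every epoch re-processes the full dataset and is billed.
    billed_tokens = Decimal(dataset_tokens) * epochs
    return billed_tokens * TRAINING_USD_PER_MTOK / 1_000_000


if __name__ == "__main__":
    # A 5,000,000-token dataset trained for 3 epochs.
    print(estimate_tuning_cost(5_000_000, epochs=3))
```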

Additional Model Information

Tool Use

Yes

Structured Output

Yes

Reasoning

Yes
