Last updated: 17/03/2025

Google Vertex AIOfficial Docs

Gemini 1.0 Pro Vision

gemini-1.0-pro-vision

deprecated

## Gemini 1.0 Pro Vision The original Gemini 1.0 Pro Vision model was optimized for image understanding, providing a powerful multimodal AI solution for a variety of applications. This model supports a 12,288 token context window and handles inputs and outputs across Text, Image, Video, Audio, Transcription, and Text-to-Speech capabilities. Gemini 1.0 Pro Vision was deprecated on July 12, 2024, so users are encouraged to migrate to a newer Gemini version for the latest features and performance. Supports a 12,288 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.

Model Timeline

Launch Date

2/15/2024

Knowledge Cutoff

2/1/2023

Marked Deprecated

7/12/2024

Capabilities

Text

Input Pricing

$0.0025/ KTok

Context: 12,288 tokens

Output Pricing

$0.0025/ KTok

Max tokens: 2,048

Vision Capabilities

Max resolution: 1024x1024
Max images per prompt: 16

Image

Input Pricing

Per image pricing not available

Video

Input Pricing

$0.002/second

Embeddings

Embeddings Pricing

$0.000025/1k tokens

Additional Model Information

Tool Use

No

Structured Output

No

Reasoning

Yes