Last updated: 17/03/2025

Google Vertex AIOfficial Docs

Gemini 2.0 Flash-Lite

gemini-2.0-flash-lite-001

active

## Gemini 2.0 Flash-Lite Gemini 2.0 Flash-Lite is a cost-effective, high-throughput version of the Gemini 2.0 model, designed to support a wide range of multimodal applications including text, image, video, and audio processing, transcription, and text-to-speech. Supports a 1,048,576 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications. Supports tool use for advanced automation. Capable of generating structured output formats.

Additional Information

Notes

This is the cost-effective version of Gemini 2.0 Flash, designed to support high throughput. It has a context length of 1,048,576 tokens and supports multimodal inputs including text, images, video, audio, and PDF. The model supports function calling, batch prediction, and system instructions.

Model Timeline

Launch Date

2/25/2025

Last Updated

2/25/2025

Knowledge Cutoff

1/1/2025

Capabilities

Text

Input Pricing

$-/ KTok

Context: 1,048,576 tokens

Output Pricing

$-/ KTok

Vision Capabilities

Image

Input Pricing

$0.00002 /image

Audio

Input Pricing

$ 0.07 /minute

Generation Pricing

Not available

Embeddings

Embeddings Pricing

$0.0002/1k tokens

Additional Model Information

Tool Use

Yes

Structured Output

Yes

Reasoning

Yes