Last updated: 16/04/2025

Google Vertex AI

Gemma 3 12B

gemma-3-12b-it

active

Gemma 3 is Google's latest open model, with a 32,768-token context window and support for text, image, video, audio, transcription, and text-to-speech inputs and outputs. It supports over 140 languages, along with fine-tuning, tool use, and structured output.

Supports a 32,768-token context window.
Handles text, image, video, audio, transcription, and text-to-speech inputs and outputs.
Supports fine-tuning for custom applications.
Supports tool use for advanced automation.
Generates structured output formats.

Model Timeline

Launch Date

2/25/2025

Capabilities

Text

Input Pricing

$0.00/KTok

Context: 32,768 tokens

Output Pricing

$0.00/KTok

Max tokens: 8,192
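The two text limits above can be combined into a simple budget check. The sketch below uses the 32,768-token context and 8,192-token output cap from this listing. It assumes that prompt and output tokens share the same context window, which the listing does not state, and `max_output_budget` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 32_768    # "Context" value from the listing
MAX_OUTPUT_TOKENS = 8_192  # "Max tokens" value from the listing


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output-token request that fits for a given prompt size.

    Assumption (not stated in the listing): the prompt and the generated
    output must fit in the context window together.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the output cap, and never return a negative budget.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))
```

Under that assumption, a 1,000-token prompt can request the full 8,192 output tokens, while a 30,000-token prompt leaves room for 2,768.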

Image

Input Pricing

$0.0015/image

Generation Pricing

$0.04/image

Video

Input Pricing

$0.00002/second

Audio

Input Pricing

$3.00/minute

Generation Pricing

$12.00/minute

Embeddings

Embeddings Pricing

$0.0001/1k tokens

Fine-Tuning

Fine-Tuning Pricing

$3.00/MTok training
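The per-unit rates listed above can be turned into a rough cost estimate. This is a minimal sketch, not a billing tool: the rates are copied from this listing as shown, while the dictionary keys and the `estimate_cost` function are hypothetical names chosen for illustration. `Decimal` is used so that small per-unit prices do not pick up floating-point rounding noise.

```python
from decimal import Decimal

# Rates copied from this listing; the keys are hypothetical unit names.
RATES = {
    "text_input_ktok": Decimal("0.00"),
    "text_output_ktok": Decimal("0.00"),
    "image_input": Decimal("0.0015"),
    "image_generation": Decimal("0.04"),
    "video_input_second": Decimal("0.00002"),
    "audio_input_minute": Decimal("3.00"),
    "audio_generation_minute": Decimal("12.00"),
    "embedding_ktok": Decimal("0.0001"),
    "fine_tuning_mtok": Decimal("3.00"),
}


def estimate_cost(usage: dict) -> Decimal:
    """Sum rate * quantity over every billing unit in `usage` (USD)."""
    total = Decimal("0")
    for unit, quantity in usage.items():
        if unit not in RATES:
            raise KeyError(f"unknown billing unit: {unit}")
        if quantity < 0:
            raise ValueError(f"negative quantity for {unit}")
        # Convert via str() so floats such as 1.5 become exact decimals.
        total += RATES[unit] * Decimal(str(quantity))
    return total


if __name__ == "__main__":
    usage = {"image_input": 200, "video_input_second": 600, "embedding_ktok": 50}
    print(f"Estimated cost: ${estimate_cost(usage)}")
```

At these rates, 200 input images, 600 seconds of video input, and 50k embedding tokens come to $0.317, and 2 MTok of fine-tuning comes to $6.00.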

Additional Model Information

Tool Use

Yes

Structured Output

Yes

Reasoning

Yes
