Groq Official Docs

Llama-4-Maverick-17B-128E-Instruct

meta-llama/llama-4-maverick-17b-128e-instruct

active

Llama-4-Maverick-17B-128E-Instruct

Llama-4-Maverick is a powerful 17B parameter model with 128 experts, designed for a wide range of instruction-following tasks. With a large 131,072 token context window and the ability to generate up to 8,192 tokens in a single completion, this model is well-suited for complex, multi-step prompts.

Supports a 131,072 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.

Capabilities

Text

Input Pricing

$0.20/ MTok

Context: 131,072 tokens

Output Pricing

$0.60/ MTok

Max tokens: 8,192

Image

Input Pricing

6400 tokens/image

Text-to-Speech

Text-to-Speech Pricing

$0.05/1k characters

Embeddings

Embeddings Pricing

$0.18/1k tokens

Groq Official Docs

Llama-4-Maverick-17B-128E-Instruct

Llama-4-Maverick-17B-128E-Instruct

Capabilities

Text

Input Pricing

Output Pricing

Image

Input Pricing

Text-to-Speech

Text-to-Speech Pricing

Embeddings

Embeddings Pricing

Anthropic

Cohere

DeepSeek

Google Vertex AI

Groq

Mistral

OpenAI

X.AI

Capabilities

Text

Input Pricing

Output Pricing

Image

Input Pricing

Text-to-Speech

Text-to-Speech Pricing

Embeddings

Embeddings Pricing

Flatten your repo for AI in seconds

Anthropic

Cohere

DeepSeek

Google Vertex AI

Groq

Mistral

OpenAI

X.AI