Last updated: 16/04/2025

GroqOfficial Docs

Distil-Whisper Large v3 English

distil-whisper-large-v3-en

active

Distil-Whisper Large v3 English

Distil-Whisper is a high-performance, distilled version of OpenAI's Whisper model for automatic speech recognition (ASR), optimized for fast and accurate English language transcription.

Supports a 448 token context window. Handles Text, Image, Video, Audio, Transcription, Text-to-Speech inputs and outputs. Supports fine-tuning for custom applications.

Additional Information

Notes

This is an ASR (Automatic Speech Recognition) model with a 250x speed factor. It costs $0.02 per hour transcribed with a minimum charge of 10 seconds per request. Maximum file size is 25 MB.

Capabilities

Text

Input Pricing

$0.02/ second

Context: 448 tokens

Video

Input Pricing

$0.000005556/second

Audio

Input Pricing

$ 0.0003333333333333333 /minute

Generation Pricing

Not available

Transcription

Transcription Pricing

$0.0003333333333333333/minute

Embeddings

Embeddings Pricing

$0.0001/1k tokens

Flatten your repo for AI in seconds

Flatten repos. Prompt faster. One click → one GPT-ready file

Free Online & Desktop