Last updated: 16/04/2025

CohereOfficial Docs

Embed Multilingual V3.0 Image

embed-multilingual-v3.0-image

active

Embed Multilingual V3.0 Image

Embed is the leading multimodal embedding model, acting as an intelligent retrieval engine for semantic search and retrieval-augmented generation (RAG) systems.

Handles Image inputs and outputs.

Additional Information

Notes

Embeddings perform best when the text to be embedded is less than 512 tokens. You can create up to 96 text embeddings per API call. You can create 1 image embedding per API call.

Capabilities

Image

Input Pricing

$0.0001 /image

Embeddings

Embeddings Pricing

$0.10/1k tokens

Flatten your repo for AI in seconds

Flatten repos. Prompt faster. One click → one GPT-ready file

Free Online & Desktop