Multimodal Embedding 001 is Google's first multimodal embedding model. We currently support mapping text and images into a unified vector space for semantic search and retrieval-augmented generation (RAG).
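As a rough illustration of what a unified vector space allows, the sketch below ranks a mix of image and text items against a text query by cosine similarity. It does not call the model: the three-dimensional vectors, the item IDs, and the helper names `cosine_similarity` and `rank_by_similarity` are all hypothetical stand-ins, and real embeddings would come from the model and have many more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, index):
    """Return (item_id, score) pairs sorted from most to least similar."""
    scored = [
        (item_id, cosine_similarity(query_vec, vec))
        for item_id, vec in index.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical hand-written vectors standing in for model output.
# Text and image items share one index because they share one space.
index = {
    "image:cat_photo": [0.9, 0.1, 0.0],
    "text:dog_article": [0.1, 0.9, 0.1],
    "image:car_photo": [0.0, 0.1, 0.9],
}

# Stand-in for the embedding of a text query such as "a cat".
query_vec = [0.8, 0.2, 0.1]

ranking = rank_by_similarity(query_vec, index)
print(ranking[0][0])  # prints "image:cat_photo"
```

In a RAG setup, the top-ranked items would then be passed to a generative model as context. At larger scale, a vector database or approximate nearest-neighbor index would typically replace the linear scan used here.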
Headline benchmark standings and comparison context.
Benchmark updates coming soon
Multimodal Embedding 001 is not fully available on the API yet. Benchmark results will be published here as soon as rollout is complete. Please check back soon.