Google: Multimodal Embedding 001

Multimodal Embedding 001 is Google's first multimodal embedding model. We currently support mapping text and images into a unified vector space for semantic search and retrieval-augmented generation (RAG).

Overview Playground Providers Pricing Performance Apps Activity Quickstart Benchmarks Family Timeline

AI Stats by Phaseo brings together model, provider, and gateway data for teams building with AI APIs.

Read the docs

Check status

View GitHub

Explore

Models
Playground
Compare
Providers
Apps
Rankings
Monitor

Build

Documentation
API Reference
Quickstart
SDKs
Methodology

Company

Announcements
Pricing
Works With
Support
Privacy
Terms

Community

Discord
GitHub
Reddit
LinkedIn
X

Spotted a data issue or broken page?Open an issueorcontact support

Models Playground Compare Providers Apps Rankings

Quickstart

Start calling this model with endpoint-specific examples.

Benchmarks

Headline benchmark standings and comparison context.

About

Key dates, capabilities, and model metadata.

Multimodal Embedding 001 Benchmarks - Performance Highlights | AI Stats

Google: Multimodal Embedding 001

Chat Compare

Overview Playground Providers Pricing Performance Apps Activity Quickstart Benchmarks Family Timeline

Benchmarks

Headline benchmark standings and comparison context.

Benchmark updates coming soon

Multimodal Embedding 001 is not fully available on the API yet. Benchmark results will be published here as soon as rollout is complete. Please check back soon.

No benchmark data yet

No benchmark data is available for this model yet.