Multilingual MMLU Benchmark | Phaseo

Multilingual MMLU

Type: percentage, General

Recorded Results

2

Average Score

65%

Score Range

49.30% - 80.70%

Leading Model

80.70% - o3 mini

Scores Over Time

Individual benchmark scores plotted by date.

Models Using This Benchmark

Organisation	Model	Reported	Top Score	Info	Self Reported	Source
OpenAI	o3 mini	30 Jan 2025	80.70%	-	Yes	Source
Microsoft	Phi 4 Mini	01 Feb 2025	49.30%	-	Yes	Source
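
The summary figures above (recorded results, average score, score range, leading model) can be reproduced from the two table rows. The following is a minimal sketch, not the site's actual computation: the `results` list and the `summarize` helper are hypothetical names introduced for illustration, and the average is assumed to be a plain arithmetic mean of the reported scores.

```python
from statistics import mean

# Rows copied from the table above: (organisation, model, score in percent).
# The variable name and tuple layout are assumptions made for this sketch.
results = [
    ("OpenAI", "o3 mini", 80.70),
    ("Microsoft", "Phi 4 Mini", 49.30),
]


def summarize(rows):
    """Return count, mean, min, max, and the top-scoring model for the rows."""
    scores = [score for _, _, score in rows]
    leader = max(rows, key=lambda row: row[2])
    return {
        "count": len(scores),
        "average": round(mean(scores), 2),
        "min": min(scores),
        "max": max(scores),
        "leader": leader[1],
    }


summary = summarize(results)
print(summary)
```

With only two recorded results, the average sits exactly at the midpoint of the score range, so it describes neither model well and is best read together with the individual rows.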