Multilingual MMLU - Benchmark Leaderboard & Model Performance | AI Stats
Multilingual MMLU
Overview
Type: percentage
General
Recorded Results: 2
Average Score: 65%
Score Range: 49.30% - 80.70%
Leading Model: o3 mini (80.70%)
[Chart: Scores Over Time. Individual benchmark scores plotted by date.]
Models Using This Benchmark
Organisation | Model      | Reported    | Top Score | Info | Self Reported | Source
OpenAI       | o3 mini    | 30 Jan 2025 | 80.70%    | -    | Yes           | Source
Microsoft    | Phi 4 Mini | 01 Feb 2025 | 49.30%    | -    | Yes           | Source
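
The overview figures (result count, average, score range, leading model) follow directly from the two recorded results. The sketch below recomputes them from the scores listed on this page. The `results` dictionary and the `summarize` helper are hypothetical names introduced for illustration; they are not part of any AI Stats API, and the page does not say how it computes or rounds its figures.

```python
from statistics import mean

# Scores copied from the results listed on this page, in percent.
# The dictionary layout is an assumption made for this example.
results = {"o3 mini": 80.70, "Phi 4 Mini": 49.30}


def summarize(scores):
    """Return count, average, range, and top-scoring model for a score dict."""
    values = list(scores.values())
    return {
        "count": len(values),
        "average": round(mean(values), 2),  # rounded to two decimals
        "low": min(values),
        "high": max(values),
        "leader": max(scores, key=scores.get),  # model with the highest score
    }


summary = summarize(results)
print(summary)
```

With only two self-reported results, the average is simply the midpoint of the range, so it says little about how models in general perform on this benchmark.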