Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
MultiChallenge (o3-mini grader) - Benchmark Leaderboard & Model Performance | AI Stats
MultiChallenge (o3-mini grader)
Overview
Overview
Type: percentage
Language
Recorded Results
3
Average Score
46.73%
Score Range
39.90% - 50.20%
Leading Model
50.20% - o3 mini
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
OpenAI
o3 mini
30 Jan 2025
50.20%
-
Yes
Source
OpenAI
GPT 4.5
27 Feb 2025
50.10%
-
Yes
Source
OpenAI
GPT 4o (2024-08-06)
06 Aug 2024
39.90%
-
Yes
Source