Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Claude Opus 4.8 | 28 May 2026 | 96.70% | Max effort | Yes | Source | |
| Claude Sonnet 5 | 30 Jun 2026 | 79.50% | High effort; 300k token limit; average over ten attempts per problem | Yes | Source |