Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Claude Sonnet 4.6 | 17 Feb 2026 | 63.30% | Max thinking; Vals AI | Yes | Source | |
| GPT 5.4 Pro | 05 Mar 2026 | 61.50% | - | Yes | Source | |
| Claude Opus 4.6 | 05 Feb 2026 | 60.70% | - | Yes | Source | |
| GPT 5.4 | 05 Mar 2026 | 56% | - | Yes | Source | |
| GPT 5 Pro | 07 Aug 2025 | 56% | inferred family alias from gpt-5.4 (score=0.4083; benches=19) | Yes | Source | |
| GPT 5 Search API | 14 Oct 2025 | 56% | inferred family alias from gpt-5.4 (score=0.3050; benches=19) | Yes | Source |