Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| DeepSeek V4 Pro | 24 Apr 2026 | 95.20% | Best reported mode. Non-think: 31.7, High: 94.0, Max: 95.2 | Yes | Source | |
| DeepSeek V4 Flash | 24 Apr 2026 | 94.80% | Best reported mode. Non-think: 40.8, High: 91.9, Max: 94.8 | Yes | Source |