Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Qwen 3 VL 235B A22B Thinking | - | 80.60% | - | Yes | - | |
| Qwen 3 VL 235B A22B Instruct | - | 78.70% | - | Yes | - | |
| Qwen 2 VL 72B | - | 64.50% | - | Yes | Source | |
| Qwen 2 VL 2B | - | 64.50% | inferred family alias from qwen2-vl-72b (score=0.4083; benches=15) | Yes | Source |