Global PIQA Benchmark

Global PIQA Benchmark | Phaseo

Models Using This Benchmark

Organisation	Model	Reported	Top Score	Info	Self Reported	Source
Google	Gemini 3 Pro Image Preview (Nano Banana Pro)	20 Nov 2025	93.40%	inferred modality/version alias from gemini-3-pro-preview	Yes	Source
Google	Gemini 3 Pro Preview	18 Nov 2025	93.40%	Commonsense reasoning across 100 Languages and Cultures	Yes	Source
Google	Gemini 3 Flash Preview	17 Dec 2025	92.80%	Commonsense reasoning across 100 Languages and Cultures	Yes	Source
ByteDance	Seed 2.0 Pro	14 Feb 2026	92.30%	Seed2 official benchmark table \| Global PIQA	Yes	Source
ByteDance	Seed 2.0 Lite	14 Feb 2026	92.10%	Seed2 official benchmark table \| Global PIQA	Yes	Source
Qwen	Qwen 3.5 397B A17B	16 Feb 2026	89.80%	-	Yes	Source
Qwen	Qwen 3.6 Plus	01 Apr 2026	89.80%	-	Yes	Source
ByteDance	Seed 2.0 Mini	14 Feb 2026	89.20%	Seed2 official benchmark table \| Global PIQA	Yes	Source
Qwen	Qwen 3.5 122B A10B	24 Feb 2026	88.40%	-	Yes	Source
Qwen	Qwen 3.5 27B	24 Feb 2026	87.50%	-	Yes	Source
Qwen	Qwen 3.5 Flash	23 Feb 2026	87.50%	inferred family alias from qwen3.5-27b (score=0.4147; benches=81)	Yes	Source
Qwen	Qwen 3.5 35B A3B	24 Feb 2026	86.60%	-	Yes	Source
Qwen	Qwen 3.5 9B	02 Mar 2026	83.20%	-	Yes	Source
Qwen	Qwen 3.5 4B	02 Mar 2026	78.90%	-	Yes	Source
Qwen	Qwen 3.5 2B	02 Mar 2026	69.30%	-	Yes	Source
Qwen	Qwen 3.5 0.8B	02 Mar 2026	59.40%	-	Yes	Source

Global PIQA