TAU3-Bench - Benchmark Leaderboard & Model Performance | AI Stats
TAU3-Bench
Overview
Type: percentage
Agents
Recorded Results: 2
Average Score: 81.05%
Score Range: 70.70% - 91.40%
Leading Model: Mistral Medium 3.5 (91.40%)
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation  Model               Reported     Top Score  Info     Self Reported  Source
Mistral       Mistral Medium 3.5  29 Apr 2026  91.40%     Telecom  Yes            Source
Qwen          Qwen 3.6 Plus       01 Apr 2026  70.70%     -        Yes            Source
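
The overview figures can be cross-checked against the two recorded scores. The sketch below assumes the average is a plain arithmetic mean of each model's top score; the page does not state how it is computed, though this assumption reproduces the listed 81.05%. The names `scores` and `summarize` are illustrative and not part of any AI Stats API.

```python
from statistics import mean

# Top scores (in percent) copied from the leaderboard rows.
scores = {
    "Mistral Medium 3.5": 91.40,
    "Qwen 3.6 Plus": 70.70,
}


def summarize(scores):
    """Recompute the overview figures from a model -> score mapping.

    Assumption: the average is an unweighted arithmetic mean.
    """
    values = list(scores.values())
    return {
        "recorded_results": len(values),
        "average": round(mean(values), 2),
        "range": (min(values), max(values)),
        "leader": max(scores, key=scores.get),
    }


summary = summarize(scores)
print(f"Recorded Results: {summary['recorded_results']}")
print(f"Average Score: {summary['average']:.2f}%")
print(f"Score Range: {summary['range'][0]:.2f}% - {summary['range'][1]:.2f}%")
print(f"Leading Model: {summary['leader']}")
```

With only two results, the average is simply the midpoint of the score range, so it will shift noticeably as more models are added.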