Enjoying AI Stats?Support us
US
Nvidia

Llama 3.1 Nemotron Ultra 253B v1

Nvidia
Overview

No description provided. Want to help? Contribute on GitHub or click here!

Quick Links
Key Metrics
Max Input
131,072
Tokens
Max Output
131,072
Tokens
Throughput
-
tok/s
Latency
-
ms
Input Price
-
Per 1M Tokens
Cached Input Price
-
Per 1M Tokens
Output Price
-
Per 1M Tokens
Blended Price
-
Per 1M Tokens
Model Information
Release Details

Released

07 Apr 2025

Knowledge Cutoff

Dec 2023

License

Llama 3.1 Community License

Model Architecture

Parameters

-

Training Data

-

Context Window

Input Context Length

131,072 tokens

Output Context Length

131,072 tokens

Key Features
Web Access
No

Real-time access to current web information

Multimodal
No

Ability to process multiple data types (text, images, etc.)

Reasoning
Unknown

Advanced logical and deductive reasoning capabilities

Fine-Tunable
Unknown

Can be customized for specific use cases

Model Release & Updates
07 Apr 2025
Model Released
Model first made available to the public
Benchmarks & Performance Comparison
AI Stats Score
1842.60
#40/191
AIME 2025
72.50%
#16/27
GPQA Diamond
76.00%
#23/94
LMArena Text
1326.00
#33/76