Enjoying AI Stats?Support us

Llama 3.1 Nemotron Ultra 253B v1

Nvidia

Overview

No description provided. Want to help? Contribute on GitHub or click here!

Quick Links

Suggest Edits Announcement Paper Playground Weights

Key Metrics

Max Input

131,072

Tokens

Max Output

131,072

Tokens

Throughput

tok/s

Latency

Input Price

Per 1M Tokens

Cached Input Price

Per 1M Tokens

Output Price

Per 1M Tokens

Blended Price

Per 1M Tokens

Model Information

Release Details

Released

07 Apr 2025

Knowledge Cutoff

Dec 2023

License

Llama 3.1 Community License

Model Architecture

Parameters

Training Data

Context Window

Input Context Length

131,072 tokens

Output Context Length

131,072 tokens

Key Features

Web Access

Real-time access to current web information

Multimodal

Ability to process multiple data types (text, images, etc.)

Reasoning

Unknown

Advanced logical and deductive reasoning capabilities

Fine-Tunable

Unknown

Can be customized for specific use cases

Model Release & Updates

07 Apr 2025

Model Released

Model first made available to the public

Benchmarks & Performance Comparison

AI Stats Score

1842.60

#40/191

AIME 2025

72.50%

#16/27

GPQA Diamond

76.00%

#23/94

LMArena Text

1326.00

#33/76