Enjoying AI Stats?Support us
Llama 3.1 405B Instruct
Overview
No description provided. Want to help? Contribute on GitHub or click here!
Quick Links
Key Metrics
Max Input
32,768
Tokens
Max Output
8,192
Tokens
Throughput
-
tok/s
Latency
-
ms
Input Price
$0.8
Per 1M Tokens
Cached Input Price
-
Per 1M Tokens
Output Price
$0.8
Per 1M Tokens
Blended Price
$0.8
Per 1M Tokens
Model Information
Release Details
Released
23 Jul 2024
Knowledge Cutoff
-
License
-
Model Architecture
Parameters
-
Training Data
-
Context Window
Input Context Length
32,768 tokens
Output Context Length
8,192 tokens
Key Features
Web Access
Unknown
Real-time access to current web information
Multimodal
No
Ability to process multiple data types (text, images, etc.)
Reasoning
Unknown
Advanced logical and deductive reasoning capabilities
Fine-Tunable
Unknown
Can be customized for specific use cases
Model Release & Updates
23 Jul 2024
Model Released
Model first made available to the public
Benchmarks & Performance Comparison