Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
DeepInfra Models - Ordered by Date Added | AI Stats
DeepInfra
Overview
Models
Models
Filter Parameters
63 models
DeepInfra: DeepSeek R1 (2025-05-28)
deepseek/deepseek-r1-2025-05-28
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.5
/ 1M tokens
Output Text Tokens:
$2.15
/ 1M tokens
Cached Read Text Tokens:
$0.35
/ 1M tokens
DeepInfra: DeepSeek V4 Flash
deepseek/deepseek-v4-flash
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.14
/ 1M tokens
Output Text Tokens:
$0.28
/ 1M tokens
Cached Read Text Tokens:
$0.028
/ 1M tokens
DeepInfra: DeepSeek V4 Pro
deepseek/deepseek-v4-pro
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1.74
/ 1M tokens
Output Text Tokens:
$3.48
/ 1M tokens
Cached Read Text Tokens:
$0.145
/ 1M tokens
DeepInfra: GLM 5.1
z-ai/glm-5.1
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1.05
/ 1M tokens
Output Text Tokens:
$3.5
/ 1M tokens
Cached Read Text Tokens:
$0.205
/ 1M tokens
DeepInfra: Kimi K2.6
moonshotai/kimi-k2.6
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.75
/ 1M tokens
Output Text Tokens:
$3.5
/ 1M tokens
Cached Read Text Tokens:
$0.15
/ 1M tokens
DeepInfra: Llama 3.1 Nemotron 70B Instruct
nvidia/llama-3.1-nemotron-70b-instruct
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1.2
/ 1M tokens
Output Text Tokens:
$1.2
/ 1M tokens
DeepInfra: Llama 3.2 11B Vision Instruct
meta/llama-3.2-11b-vision
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.245
/ 1M tokens
Output Text Tokens:
$0.245
/ 1M tokens
DeepInfra: Llama 3.3 70B Instruct
meta/llama-3.3-70b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.32
/ 1M tokens
DeepInfra: Qwen 3.6 35B A3B
qwen/qwen3.6-35b-a3b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$1
/ 1M tokens
DeepInfra: Seed 2.0 Pro
bytedance/seed-2.0-pro
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.5
/ 1M tokens
Output Text Tokens:
$3
/ 1M tokens
Cached Read Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Nemotron 3 Nano Omni 30B A3B Reasoning
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning
Modalities
Input
Audio
Image
Text
Video
Output
Text
Supported Parameters
-
Pricing
Input Tokens:
$0.2
/ 1M tokens
Output Tokens:
$0.8
/ 1M tokens
DeepInfra: DeepSeek V3
deepseek/deepseek-v3
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.32
/ 1M tokens
Output Text Tokens:
$0.89
/ 1M tokens
DeepInfra: Llama Guard 4 12b
meta/llama-guard-4-12b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.18
/ 1M tokens
Output Text Tokens:
$0.18
/ 1M tokens
DeepInfra: Mistral Nemo 2407
mistral/mistral-nemo-2407
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.02
/ 1M tokens
Output Text Tokens:
$0.04
/ 1M tokens
DeepInfra: Mistral Small 24b 2501
mistral/mistral-small-24b-2501
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.05
/ 1M tokens
Output Text Tokens:
$0.08
/ 1M tokens
DeepInfra: Nvidia Nemotron Nano 12b V2 VL
nvidia/nvidia-nemotron-nano-12b-v2-vl
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Qwen3 Max
qwen/qwen3-max
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1.2
/ 1M tokens
Output Text Tokens:
$6
/ 1M tokens
Cached Read Text Tokens:
$0.24
/ 1M tokens
DeepInfra: DeepSeek V3 (2025-03-24)
deepseek/deepseek-v3-0324
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.77
/ 1M tokens
Cached Read Text Tokens:
$0.135
/ 1M tokens
DeepInfra: DeepSeek V3.1
deepseek/deepseek-v3.1
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.21
/ 1M tokens
Output Text Tokens:
$0.79
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.21
/ 1M tokens
Output Text Tokens:
$0.79
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: DeepSeek V3.2
deepseek/deepseek-v3.2
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.26
/ 1M tokens
Output Text Tokens:
$0.38
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: GLM 4.6
z-ai/glm-4.6
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.43
/ 1M tokens
Output Text Tokens:
$1.74
/ 1M tokens
Cached Read Text Tokens:
$0.08
/ 1M tokens
DeepInfra: GLM 4.7
z-ai/glm-4.7
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.4
/ 1M tokens
Output Text Tokens:
$1.75
/ 1M tokens
Cached Read Text Tokens:
$0.08
/ 1M tokens
DeepInfra: GLM 4.7 Flash
z-ai/glm-4.7-flash
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.06
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
Cached Read Text Tokens:
$0.01
/ 1M tokens
DeepInfra: GLM 5
z-ai/glm-5
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.6
/ 1M tokens
Output Text Tokens:
$2.08
/ 1M tokens
Cached Read Text Tokens:
$0.12
/ 1M tokens
DeepInfra: Hermes 3 Llama 3.1 405B
nous/hermes-3-llama-3.1-405b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1
/ 1M tokens
Output Text Tokens:
$1
/ 1M tokens
DeepInfra: Hermes 3 Llama 3.1 70B
nousresearch/hermes-3-llama-3.1-70b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.3
/ 1M tokens
Output Text Tokens:
$0.3
/ 1M tokens
DeepInfra: Kimi K2.5
moonshotai/kimi-k2.5
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.45
/ 1M tokens
Output Text Tokens:
$2.25
/ 1M tokens
Cached Read Text Tokens:
$0.07
/ 1M tokens
DeepInfra: Llama 3 8B Instruct
meta/llama-3-8b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.04
/ 1M tokens
DeepInfra: Llama 3.1 70B Instruct
meta/llama-3.1-70b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.4
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
DeepInfra: Llama 3.1 8B Instruct
meta/llama-3.1-8b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.02
/ 1M tokens
Output Text Tokens:
$0.03
/ 1M tokens
DeepInfra: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
DeepInfra: Llama 4 Maverick
meta/llama-4-maverick
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.15
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Llama 4 Scout
meta/llama-4-scout
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.08
/ 1M tokens
Output Text Tokens:
$0.3
/ 1M tokens
DeepInfra: MiniMax M2.5
minimax/minimax-m2.5
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.15
/ 1M tokens
Output Text Tokens:
$1.15
/ 1M tokens
Cached Read Text Tokens:
$0.03
/ 1M tokens
DeepInfra: Mistral Small 3.2
mistral/mistral-small-3.2
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.075
/ 1M tokens
Output Text Tokens:
$0.2
/ 1M tokens
DeepInfra: Mixtral 8x7b
mistral/mixtral-8x7b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.54
/ 1M tokens
Output Text Tokens:
$0.54
/ 1M tokens
DeepInfra: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.5
/ 1M tokens
DeepInfra: Nemotron Nano 3 30B A3B
nvidia/nemotron-3-nano-30b-a3b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.05
/ 1M tokens
Output Text Tokens:
$0.2
/ 1M tokens
DeepInfra: Nvidia Nemotron Nano 9B V2
nvidia/nvidia-nemotron-nano-9b-v2
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.04
/ 1M tokens
Output Text Tokens:
$0.16
/ 1M tokens
DeepInfra: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Phi 4
microsoft/phi-4
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.07
/ 1M tokens
Output Text Tokens:
$0.14
/ 1M tokens
DeepInfra: Qwen 2.5 72B
qwen/qwen2.5-72b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.36
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
DeepInfra: Qwen 3 14B
qwen/qwen3-14b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.12
/ 1M tokens
Output Text Tokens:
$0.24
/ 1M tokens
DeepInfra: Qwen 3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.23
/ 1M tokens
Output Text Tokens:
$2.3
/ 1M tokens
Cached Read Text Tokens:
$0.2
/ 1M tokens
DeepInfra: Qwen 3 30B A3B
qwen/qwen3-30b-a3b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.08
/ 1M tokens
Output Text Tokens:
$0.28
/ 1M tokens
DeepInfra: Qwen 3 32B
qwen/qwen3-32b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.08
/ 1M tokens
Output Text Tokens:
$0.28
/ 1M tokens
DeepInfra: Qwen 3 A235 A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.071
/ 1M tokens
Output Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3 Coder 480B A35B Instruct
qwen/qwen3-coder-480b-a35b
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.3
/ 1M tokens
Output Text Tokens:
$1
/ 1M tokens
Cached Read Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3 Max Thinking
qwen/qwen3-max-thinking
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$1.2
/ 1M tokens
Output Text Tokens:
$6
/ 1M tokens
Cached Read Text Tokens:
$0.24
/ 1M tokens
DeepInfra: Qwen 3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.09
/ 1M tokens
Output Text Tokens:
$1.1
/ 1M tokens
DeepInfra: Qwen 3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.88
/ 1M tokens
Cached Read Text Tokens:
$0.11
/ 1M tokens
DeepInfra: Qwen 3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.15
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Qwen 3.5 0.8B
qwen/qwen3.5-0.8b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.01
/ 1M tokens
Output Text Tokens:
$0.05
/ 1M tokens
DeepInfra: Qwen 3.5 122B A10B
qwen/qwen3.5-122b-a10b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.29
/ 1M tokens
Output Text Tokens:
$2.9
/ 1M tokens
Cached Read Text Tokens:
$0.145
/ 1M tokens
DeepInfra: Qwen 3.5 27B
qwen/qwen3.5-27b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.26
/ 1M tokens
Output Text Tokens:
$2.6
/ 1M tokens
DeepInfra: Qwen 3.5 2B
qwen/qwen3.5-2b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.02
/ 1M tokens
Output Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3.5 35B A3B
qwen/qwen3.5-35b-a3b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.95
/ 1M tokens
Cached Read Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3.5 397B A17B
qwen/qwen3.5-397b-a17b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.54
/ 1M tokens
Output Text Tokens:
$3.4
/ 1M tokens
Cached Read Text Tokens:
$0.27
/ 1M tokens
DeepInfra: Qwen 3.5 4B
qwen/qwen3.5-4b
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.15
/ 1M tokens
DeepInfra: Seed 1.8
bytedance/seed-1.8
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.25
/ 1M tokens
Output Text Tokens:
$2
/ 1M tokens
Cached Read Text Tokens:
$0.05
/ 1M tokens
DeepInfra: Seed 2.0 Mini
bytedance/seed-2.0-mini
Modalities
Input
Image
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
Cached Read Text Tokens:
$0.02
/ 1M tokens
DeepInfra: Step 3.5 Flash
stepfun/step-3.5-flash
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.3
/ 1M tokens
Cached Read Text Tokens:
$0.02
/ 1M tokens