Models
19 in catalog
Largest context
1M
Nemotron 3 Ultra
Avg context
269K
Cheapest input
$0.040/M
Nvidia Nemotron Nano 9b
Largest output
262K
Nemotron 3 Super (free)
Speed tiers
balanced 7, fast 11, deep 1
Capabilities
What Nvidia models support across the catalog.
Vision
5 / 19
26% of models
Tool use
15 / 19
79% of models
Function calling
15 / 19
79% of models
Extended thinking
14 / 19
74% of models
Streaming
19 / 19
100% of models
Prompt caching
2 / 19
11% of models
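Each share above is the number of supporting models divided by the 19 catalog entries, rounded to the nearest whole percent. A minimal Python sketch of that arithmetic, using the counts listed above (the dictionary and function names are illustrative, not part of any catalog API):

```python
# Counts copied from the capability list above; all names here are illustrative.
TOTAL_MODELS = 19

capability_counts = {
    "Vision": 5,
    "Tool use": 15,
    "Function calling": 15,
    "Extended thinking": 14,
    "Streaming": 19,
    "Prompt caching": 2,
}


def share_percent(count, total=TOTAL_MODELS):
    """Return the share of models supporting a capability as a whole percent."""
    return round(count * 100 / total)


shares = {name: share_percent(n) for name, n in capability_counts.items()}

for name, pct in shares.items():
    print(f"{name}: {capability_counts[name]} / {TOTAL_MODELS} = {pct}% of models")
```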
All Nvidia models
Sorted by context window, largest first.
| Model | Context | Max output | Input $/M | Speed |
|---|---|---|---|---|
| Nemotron 3 Ultra | 1M | 16K | — | balanced |
| Nemotron 3 Ultra (free) | 1M | 66K | — | balanced |
| Nemotron 3 Nano 30B A3B | 262K | — | — | fast |
| Nemotron 3 Super | 262K | — | — | balanced |
| Nemotron 3 Super (free) | 262K | 262K | — | balanced |
| Nvidia Nemotron Nano 3 30b | 262K | 8K | $0.060/M | fast |
| Nemotron 3 120b A12b | 256K | 256K | $0.500/M | balanced |
| Nemotron 3 Nano 30B A3B (free) | 256K | — | — | fast |
| Nemotron 3 Nano Omni (free) | 256K | 66K | — | fast |
| Nvidia Nemotron Super 3 120b | 256K | 33K | $0.150/M | balanced |
| Llama 3.1 Nemotron 70B Instruct | 131K | 16K | — | deep |
| Nemotron Nano 12B 2 VL | 131K | — | — | fast |
| Nemotron Nano 9B V2 | 131K | — | — | fast |
| Nvidia Nemotron Nano 9b | 131K | 131K | $0.040/M | fast |
| Nemotron 3.5 Content Safety (free) | 128K | 8K | — | balanced |
| Nemotron Nano 12B 2 VL (free) | 128K | 128K | — | fast |
| Nemotron Nano 9B V2 (free) | 128K | — | — | fast |
| Nvidia Nemotron Nano 12b | 128K | 8K | $0.200/M | fast |
| Nemotron Nano V2 12b Vl | 4K | 4K | $0.100/M | fast |
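
The summary figures at the top of the page (average context and the speed-tier counts) can be cross-checked against the table rows. The sketch below transcribes the Context and Speed columns by hand and assumes "1M" means 1000K, which the catalog does not state explicitly; the variable names are illustrative.

```python
from collections import Counter

# (context in thousands of tokens, speed tier) for each table row, top to bottom.
# Assumption: "1M" is treated as 1000K; the catalog does not state its rounding.
rows = [
    (1000, "balanced"), (1000, "balanced"),
    (262, "fast"), (262, "balanced"), (262, "balanced"), (262, "fast"),
    (256, "balanced"), (256, "fast"), (256, "fast"), (256, "balanced"),
    (131, "deep"), (131, "fast"), (131, "fast"), (131, "fast"),
    (128, "balanced"), (128, "fast"), (128, "fast"), (128, "fast"),
    (4, "fast"),
]

# Mean context window across all rows, in thousands of tokens.
avg_context_k = round(sum(context for context, _ in rows) / len(rows))

# Number of models in each speed tier.
tier_counts = Counter(tier for _, tier in rows)

print(f"Models: {len(rows)}")
print(f"Avg context: {avg_context_k}K")
print(f"Speed tiers: {dict(tier_counts)}")
```

Under that assumption the result agrees with the 269K average and the 7 / 11 / 1 tier split listed in the summary.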