Available models
| Model ID | Parameters | Context | Best For |
|---|---|---|---|
llama-3.3-70b ⭐ | 70B | 128K | General, code, reasoning |
llama-4-scout 🔥 | 17B (MoE) | 512K | Long documents |
llama-3.1-8b | 8B | 8K | Fast simple tasks |
llama-3.1-8b-cerebras | 8B | 8K | Ultra-fast inference |
Recommendation
Start withllama-3.3-70b — it’s the most popular and highest quality Llama model. Use llama-4-scout when you need 512K context.