How Pricing Works
- Sign up and get $0.50 free credits (no credit card needed)
- Use any model through the same API endpoint
- Pay per token after credits run out
- No monthly fees. No commitments. No hidden costs.
Model Pricing
All prices per 1 million tokens.
| Model | Input | Output | ~Cost / 1K reqs |
|---|
| For the current canonical pricing table, use: | | | |
GET /v1/models
GET /v1/credits/pricing
Common active prices today:
| Model | Input | Output | ~Cost / 1K reqs |
|---|
| gemma-4-31b | $0.19 | $0.54 | $0.20 |
| gpt-oss-120b | $0.20 | $0.81 | $0.26 |
| qwen-3-32b | $0.39 | $0.81 | $0.36 |
| minimax-m2.5 | $0.41 | $1.62 | $0.53 |
| minimax-m2.7 | $0.41 | $1.62 | $0.53 |
| qwen-3.6-plus | $0.44 | $2.63 | $0.75 |
| qwen-3-coder | $0.68 | $2.16 | $0.77 |
| llama-3.3-70b | $1.19 | $1.19 | $0.83 |
| deepseek-v3 | $0.81 | $2.30 | $0.86 |
| gemini-2.5-flash | $0.41 | $3.38 | $0.88 |
| deepseek-r1 | $0.68 | $2.90 | $0.92 |
| kimi-k2.5 | $0.68 | $3.78 | $1.09 |
| gemini-3-flash | $0.68 | $4.05 | $1.15 |
Typical cost per request assumes 500 input + 200 output tokens. Your actual cost depends on message length. Sorted cheapest first.
Cost Examples
| Use Case | Typical Tokens | Model | Cost |
|---|
| Quick chat (1 message) | 200 in / 100 out | gemma-4-31b | ~$0.00009 |
| Code review (1 file) | 2,000 in / 500 out | qwen-3-coder | ~$0.002 |
| Blog article | 500 in / 2,000 out | deepseek-v3 | ~$0.005 |
| Data extraction (1 page) | 1,000 in / 200 out | qwen-3-32b | ~$0.0004 |
| RAG query (large context) | 10,000 in / 500 out | gemini-2.5-flash | ~$0.006 |
Free Credits
- $0.50 on signup (no credit card required)
- Enough for approximately 500 to 3,000 requests depending on model
- Credits never expire
Adding Credits
| Package | Amount |
|---|
| Starter | $5 |
| Growth | $20 |
| Pro | $100 |
| Scale | $500 |
Auto top-up supported (set threshold + amount). Pay via Stripe.
Prompt Caching
Cached input tokens cost 10% of normal price (90% discount). Caching works automatically with compatible providers.
Average cache hit rate across platform: 48%.
Learn more about prompt caching
Next Steps