- Add fireworks-models.ts with static catalog of 24 serverless LLM models - Add dynamic model discovery from Fireworks API with pagination - Integrate into onboarding (interactive + non-interactive flows) - Add fireworks-api-key auth choice and --fireworks-api-key CLI flag - Add FIREWORKS_API_KEY to envMap for auto-detection - Add Fireworks to resolveImplicitProviders - Add docs/providers/fireworks.md documentation - Update docs navigation and CLI reference Model IDs use full format: accounts/fireworks/models/<model> Default model: deepseek-v3p2 (DeepSeek V3.2)
7.9 KiB
| summary | read_when | ||
|---|---|---|---|
| Use Fireworks AI serverless models in Clawdbot |
|
Fireworks AI
Fireworks AI provides fast, cost-effective serverless inference for popular open-source models including DeepSeek, Qwen, Llama, GLM, and more. All models run on optimized infrastructure with low latency and competitive pricing.
Why Fireworks in Clawdbot
- Fast inference with optimized serving infrastructure.
- Wide model selection including DeepSeek V3.2, Qwen3, Llama 3.3, GLM-4.7, and more.
- Serverless — no infrastructure management, pay per token.
- OpenAI-compatible
/v1endpoints.
Features
- Serverless inference: No GPU management, instant scaling
- OpenAI-compatible API: Standard
/v1endpoints for easy integration - Streaming: Supported on all models
- Function calling: Supported on most models
- Vision: Supported on vision-capable models (Qwen VL series)
- Reasoning models: DeepSeek R1, Qwen3 Thinking, Kimi K2 Thinking
Setup
1. Get API Key
- Sign up at fireworks.ai
- Go to Account → API Keys → Create API Key
- Copy your API key
2. Configure Clawdbot
Option A: Environment Variable
export FIREWORKS_API_KEY="fw_xxxxxxxxxxxx"
Option B: Interactive Setup (Recommended)
clawdbot onboard --auth-choice fireworks-api-key
This will:
- Prompt for your API key (or use existing
FIREWORKS_API_KEY) - Discover available Fireworks models via API
- Let you pick your default model
- Configure the provider automatically
Option C: Non-interactive
clawdbot onboard --non-interactive \
--auth-choice fireworks-api-key \
--fireworks-api-key "fw_xxxxxxxxxxxx"
3. Verify Setup
clawdbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2 "Hello, are you working?"
Model Selection
After setup, Clawdbot discovers models from the Fireworks API. Pick based on your needs:
- Default:
deepseek-v3p2(DeepSeek V3.2) — strong reasoning, balanced performance. - Best reasoning:
deepseek-r1-0528orqwen3-235b-a22b-thinking-2507 - Coding:
qwen3-coder-480b-a35b-instruct - Vision:
qwen3-vl-235b-a22b-instructorqwen2p5-vl-32b-instruct
Change your default model anytime:
clawdbot models set fireworks/accounts/fireworks/models/deepseek-v3p2
clawdbot models set fireworks/accounts/fireworks/models/qwen3-235b-a22b-thinking-2507
List all available models:
clawdbot models list | grep fireworks
Which Model Should I Use?
| Use Case | Recommended Model | Why |
|---|---|---|
| General chat | deepseek-v3p2 |
Strong all-around, reasoning support |
| Complex reasoning | deepseek-r1-0528 |
Best for step-by-step reasoning |
| Coding | qwen3-coder-480b-a35b-instruct |
Code-optimized, 262k context |
| Vision tasks | qwen3-vl-235b-a22b-instruct |
Best multimodal capabilities |
| Fast + cheap | qwen3-8b |
Lightweight, low latency |
| Long context | kimi-k2-instruct-0905 |
262k context window |
Available Models (24 Total)
Text Models
| Model ID | Name | Context | Features |
|---|---|---|---|
deepseek-r1-0528 |
DeepSeek R1 05/28 | 163k | Reasoning |
deepseek-v3-0324 |
DeepSeek V3 03-24 | 163k | General |
deepseek-v3p1 |
DeepSeek V3.1 | 163k | General |
deepseek-v3p1-terminus |
DeepSeek V3.1 Terminus | 163k | General |
deepseek-v3p2 |
DeepSeek V3.2 | 163k | Reasoning |
glm-4p6 |
GLM-4.6 | 202k | General |
glm-4p7 |
GLM-4.7 | 202k | Reasoning |
gpt-oss-120b |
OpenAI gpt-oss-120b | 131k | General |
gpt-oss-20b |
OpenAI gpt-oss-20b | 131k | General |
kimi-k2-instruct-0905 |
Kimi K2 Instruct 0905 | 262k | Long context |
kimi-k2-thinking |
Kimi K2 Thinking | 256k | Reasoning |
llama-v3p3-70b-instruct |
Llama 3.3 70B Instruct | 131k | General |
minimax-m2 |
MiniMax-M2 | 196k | General |
minimax-m2p1 |
MiniMax-M2.1 | 204k | General |
qwen3-235b-a22b |
Qwen3 235B A22B | 131k | General |
qwen3-235b-a22b-instruct-2507 |
Qwen3 235B A22B Instruct 2507 | 262k | General |
qwen3-235b-a22b-thinking-2507 |
Qwen3 235B A22B Thinking 2507 | 262k | Reasoning |
qwen3-8b |
Qwen3 8B | 40k | Fast |
qwen3-coder-480b-a35b-instruct |
Qwen3 Coder 480B A35B Instruct | 262k | Coding |
Vision Models
| Model ID | Name | Context | Features |
|---|---|---|---|
qwen2p5-vl-32b-instruct |
Qwen2.5-VL 32B Instruct | 128k | Vision |
qwen3-vl-235b-a22b-instruct |
Qwen3 VL 235B A22B Instruct | 262k | Vision |
qwen3-vl-235b-a22b-thinking |
Qwen3 VL 235B A22B Thinking | 262k | Vision, reasoning |
qwen3-vl-30b-a3b-thinking |
Qwen3 VL 30B A3B Thinking | 262k | Vision, reasoning |
Model Discovery
Clawdbot automatically discovers models from the Fireworks API when FIREWORKS_API_KEY is set. The discovery:
- Fetches all serverless models (
supports_serverless=true) - Filters out deprecated models
- Filters out non-LLM models (image generation)
- Falls back to a static catalog if the API is unreachable
Model IDs
Fireworks model IDs use the full resource path format:
accounts/fireworks/models/<model-name>
When using models in Clawdbot, prefix with the provider:
clawdbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2
Streaming and Tool Support
| Feature | Support |
|---|---|
| Streaming | All models |
| Function calling | Most models (check supportsTools in API) |
| Vision/Images | Vision models only |
| JSON mode | Supported via response_format |
Pricing
Fireworks uses pay-per-token pricing. Check fireworks.ai/pricing for current rates. Generally:
- Smaller models (8B-30B): Lower cost, faster
- Larger models (70B+): Higher quality, higher cost
- MoE models: Cost-effective for their capability
Usage Examples
# Use DeepSeek V3.2 (recommended default)
clawdbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2
# Use reasoning model
clawdbot chat --model fireworks/accounts/fireworks/models/deepseek-r1-0528
# Use coding model
clawdbot chat --model fireworks/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct
# Use vision model
clawdbot chat --model fireworks/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct
Troubleshooting
API key not recognized
echo $FIREWORKS_API_KEY
clawdbot models list | grep fireworks
Ensure the key is valid and has not expired.
Model not available
The Fireworks model catalog updates dynamically. Run clawdbot models list to see currently available models. Some models may be temporarily offline or deprecated.
Connection issues
Fireworks API is at https://api.fireworks.ai. Ensure your network allows HTTPS connections.
Model discovery fails
If model discovery fails, Clawdbot falls back to a static catalog of popular models. Check your API key and network connection.
Config file example
{
env: { FIREWORKS_API_KEY: "fw_..." },
agents: { defaults: { model: { primary: "fireworks/accounts/fireworks/models/deepseek-v3p2" } } },
models: {
mode: "merge",
providers: {
fireworks: {
baseUrl: "https://api.fireworks.ai/inference/v1",
apiKey: "${FIREWORKS_API_KEY}",
api: "openai-completions",
models: [
{
id: "accounts/fireworks/models/deepseek-v3p2",
name: "DeepSeek V3.2",
reasoning: true,
input: ["text"],
cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
contextWindow: 163840,
maxTokens: 8192
}
]
}
}
}
}