- Added Fireworks AI API key handling in onboarding and non-interactive auth choice. - Updated CLI options to include Fireworks API key. - Enhanced provider configuration to support Fireworks models. - Updated documentation to reflect Fireworks AI integration.
7.7 KiB
| summary | read_when | ||
|---|---|---|---|
| Use Fireworks AI serverless models in Moltbot |
|
Fireworks AI
Fireworks AI provides fast, cost-effective serverless inference for popular open-source models including DeepSeek, Qwen, Kimi, Llama, GLM, and more. All models run on optimized infrastructure with low latency and competitive pricing.
Why Fireworks in Moltbot
- Fast inference with optimized serving infrastructure.
- Wide model selection including DeepSeek V3.2, Qwen3, Kimi K2.5, Llama 3.3, GLM-4.7, and more.
- Serverless - no infrastructure management, pay per token.
- OpenAI-compatible
/v1endpoints.
Features
- Serverless inference: No GPU management, instant scaling
- OpenAI-compatible API: Standard
/v1endpoints for easy integration - Streaming: Supported on all models
- Function calling: Supported on most models
- Vision: Supported on vision-capable models (Qwen VL series)
- Reasoning models: DeepSeek R1, Qwen3 Thinking, Kimi K2 Thinking, Kimi K2.5
Setup
1. Get API Key
- Sign up at fireworks.ai
- Go to Settings > API Keys > Create API Key
- Copy your API key (should start with
fw_)
2. Configure Moltbot
Option A: Environment Variable
export FIREWORKS_API_KEY="fw_xxxxxxxxxxxx"
Option B: Interactive Setup (Recommended)
moltbot onboard --auth-choice fireworks-api-key
This will:
- Prompt for your API key (or use existing
FIREWORKS_API_KEY) - Configure the Fireworks provider with available models
- Let you pick your default model
- Set up the provider automatically
Option C: Non-interactive
moltbot onboard --non-interactive \
--auth-choice fireworks-api-key \
--fireworks-api-key "fw_xxxxxxxxxxxx"
3. Verify Setup
moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5 "Hello, are you working?"
Model Selection
Moltbot includes a curated catalog of popular Fireworks models. Pick based on your needs:
- Default:
kimi-k2p5(Kimi K2.5) - vision + reasoning, 262k context. - Best reasoning:
deepseek-r1-0528orqwen3-235b-a22b-thinking-2507 - Coding:
qwen3-coder-480b-a35b-instruct - Vision:
qwen3-vl-235b-a22b-instructorqwen2p5-vl-32b-instruct
Change your default model anytime:
moltbot models set fireworks/accounts/fireworks/models/kimi-k2p5
moltbot models set fireworks/accounts/fireworks/models/deepseek-v3p2
List all available models:
moltbot models list | grep fireworks
Which Model Should I Use?
| Use Case | Recommended Model | Why |
|---|---|---|
| General chat | kimi-k2p5 |
Vision + reasoning, 262k context |
| Complex reasoning | deepseek-r1-0528 |
Best for step-by-step reasoning |
| Agentic tasks | gpt-oss-120b |
Designed for reasoning and agentic use |
| Coding | qwen3-coder-480b-a35b-instruct |
Code-optimized, 262k context |
| Vision tasks | qwen3-vl-235b-a22b-instruct |
Best multimodal capabilities |
| Fast + cheap | qwen3-8b |
Lightweight, low latency |
| Long context | kimi-k2-instruct-0905 |
262k context window |
Available Models (25 Total)
Text Models
| Model ID | Name | Context | Features |
|---|---|---|---|
deepseek-r1-0528 |
Deepseek R1 05/28 | 163k | Reasoning |
deepseek-v3-0324 |
Deepseek V3 03-24 | 163k | General |
deepseek-v3p1 |
DeepSeek V3.1 | 163k | General |
deepseek-v3p1-terminus |
DeepSeek V3.1 Terminus | 163k | General |
deepseek-v3p2 |
Deepseek v3.2 | 163k | General |
glm-4p6 |
GLM-4.6 | 202k | Reasoning |
glm-4p7 |
GLM-4.7 | 202k | Reasoning |
gpt-oss-120b |
OpenAI gpt-oss-120b | 131k | Reasoning |
gpt-oss-20b |
OpenAI gpt-oss-20b | 131k | Reasoning |
kimi-k2-instruct-0905 |
Kimi K2 Instruct 0905 | 262k | Long context |
kimi-k2p5 |
Kimi K2.5 | 262k | Vision, reasoning |
kimi-k2-thinking |
Kimi K2 Thinking | 256k | Reasoning |
llama-v3p3-70b-instruct |
Llama 3.3 70B Instruct | 131k | General |
minimax-m2 |
MiniMax-M2 | 196k | Agentic, coding |
minimax-m2p1 |
MiniMax-M2.1 | 204k | Agentic, coding |
qwen3-235b-a22b |
Qwen3 235B A22B | 131k | General |
qwen3-235b-a22b-instruct-2507 |
Qwen3 235B A22B Instruct 2507 | 262k | General |
qwen3-235b-a22b-thinking-2507 |
Qwen3 235B A22B Thinking 2507 | 262k | Reasoning |
qwen3-8b |
Qwen3 8B | 40k | Fast |
qwen3-coder-480b-a35b-instruct |
Qwen3 Coder 480B A35B Instruct | 262k | Coding |
Vision Models
| Model ID | Name | Context | Features |
|---|---|---|---|
qwen2p5-vl-32b-instruct |
Qwen2.5-VL 32B Instruct | 128k | Vision |
qwen3-vl-235b-a22b-instruct |
Qwen3 VL 235B A22B Instruct | 262k | Vision |
qwen3-vl-235b-a22b-thinking |
Qwen3 VL 235B A22B Thinking | 262k | Vision, reasoning |
qwen3-vl-30b-a3b-instruct |
Qwen3 VL 30B A3B Instruct | 262k | Vision |
qwen3-vl-30b-a3b-thinking |
Qwen3 VL 30B A3B Thinking | 262k | Vision, reasoning |
Model IDs
Fireworks model IDs use the full resource path format:
accounts/fireworks/models/<model-name>
When using models in Moltbot, prefix with the provider:
moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2
Streaming and Tool Support
| Feature | Support |
|---|---|
| Streaming | All models |
| Function calling | Most models (check supportsTools in API) |
| Vision/Images | Vision models only |
| JSON mode | Supported via response_format |
Pricing
Fireworks uses pay-per-token pricing. Check fireworks.ai/pricing for current rates. Generally:
- Smaller models (8B-30B): Lower cost, faster
- Larger models (70B+): Higher quality, higher cost
- MoE models: Cost-effective for their capability
Usage Examples
# Use Kimi K2.5 (recommended default)
moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5
# Use DeepSeek V3.2
moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2
# Use coding model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct
# Use vision model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct
Troubleshooting
API key not recognized
echo $FIREWORKS_API_KEY
moltbot models list | grep fireworks
Ensure the key is valid and has not expired.
Model not available
Run moltbot models list to see currently available models in the catalog. If a model you need is missing, you can add it manually to your config file.
Connection issues
Fireworks API is at https://api.fireworks.ai. Ensure your network allows HTTPS connections.
Config file example
{
env: { FIREWORKS_API_KEY: "fw_..." },
agents: { defaults: { model: { primary: "fireworks/accounts/fireworks/models/kimi-k2p5" } } },
models: {
mode: "merge",
providers: {
fireworks: {
baseUrl: "https://api.fireworks.ai/inference/v1",
apiKey: "${FIREWORKS_API_KEY}",
api: "openai-completions",
models: [
{
id: "accounts/fireworks/models/kimi-k2p5",
name: "Kimi K2.5",
reasoning: true,
input: ["text", "image"],
cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
contextWindow: 262144,
maxTokens: 8192
}
]
}
}
}
}