openclaw/docs/providers/fireworks.md
Taekmin Kim c6acef769a feat(fireworks): integrate Fireworks AI support into onboarding and authentication
- Added Fireworks AI API key handling in onboarding and non-interactive auth choice.
- Updated CLI options to include Fireworks API key.
- Enhanced provider configuration to support Fireworks models.
- Updated documentation to reflect Fireworks AI integration.
2026-01-28 13:32:44 -08:00

7.7 KiB

summary read_when
Use Fireworks AI serverless models in Moltbot
You want fast serverless inference in Moltbot
You want Fireworks AI setup guidance

Fireworks AI

Fireworks AI provides fast, cost-effective serverless inference for popular open-source models including DeepSeek, Qwen, Kimi, Llama, GLM, and more. All models run on optimized infrastructure with low latency and competitive pricing.

Why Fireworks in Moltbot

  • Fast inference with optimized serving infrastructure.
  • Wide model selection including DeepSeek V3.2, Qwen3, Kimi K2.5, Llama 3.3, GLM-4.7, and more.
  • Serverless - no infrastructure management, pay per token.
  • OpenAI-compatible /v1 endpoints.

Features

  • Serverless inference: No GPU management, instant scaling
  • OpenAI-compatible API: Standard /v1 endpoints for easy integration
  • Streaming: Supported on all models
  • Function calling: Supported on most models
  • Vision: Supported on vision-capable models (Qwen VL series)
  • Reasoning models: DeepSeek R1, Qwen3 Thinking, Kimi K2 Thinking, Kimi K2.5

Setup

1. Get API Key

  1. Sign up at fireworks.ai
  2. Go to Settings > API Keys > Create API Key
  3. Copy your API key (should start with fw_)

2. Configure Moltbot

Option A: Environment Variable

export FIREWORKS_API_KEY="fw_xxxxxxxxxxxx"

Option B: Interactive Setup (Recommended)

moltbot onboard --auth-choice fireworks-api-key

This will:

  1. Prompt for your API key (or use existing FIREWORKS_API_KEY)
  2. Configure the Fireworks provider with available models
  3. Let you pick your default model
  4. Set up the provider automatically

Option C: Non-interactive

moltbot onboard --non-interactive \
  --auth-choice fireworks-api-key \
  --fireworks-api-key "fw_xxxxxxxxxxxx"

3. Verify Setup

moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5 "Hello, are you working?"

Model Selection

Moltbot includes a curated catalog of popular Fireworks models. Pick based on your needs:

  • Default: kimi-k2p5 (Kimi K2.5) - vision + reasoning, 262k context.
  • Best reasoning: deepseek-r1-0528 or qwen3-235b-a22b-thinking-2507
  • Coding: qwen3-coder-480b-a35b-instruct
  • Vision: qwen3-vl-235b-a22b-instruct or qwen2p5-vl-32b-instruct

Change your default model anytime:

moltbot models set fireworks/accounts/fireworks/models/kimi-k2p5
moltbot models set fireworks/accounts/fireworks/models/deepseek-v3p2

List all available models:

moltbot models list | grep fireworks

Which Model Should I Use?

Use Case Recommended Model Why
General chat kimi-k2p5 Vision + reasoning, 262k context
Complex reasoning deepseek-r1-0528 Best for step-by-step reasoning
Agentic tasks gpt-oss-120b Designed for reasoning and agentic use
Coding qwen3-coder-480b-a35b-instruct Code-optimized, 262k context
Vision tasks qwen3-vl-235b-a22b-instruct Best multimodal capabilities
Fast + cheap qwen3-8b Lightweight, low latency
Long context kimi-k2-instruct-0905 262k context window

Available Models (25 Total)

Text Models

Model ID Name Context Features
deepseek-r1-0528 Deepseek R1 05/28 163k Reasoning
deepseek-v3-0324 Deepseek V3 03-24 163k General
deepseek-v3p1 DeepSeek V3.1 163k General
deepseek-v3p1-terminus DeepSeek V3.1 Terminus 163k General
deepseek-v3p2 Deepseek v3.2 163k General
glm-4p6 GLM-4.6 202k Reasoning
glm-4p7 GLM-4.7 202k Reasoning
gpt-oss-120b OpenAI gpt-oss-120b 131k Reasoning
gpt-oss-20b OpenAI gpt-oss-20b 131k Reasoning
kimi-k2-instruct-0905 Kimi K2 Instruct 0905 262k Long context
kimi-k2p5 Kimi K2.5 262k Vision, reasoning
kimi-k2-thinking Kimi K2 Thinking 256k Reasoning
llama-v3p3-70b-instruct Llama 3.3 70B Instruct 131k General
minimax-m2 MiniMax-M2 196k Agentic, coding
minimax-m2p1 MiniMax-M2.1 204k Agentic, coding
qwen3-235b-a22b Qwen3 235B A22B 131k General
qwen3-235b-a22b-instruct-2507 Qwen3 235B A22B Instruct 2507 262k General
qwen3-235b-a22b-thinking-2507 Qwen3 235B A22B Thinking 2507 262k Reasoning
qwen3-8b Qwen3 8B 40k Fast
qwen3-coder-480b-a35b-instruct Qwen3 Coder 480B A35B Instruct 262k Coding

Vision Models

Model ID Name Context Features
qwen2p5-vl-32b-instruct Qwen2.5-VL 32B Instruct 128k Vision
qwen3-vl-235b-a22b-instruct Qwen3 VL 235B A22B Instruct 262k Vision
qwen3-vl-235b-a22b-thinking Qwen3 VL 235B A22B Thinking 262k Vision, reasoning
qwen3-vl-30b-a3b-instruct Qwen3 VL 30B A3B Instruct 262k Vision
qwen3-vl-30b-a3b-thinking Qwen3 VL 30B A3B Thinking 262k Vision, reasoning

Model IDs

Fireworks model IDs use the full resource path format:

accounts/fireworks/models/<model-name>

When using models in Moltbot, prefix with the provider:

moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2

Streaming and Tool Support

Feature Support
Streaming All models
Function calling Most models (check supportsTools in API)
Vision/Images Vision models only
JSON mode Supported via response_format

Pricing

Fireworks uses pay-per-token pricing. Check fireworks.ai/pricing for current rates. Generally:

  • Smaller models (8B-30B): Lower cost, faster
  • Larger models (70B+): Higher quality, higher cost
  • MoE models: Cost-effective for their capability

Usage Examples

# Use Kimi K2.5 (recommended default)
moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5

# Use DeepSeek V3.2
moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2

# Use coding model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct

# Use vision model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct

Troubleshooting

API key not recognized

echo $FIREWORKS_API_KEY
moltbot models list | grep fireworks

Ensure the key is valid and has not expired.

Model not available

Run moltbot models list to see currently available models in the catalog. If a model you need is missing, you can add it manually to your config file.

Connection issues

Fireworks API is at https://api.fireworks.ai. Ensure your network allows HTTPS connections.

Config file example

{
  env: { FIREWORKS_API_KEY: "fw_..." },
  agents: { defaults: { model: { primary: "fireworks/accounts/fireworks/models/kimi-k2p5" } } },
  models: {
    mode: "merge",
    providers: {
      fireworks: {
        baseUrl: "https://api.fireworks.ai/inference/v1",
        apiKey: "${FIREWORKS_API_KEY}",
        api: "openai-completions",
        models: [
          {
            id: "accounts/fireworks/models/kimi-k2p5",
            name: "Kimi K2.5",
            reasoning: true,
            input: ["text", "image"],
            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
            contextWindow: 262144,
            maxTokens: 8192
          }
        ]
      }
    }
  }
}