Taekmin Kim c6acef769a feat(fireworks): integrate Fireworks AI support into onboarding and authentication

- Added Fireworks AI API key handling in onboarding and non-interactive auth choice.
- Updated CLI options to include Fireworks API key.
- Enhanced provider configuration to support Fireworks models.
- Updated documentation to reflect Fireworks AI integration.

2026-01-28 13:32:44 -08:00

7.7 KiB

Raw Blame History

summary

read_when

Use Fireworks AI serverless models in Moltbot

You want fast serverless inference in Moltbot

You want Fireworks AI setup guidance

Fireworks AI

Fireworks AI provides fast, cost-effective serverless inference for popular open-source models including DeepSeek, Qwen, Kimi, Llama, GLM, and more. All models run on optimized infrastructure with low latency and competitive pricing.

Why Fireworks in Moltbot

Fast inference with optimized serving infrastructure.
Wide model selection including DeepSeek V3.2, Qwen3, Kimi K2.5, Llama 3.3, GLM-4.7, and more.
Serverless - no infrastructure management, pay per token.
OpenAI-compatible /v1 endpoints.

Features

Serverless inference: No GPU management, instant scaling
OpenAI-compatible API: Standard /v1 endpoints for easy integration
Streaming: Supported on all models
Function calling: Supported on most models
Vision: Supported on vision-capable models (Qwen VL series)
Reasoning models: DeepSeek R1, Qwen3 Thinking, Kimi K2 Thinking, Kimi K2.5

Setup

1. Get API Key

Sign up at fireworks.ai
Go to Settings > API Keys > Create API Key
Copy your API key (should start with fw_)

2. Configure Moltbot

Option A: Environment Variable

export FIREWORKS_API_KEY="fw_xxxxxxxxxxxx"

Option B: Interactive Setup (Recommended)

moltbot onboard --auth-choice fireworks-api-key

This will:

Prompt for your API key (or use existing FIREWORKS_API_KEY)
Configure the Fireworks provider with available models
Let you pick your default model
Set up the provider automatically

Option C: Non-interactive

moltbot onboard --non-interactive \
  --auth-choice fireworks-api-key \
  --fireworks-api-key "fw_xxxxxxxxxxxx"

3. Verify Setup

moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5 "Hello, are you working?"

Model Selection

Moltbot includes a curated catalog of popular Fireworks models. Pick based on your needs:

Default: kimi-k2p5 (Kimi K2.5) - vision + reasoning, 262k context.
Best reasoning: deepseek-r1-0528 or qwen3-235b-a22b-thinking-2507
Coding: qwen3-coder-480b-a35b-instruct
Vision: qwen3-vl-235b-a22b-instruct or qwen2p5-vl-32b-instruct

Change your default model anytime:

moltbot models set fireworks/accounts/fireworks/models/kimi-k2p5
moltbot models set fireworks/accounts/fireworks/models/deepseek-v3p2

List all available models:

moltbot models list | grep fireworks

Which Model Should I Use?

Use Case	Recommended Model	Why
General chat	`kimi-k2p5`	Vision + reasoning, 262k context
Complex reasoning	`deepseek-r1-0528`	Best for step-by-step reasoning
Agentic tasks	`gpt-oss-120b`	Designed for reasoning and agentic use
Coding	`qwen3-coder-480b-a35b-instruct`	Code-optimized, 262k context
Vision tasks	`qwen3-vl-235b-a22b-instruct`	Best multimodal capabilities
Fast + cheap	`qwen3-8b`	Lightweight, low latency
Long context	`kimi-k2-instruct-0905`	262k context window

Available Models (25 Total)

Text Models

Model ID	Name	Context	Features
`deepseek-r1-0528`	Deepseek R1 05/28	163k	Reasoning
`deepseek-v3-0324`	Deepseek V3 03-24	163k	General
`deepseek-v3p1`	DeepSeek V3.1	163k	General
`deepseek-v3p1-terminus`	DeepSeek V3.1 Terminus	163k	General
`deepseek-v3p2`	Deepseek v3.2	163k	General
`glm-4p6`	GLM-4.6	202k	Reasoning
`glm-4p7`	GLM-4.7	202k	Reasoning
`gpt-oss-120b`	OpenAI gpt-oss-120b	131k	Reasoning
`gpt-oss-20b`	OpenAI gpt-oss-20b	131k	Reasoning
`kimi-k2-instruct-0905`	Kimi K2 Instruct 0905	262k	Long context
`kimi-k2p5`	Kimi K2.5	262k	Vision, reasoning
`kimi-k2-thinking`	Kimi K2 Thinking	256k	Reasoning
`llama-v3p3-70b-instruct`	Llama 3.3 70B Instruct	131k	General
`minimax-m2`	MiniMax-M2	196k	Agentic, coding
`minimax-m2p1`	MiniMax-M2.1	204k	Agentic, coding
`qwen3-235b-a22b`	Qwen3 235B A22B	131k	General
`qwen3-235b-a22b-instruct-2507`	Qwen3 235B A22B Instruct 2507	262k	General
`qwen3-235b-a22b-thinking-2507`	Qwen3 235B A22B Thinking 2507	262k	Reasoning
`qwen3-8b`	Qwen3 8B	40k	Fast
`qwen3-coder-480b-a35b-instruct`	Qwen3 Coder 480B A35B Instruct	262k	Coding

Vision Models

Model ID	Name	Context	Features
`qwen2p5-vl-32b-instruct`	Qwen2.5-VL 32B Instruct	128k	Vision
`qwen3-vl-235b-a22b-instruct`	Qwen3 VL 235B A22B Instruct	262k	Vision
`qwen3-vl-235b-a22b-thinking`	Qwen3 VL 235B A22B Thinking	262k	Vision, reasoning
`qwen3-vl-30b-a3b-instruct`	Qwen3 VL 30B A3B Instruct	262k	Vision
`qwen3-vl-30b-a3b-thinking`	Qwen3 VL 30B A3B Thinking	262k	Vision, reasoning

Model IDs

Fireworks model IDs use the full resource path format:

accounts/fireworks/models/<model-name>

When using models in Moltbot, prefix with the provider:

moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2

Streaming and Tool Support

Feature	Support
Streaming	All models
Function calling	Most models (check `supportsTools` in API)
Vision/Images	Vision models only
JSON mode	Supported via `response_format`

Pricing

Fireworks uses pay-per-token pricing. Check fireworks.ai/pricing for current rates. Generally:

Smaller models (8B-30B): Lower cost, faster
Larger models (70B+): Higher quality, higher cost
MoE models: Cost-effective for their capability

Usage Examples

# Use Kimi K2.5 (recommended default)
moltbot chat --model fireworks/accounts/fireworks/models/kimi-k2p5

# Use DeepSeek V3.2
moltbot chat --model fireworks/accounts/fireworks/models/deepseek-v3p2

# Use coding model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct

# Use vision model
moltbot chat --model fireworks/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct

Troubleshooting

API key not recognized

echo $FIREWORKS_API_KEY
moltbot models list | grep fireworks

Ensure the key is valid and has not expired.

Model not available

Run moltbot models list to see currently available models in the catalog. If a model you need is missing, you can add it manually to your config file.

Connection issues

Fireworks API is at https://api.fireworks.ai. Ensure your network allows HTTPS connections.

Config file example

{
  env: { FIREWORKS_API_KEY: "fw_..." },
  agents: { defaults: { model: { primary: "fireworks/accounts/fireworks/models/kimi-k2p5" } } },
  models: {
    mode: "merge",
    providers: {
      fireworks: {
        baseUrl: "https://api.fireworks.ai/inference/v1",
        apiKey: "${FIREWORKS_API_KEY}",
        api: "openai-completions",
        models: [
          {
            id: "accounts/fireworks/models/kimi-k2p5",
            name: "Kimi K2.5",
            reasoning: true,
            input: ["text", "image"],
            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
            contextWindow: 262144,
            maxTokens: 8192
          }
        ]
      }
    }
  }
}

7.7 KiB Raw Blame History