Merge 7579008543 into 4583f88626

2026-01-29 19:56:03 +01:00
commit c83ef2b0f6
parent 4583f88626 7579008543
58 changed files with 85 additions and 3 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -1,7 +1,25 @@
-# Repository Guidelines
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
 - Repo: https://github.com/moltbot/moltbot
 - GitHub issues/comments/PR comments: use literal multiline strings or `-F - <<'EOF'` (or $'...') for real newlines; never embed "\\n".

+## Architecture Overview
+
+Moltbot is a local-first personal AI assistant that bridges messaging channels to AI agents.
+
+**Message flow:** Inbound message (WhatsApp, Telegram, Slack, etc.) → Gateway WebSocket server (`src/gateway/server.ts`) → Channel plugin routes message → Pi agent runtime (`src/agents/pi-embedded-runner.ts`) executes tools (browser, bash, canvas, etc.) via gateway RPC → Final reply sent back through channel's outbound adapter.
+
+**Key subsystems:**
+- **Gateway** (`src/gateway/`): WebSocket control plane. Hosts sessions, routes messages, serves the Control UI, and exposes RPC methods for agent tool calls. Config is YAML-based (`~/.clawdbot/config.yaml`), hot-reloaded via chokidar.
+- **Agents** (`src/agents/`): Pi agent runtime. Manages LLM provider config, auth profiles, tool definitions, sandbox execution, and skill loading. Sessions are isolated per-agent/group as JSONL files under `~/.clawdbot/sessions/`.
+- **Channel plugins** (`src/channels/plugins/`): Unified `ChannelPlugin` interface with `inbound` (receive) and `outbound` (send) adapters. Core channels live in `src/` (discord, telegram, slack, signal, imessage, web/WhatsApp). Extension channels live in `extensions/*` as workspace packages.
+- **Dependency injection** (`src/cli/deps.ts`): `createDefaultDeps()` wires all channel send functions and services. Used throughout for testability.
+- **Plugin registry** (`src/plugins/runtime.ts`): Channels and extensions are runtime-registered (not statically imported). Access via `getActivePluginRegistry()`.
+
+**Apps:** macOS menu bar (SwiftUI, `apps/macos/`), iOS (`apps/ios/`), Android (Kotlin, `apps/android/`), shared Swift code in `apps/shared/MoltbotKit/`.
+
 ## Project Structure & Module Organization
 - Source code: `src/` (CLI wiring in `src/cli`, commands in `src/commands`, web provider in `src/provider-web.ts`, infra in `src/infra`, media pipeline in `src/media`).
 - Tests: colocated `*.test.ts`.
@@ -47,6 +65,7 @@
 - Type-check/build: `pnpm build` (tsc)
 - Lint/format: `pnpm lint` (oxlint), `pnpm format` (oxfmt)
 - Tests: `pnpm test` (vitest); coverage: `pnpm test:coverage`
+- Run a single test file: `pnpm test -- src/path/to/file.test.ts`

 ## Coding Style & Naming Conventions
 - Language: TypeScript (ESM). Prefer strict typing; avoid `any`.
--- a/README.md
+++ b/README.md
@@ -1,7 +1,7 @@
 # 🦞 Moltbot — Personal AI Assistant

 <p align="center">
-  <img src="https://raw.githubusercontent.com/moltbot/moltbot/main/docs/whatsapp-clawd.jpg" alt="Clawdbot" width="400">
+  <img src="https://raw.githubusercontent.com/davendra/moltbot/main/docs/images/diagrams/30-hero-banner.png" alt="Moltbot — Your AI Agent, Everywhere You Message" width="800">
 </p>

 <p align="center">
@@ -52,6 +52,8 @@ moltbot onboard --install-daemon
 The wizard installs the Gateway daemon (launchd/systemd user service) so it stays running.
 Legacy note: `clawdbot` remains available as a compatibility shim.

+![Moltbot Architecture Overview](https://raw.githubusercontent.com/davendra/moltbot/main/docs/images/diagrams/27-readme-architecture.png)
+
 ## Quick start (TL;DR)

 Runtime: **Node ≥22**.
--- a/docs/automation/cron-jobs.md
+++ b/docs/automation/cron-jobs.md
@@ -15,6 +15,8 @@ the right time, and can optionally deliver output back to a chat.
 If you want *“run this every morning”* or *“poke the agent in 20 minutes”*,
 cron is the mechanism.

+![Cron Job Execution Paths](/images/diagrams/09-cron-jobs.png)
+
 ## TL;DR
 - Cron runs **inside the Gateway** (not inside the model).
 - Jobs persist under `~/.clawdbot/cron/` so restarts don’t lose schedules.
--- a/docs/automation/webhook.md
+++ b/docs/automation/webhook.md
@@ -9,6 +9,8 @@ read_when:

 Gateway can expose a small HTTP webhook endpoint for external triggers.

+![Webhook Processing Flow](/images/diagrams/10-webhook.png)
+
 ## Enable

 ```json5
--- a/docs/broadcast-groups.md
+++ b/docs/broadcast-groups.md
@@ -17,6 +17,8 @@ Broadcast Groups enable multiple agents to process and respond to the same messa

 Current scope: **WhatsApp only** (web channel).

+![Broadcast Group Flow](/images/diagrams/14-broadcast.png)
+
 Broadcast groups are evaluated after channel allowlists and group activation rules. In WhatsApp groups, this means broadcasts happen when Moltbot would normally reply (for example: on mention, depending on your group settings).

 ## Use Cases
--- a/docs/concepts/agent-loop.md
+++ b/docs/concepts/agent-loop.md
@@ -13,6 +13,8 @@ In Moltbot, a loop is a single, serialized run per session that emits lifecycle
 as the model thinks, calls tools, and streams output. This doc explains how that authentic loop is
 wired end-to-end.

+![Agent Loop Lifecycle](/images/diagrams/02-agent-loop.png)
+
 ## Entry points
 - Gateway RPC: `agent` and `agent.wait`.
 - CLI: `agent` command.
@@ -80,6 +82,8 @@ These run inside the agent loop or gateway pipeline:

 See [Plugins](/plugin#plugin-hooks) for the hook API and registration details.

+![Plugin Hooks Lifecycle](/images/diagrams/20-plugin-hooks.png)
+
 ## Streaming + partial replies
 - Assistant deltas are streamed from pi-agent-core and emitted as `assistant` events.
 - Block streaming can emit partial replies either on `text_end` or `message_end`.
--- a/docs/concepts/agent-workspace.md
+++ b/docs/concepts/agent-workspace.md
@@ -9,6 +9,8 @@ read_when:
 The workspace is the agent's home. It is the only working directory used for
 file tools and for workspace context. Keep it private and treat it as memory.

+![Agent Workspace Structure](/images/diagrams/26-workspace.png)
+
 This is separate from `~/.clawdbot/`, which stores config, credentials, and
 sessions.

--- a/docs/concepts/architecture.md
+++ b/docs/concepts/architecture.md
@@ -21,6 +21,8 @@ Last updated: 2026-01-22

 ## Components and flows

+![Architecture Overview](/images/diagrams/01-architecture.png)
+
 ### Gateway (daemon)
 - Maintains provider connections.
 - Exposes a typed WS API (requests, responses, server‑push events).
--- a/docs/concepts/channel-routing.md
+++ b/docs/concepts/channel-routing.md
@@ -51,6 +51,8 @@ Routing picks **one agent** for each inbound message:

 The matched agent determines which workspace and session store are used.

+![Channel Routing Priority Cascade](/images/diagrams/05-channel-routing.png)
+
 ## Broadcast groups (run multiple agents)

 Broadcast groups let you run **multiple agents** for the same peer **when Moltbot would normally reply** (for example: in WhatsApp groups, after mention/activation gating).
@@ -69,6 +71,8 @@ Config:

 See: [Broadcast Groups](/broadcast-groups).

+![Broadcast vs Normal Routing](/images/diagrams/29-broadcast-vs-normal.png)
+
 ## Config overview

 - `agents.list`: named agent definitions (workspace, model, etc.).
--- a/docs/concepts/compaction.md
+++ b/docs/concepts/compaction.md
@@ -8,6 +8,8 @@ read_when:

 Every model has a **context window** (max tokens it can see). Long-running chats accumulate messages and tool results; once the window is tight, Moltbot **compacts** older history to stay within limits.

+![Context Compaction Flow](/images/diagrams/11-compaction.png)
+
 ## What compaction is
 Compaction **summarizes older conversation** into a compact summary entry and keeps recent messages intact. The summary is stored in the session history, so future requests use:
 - The compaction summary
--- a/docs/concepts/memory.md
+++ b/docs/concepts/memory.md
@@ -9,6 +9,8 @@ read_when:
 Moltbot memory is **plain Markdown in the agent workspace**. The files are the
 source of truth; the model only "remembers" what gets written to disk.

+![Memory Organization](/images/diagrams/12-memory.png)
+
 Memory search tools are provided by the active memory plugin (default:
 `memory-core`). Disable memory plugins with `plugins.slots.memory = "none"`.

--- a/docs/concepts/messages.md
+++ b/docs/concepts/messages.md
@@ -20,6 +20,8 @@ Inbound message
  -> outbound replies (channel limits + chunking)
 ```

+![Message Processing Flow](/images/diagrams/03-message-flow.png)
+
 Key knobs live in configuration:
 - `messages.*` for prefixes, queueing, and group behavior.
 - `agents.defaults.*` for block streaming and chunking defaults.
--- a/docs/concepts/model-failover.md
+++ b/docs/concepts/model-failover.md
@@ -13,6 +13,8 @@ Moltbot handles failures in two stages:

 This doc explains the runtime rules and the data that backs them.

+![Model Failover Decision Tree](/images/diagrams/17-model-failover.png)
+
 ## Auth storage (keys + OAuth)

 Moltbot uses **auth profiles** for both API keys and OAuth tokens.
--- a/docs/concepts/multi-agent.md
+++ b/docs/concepts/multi-agent.md
@@ -9,7 +9,9 @@ status: active

 Goal: multiple *isolated* agents (separate workspace + `agentDir` + sessions), plus multiple channel accounts (e.g. two WhatsApps) in one running Gateway. Inbound is routed to an agent via bindings.

-## What is “one agent”?
+![Multi-Agent Isolation](/images/diagrams/06-multi-agent.png)
+
+## What is "one agent"?

 An **agent** is a fully scoped brain with its own:

--- a/docs/concepts/queue.md
+++ b/docs/concepts/queue.md
@@ -7,6 +7,8 @@ read_when:

 We serialize inbound auto-reply runs (all channels) through a tiny in-process queue to prevent multiple agent runs from colliding, while still allowing safe parallelism across sessions.

+![Lane-Aware FIFO Queue](/images/diagrams/07-queue-lanes.png)
+
 ## Why
 - Auto-reply runs can be expensive (LLM calls) and can collide when multiple inbound messages arrive close together.
 - Serializing avoids competing for shared resources (session files, logs, CLI stdin) and reduces the chance of upstream rate limits.
@@ -27,6 +29,8 @@ Inbound messages can steer the current run, wait for a followup turn, or do both
 - `interrupt` (legacy): abort the active run for that session, then run the newest message.
 - `queue` (legacy alias): same as `steer`.

+![Queue Modes State Machine](/images/diagrams/08-queue-modes.png)
+
 Steer-backlog means you can get a followup response after the steered run, so
 streaming surfaces can look like duplicates. Prefer `collect`/`steer` if you want
 one response per inbound message.
--- a/docs/concepts/session.md
+++ b/docs/concepts/session.md
@@ -7,6 +7,8 @@ read_when:

 Moltbot treats **one direct-chat session per agent** as primary. Direct chats collapse to `agent:<agentId>:<mainKey>` (default `main`), while group/channel chats get their own keys. `session.mainKey` is honored.

+![Session Lifecycle](/images/diagrams/28-session-lifecycle.png)
+
 Use `session.dmScope` to control how **direct messages** are grouped:
 - `main` (default): all DMs share the main session for continuity.
 - `per-peer`: isolate by sender id across channels.
--- a/docs/concepts/streaming.md
+++ b/docs/concepts/streaming.md
@@ -13,6 +13,8 @@ Moltbot has two separate “streaming” layers:

 There is **no real token streaming** to external channel messages today. Telegram draft streaming is the only partial-stream surface.

+![Streaming Delivery Paths](/images/diagrams/13-streaming.png)
+
 ## Block streaming (channel messages)

 Block streaming sends assistant output in coarse chunks as it becomes available.
--- a/docs/gateway/index.md
+++ b/docs/gateway/index.md
@@ -9,6 +9,9 @@ Last updated: 2025-12-09

 ## What it is
 - The always-on process that owns the single Baileys/Telegram connection and the control/event plane.
+
+![Gateway Lifecycle States](/images/diagrams/23-gateway-lifecycle.png)
+
 - Replaces the legacy `gateway` command. CLI entry point: `moltbot gateway`.
 - Runs until stopped; exits non-zero on fatal errors so the supervisor restarts it.

@@ -22,6 +25,8 @@ moltbot gateway --force
 # dev loop (auto-reload on TS changes):
 pnpm gateway:watch
 ```
+![Config Hot-Reload](/images/diagrams/24-hot-reload.png)
+
 - Config hot reload watches `~/.clawdbot/moltbot.json` (or `CLAWDBOT_CONFIG_PATH`).
  - Default mode: `gateway.reload.mode="hybrid"` (hot-apply safe changes, restart on critical).
  - Hot reload uses in-process restart via **SIGUSR1** when needed.
--- a/docs/gateway/protocol.md
+++ b/docs/gateway/protocol.md
@@ -18,6 +18,8 @@ handshake time.
 - WebSocket, text frames with JSON payloads.
 - First frame **must** be a `connect` request.

+![Gateway WebSocket Protocol](/images/diagrams/04-ws-protocol.png)
+
 ## Handshake (connect)

 Gateway → Client (pre-connect challenge):
--- a/docs/gateway/security/index.md
+++ b/docs/gateway/security/index.md
@@ -5,6 +5,8 @@ read_when:
 ---
 # Security 🔒

+![Security Trust Hierarchy](/images/diagrams/31-security-threat-model.png)
+
 ## Quick check: `moltbot security audit` (formerly `clawdbot security audit`)

 See also: [Formal Verification (Security Models)](/security/formal-verification/)
@@ -35,6 +37,8 @@ Moltbot is both a product and an experiment: you’re wiring frontier-model beha

 Start with the smallest access that still works, then widen it as you gain confidence.

+![Three-Layer Security Model](/images/diagrams/15-security.png)
+
 ### What the audit checks (high level)

 - **Inbound access** (DM policies, group policies, allowlists): can strangers trigger the bot?
--- a/docs/images/diagrams/01-architecture.png
+++ b/docs/images/diagrams/01-architecture.png
--- a/docs/images/diagrams/02-agent-loop.png
+++ b/docs/images/diagrams/02-agent-loop.png
--- a/docs/images/diagrams/03-message-flow.png
+++ b/docs/images/diagrams/03-message-flow.png
--- a/docs/images/diagrams/04-ws-protocol.png
+++ b/docs/images/diagrams/04-ws-protocol.png
--- a/docs/images/diagrams/05-channel-routing.png
+++ b/docs/images/diagrams/05-channel-routing.png
--- a/docs/images/diagrams/06-multi-agent.png
+++ b/docs/images/diagrams/06-multi-agent.png
--- a/docs/images/diagrams/07-queue-lanes.png
+++ b/docs/images/diagrams/07-queue-lanes.png
--- a/docs/images/diagrams/08-queue-modes.png
+++ b/docs/images/diagrams/08-queue-modes.png
--- a/docs/images/diagrams/09-cron-jobs.png
+++ b/docs/images/diagrams/09-cron-jobs.png
--- a/docs/images/diagrams/10-webhook.png
+++ b/docs/images/diagrams/10-webhook.png
--- a/docs/images/diagrams/11-compaction.png
+++ b/docs/images/diagrams/11-compaction.png
--- a/docs/images/diagrams/12-memory.png
+++ b/docs/images/diagrams/12-memory.png
--- a/docs/images/diagrams/13-streaming.png
+++ b/docs/images/diagrams/13-streaming.png
--- a/docs/images/diagrams/14-broadcast.png
+++ b/docs/images/diagrams/14-broadcast.png
--- a/docs/images/diagrams/15-security.png
+++ b/docs/images/diagrams/15-security.png
--- a/docs/images/diagrams/16-pairing.png
+++ b/docs/images/diagrams/16-pairing.png
--- a/docs/images/diagrams/17-model-failover.png
+++ b/docs/images/diagrams/17-model-failover.png
--- a/docs/images/diagrams/18-tool-groups.png
+++ b/docs/images/diagrams/18-tool-groups.png
--- a/docs/images/diagrams/19-plugin-discovery.png
+++ b/docs/images/diagrams/19-plugin-discovery.png
--- a/docs/images/diagrams/20-plugin-hooks.png
+++ b/docs/images/diagrams/20-plugin-hooks.png
--- a/docs/images/diagrams/21-browser.png
+++ b/docs/images/diagrams/21-browser.png
--- a/docs/images/diagrams/22-onboarding.png
+++ b/docs/images/diagrams/22-onboarding.png
--- a/docs/images/diagrams/23-gateway-lifecycle.png
+++ b/docs/images/diagrams/23-gateway-lifecycle.png
--- a/docs/images/diagrams/24-hot-reload.png
+++ b/docs/images/diagrams/24-hot-reload.png
--- a/docs/images/diagrams/25-install.png
+++ b/docs/images/diagrams/25-install.png
--- a/docs/images/diagrams/26-workspace.png
+++ b/docs/images/diagrams/26-workspace.png
--- a/docs/images/diagrams/27-readme-architecture.png
+++ b/docs/images/diagrams/27-readme-architecture.png
--- a/docs/images/diagrams/28-session-lifecycle.png
+++ b/docs/images/diagrams/28-session-lifecycle.png
--- a/docs/images/diagrams/29-broadcast-vs-normal.png
+++ b/docs/images/diagrams/29-broadcast-vs-normal.png
--- a/docs/images/diagrams/30-hero-banner.png
+++ b/docs/images/diagrams/30-hero-banner.png
--- a/docs/images/diagrams/31-security-threat-model.png
+++ b/docs/images/diagrams/31-security-threat-model.png
--- a/docs/images/diagrams/32-getting-started-timeline.png
+++ b/docs/images/diagrams/32-getting-started-timeline.png
--- a/docs/install/index.md
+++ b/docs/install/index.md
@@ -9,6 +9,8 @@ read_when:

 Use the installer unless you have a reason not to. It sets up the CLI and runs onboarding.

+![Installation Decision Tree](/images/diagrams/25-install.png)
+
 ## Quick install (recommended)

 ```bash
--- a/docs/plugin.md
+++ b/docs/plugin.md
@@ -112,6 +112,8 @@ manifest.
 If multiple plugins resolve to the same id, the first match in the order above
 wins and lower-precedence copies are ignored.

+![Plugin Discovery Precedence](/images/diagrams/19-plugin-discovery.png)
+
 ### Package packs

 A plugin directory may include a `package.json` with `moltbot.extensions`:
--- a/docs/start/getting-started.md
+++ b/docs/start/getting-started.md
@@ -7,8 +7,12 @@ read_when:

 # Getting Started

+![5 Minutes to First Message](/images/diagrams/32-getting-started-timeline.png)
+
 Goal: go from **zero** → **first working chat** (with sane defaults) as quickly as possible.

+![Getting Started: Onboarding Wizard](/images/diagrams/22-onboarding.png)
+
 Fastest chat: open the Control UI (no channel setup needed). Run `moltbot dashboard`
 and chat in the browser, or open `http://127.0.0.1:18789/` on the gateway host.
 Docs: [Dashboard](/web/dashboard) and [Control UI](/web/control-ui).
--- a/docs/start/pairing.md
+++ b/docs/start/pairing.md
@@ -14,6 +14,8 @@ It is used in two places:
 1) **DM pairing** (who is allowed to talk to the bot)
 2) **Node pairing** (which devices/nodes are allowed to join the gateway network)

+![Pairing Flows](/images/diagrams/16-pairing.png)
+
 Security context: [Security](/gateway/security)

 ## 1) DM pairing (inbound chat access)
--- a/docs/tools/browser.md
+++ b/docs/tools/browser.md
@@ -321,6 +321,8 @@ High-level flow:
 This design keeps the agent on a stable, deterministic interface while letting
 you swap local/remote browsers and profiles.

+![Browser Automation Flow](/images/diagrams/21-browser.png)
+
 ## CLI quick reference

 All commands accept `--browser-profile <name>` to target a specific profile.
--- a/docs/tools/index.md
+++ b/docs/tools/index.md
@@ -11,6 +11,8 @@ Moltbot exposes **first-class agent tools** for browser, canvas, nodes, and cron
 These replace the old `moltbot-*` skills: the tools are typed, no shelling,
 and the agent should rely on them directly.

+![Tool Groups](/images/diagrams/18-tool-groups.png)
+
 ## Disabling tools

 You can globally allow/deny tools via `tools.allow` / `tools.deny` in `moltbot.json`