Merge branch 'main' into fix/65388-active-turn-queue

Update CHANGELOG.md
fix: trim timezone suffix from pretty logs
2026-06-13 09:41:17 +08:00 · 2026-04-12 19:02:55 +01:00 · 2026-04-12 19:02:35 +01:00 · 2026-04-12 18:58:27 +01:00 · 2026-04-12 18:56:57 +01:00 · 2026-04-12 18:56:27 +01:00
2842 changed files with 118678 additions and 34278 deletions
--- a/.agents/skills/openclaw-parallels-smoke/SKILL.md
+++ b/.agents/skills/openclaw-parallels-smoke/SKILL.md
@@ -29,7 +29,13 @@ Use this skill for Parallels guest workflows and smoke interpretation. Do not lo
 ## npm install then update

 - Preferred entrypoint: `pnpm test:parallels:npm-update`
- Flow: fresh snapshot -> install npm package baseline -> smoke -> install current main tgz on the same guest -> smoke again.
+- Required coverage: every release/update regression run must include both lanes:
+  - fresh snapshot -> install requested package/baseline -> smoke
+  - same guest baseline -> run the guest's installed `openclaw update ...` command -> smoke again
+- The update lane must exercise OpenClaw's internal updater. Do not count a direct `npm install -g <tgz-or-spec>` or harness-side package swap as update-flow coverage; those are install smokes only.
+- For published targets, install the old baseline package first (for example `openclaw@2026.4.9`), then run the installed guest CLI with the intended channel/tag (for example `openclaw update --channel beta --yes --json`) and verify `openclaw --version`, `openclaw update status --json`, gateway RPC, and an agent turn after the command.
+- For unpublished targets, pack the candidate on the host, serve the `.tgz` over the harness HTTP server, and point the guest updater at that served package. Prefer `openclaw update --tag http://<host-ip>:<port>/openclaw-<version>.tgz --yes --json`; when channel persistence also matters, pass `--channel <stable|beta>` and set `OPENCLAW_UPDATE_PACKAGE_SPEC` to the same served URL in the guest update environment. The command under test must still be `openclaw update`, not direct npm.
+- For unpublished local-fix validation, remember the old baseline updater code still controls the first hop. A fix that lives only in the new updater code cannot change that already-running old process; either the served candidate must keep package/plugin metadata compatible with the baseline host, or the baseline itself must include the updater fix.
 - For beta/stable verification, resolve the tag immediately before the run (`npm view openclaw@beta version dist.tarball` or `npm view openclaw@latest ...`). Tags can move while a long VM matrix is already running; restart the matrix when the intended prerelease appears after an earlier registry 404/tag-lag check.
 - Source Peter's profile in the host shell (`set -a; source "$HOME/.profile"; set +a`) before OpenAI/Anthropic lanes. Do not print profile contents or env dumps; pass provider secrets through the guest exec environment.
 - Same-guest update verification should set the default model explicitly to `openai/gpt-5.4` before the agent turn and use a fresh explicit `--session-id` so old session model state does not leak into the check.
--- a/.agents/skills/openclaw-secret-scanning-maintainer/SKILL.md
+++ b/.agents/skills/openclaw-secret-scanning-maintainer/SKILL.md
@@ -0,0 +1,201 @@
+---
+name: openclaw-secret-scanning-maintainer
+description: Maintainer-only workflow for handling GitHub Secret Scanning alerts on OpenClaw. Use when Codex needs to triage, redact, clean up, and resolve secret leakage found in issue comments, issue bodies, PR comments, or other GitHub content.
+---
+
+# OpenClaw Secret Scanning Maintainer
+
+**Maintainer-only.** This skill requires repo admin / maintainer permissions to edit or delete other users' comments and resolve secret scanning alerts.
+
+Use this skill when processing alerts from `https://github.com/openclaw/openclaw/security/secret-scanning`.
+
+**Language rule:** All notification comments and replacement comments MUST be written in English.
+
+## Script
+
+All mechanical operations (API calls, temp file management, security enforcements) are handled by:
+
+```
+$REPO_ROOT/.agents/skills/openclaw-secret-scanning-maintainer/scripts/secret-scanning.mjs
+```
+
+The script enforces:
+
+- `hide_secret=true` on all alert fetches (no plaintext secrets in stdout)
+- Temp files with random UUID names, pre-created with owner-only permissions
+- `-F body=@file` for all body uploads (no inline shell quoting)
+- Notification templates branched by location type
+- Never prints `.secret` or `.body` to stdout
+
+## Overall Flow
+
+Supports single or multiple alerts. For multiple alerts, process in ascending order.
+
+For each alert:
+
+1. **Identify** — `fetch-alert` + `fetch-content` to get metadata and body
+2. **Decide** — Agent reads the body file, identifies all secrets, produces redacted version
+3. **Redact** — `redact-body` for issue/PR body; skip for comments (delete directly)
+4. **Purge** — `delete-comment` + `recreate-comment` for comments; cannot purge body history
+5. **Notify** — `notify` posts the right template per location type
+6. **Resolve** — `resolve` closes the alert
+7. **Summary** — `summary` prints formatted results
+
+## Step 1: Identify
+
+```bash
+# List all open alerts
+node secret-scanning.mjs list-open
+
+# Fetch specific alert metadata + locations
+node secret-scanning.mjs fetch-alert <NUMBER>
+
+# Fetch content for each location (saves body to temp file)
+node secret-scanning.mjs fetch-content '<location-json>'
+```
+
+The `fetch-content` output includes:
+
+- `body_file`: path to temp file with full body content
+- `author`: who posted it
+- `issue_number` / `pr_number`: where it is
+- `edit_history_count`: number of existing edits
+- `type`: location type for routing
+
+### Location type routing
+
+| type                          | Flow                     |
+| ----------------------------- | ------------------------ |
+| `issue_comment`               | Comment: delete+recreate |
+| `pull_request_comment`        | Comment: delete+recreate |
+| `pull_request_review_comment` | Comment: delete+recreate |
+| `issue_body`                  | Body: redact in place    |
+| `pull_request_body`           | Body: redact in place    |
+| `commit`                      | Notify only              |
+| _other_                       | Skip and report          |
+
+## Step 2: Decide (Agent)
+
+The agent reads the body file from `fetch-content` output and:
+
+1. Identifies ALL secrets in the content (there may be more than the alert flagged)
+2. Replaces each secret with `[REDACTED <secret_type>]` — **no partial values, no prefix/suffix**
+3. Saves the redacted content to a new temp file
+
+This is the only step that requires semantic understanding. Everything else is mechanical.
+
+## Step 3: Redact
+
+### For comments (issue_comment / PR comments)
+
+**Do NOT redact.** Skip directly to Step 4 (delete + recreate). PATCHing before DELETE creates an unnecessary edit history revision.
+
+### For issue_body / pull_request_body
+
+```bash
+node secret-scanning.mjs redact-body <issue|pr> <NUMBER> <redacted-body-file>
+```
+
+## Step 4: Purge Edit History
+
+### Comments — Delete and Recreate
+
+```bash
+# Delete original (all edit history gone)
+node secret-scanning.mjs delete-comment <COMMENT_ID>
+
+# Recreate with redacted content
+# Agent prepares the body file with maintainer header + redacted content
+node secret-scanning.mjs recreate-comment <ISSUE_NUMBER> <body-file>
+```
+
+The recreated comment should follow this format:
+
+```
+> **Note from maintainer (@<LOGIN>):** The original comment by @<AUTHOR> has been removed due to secret leakage. Below is the redacted version of the original content.
+
+---
+
+<redacted original content>
+```
+
+### issue_body / pull_request_body — Cannot Purge
+
+Editing creates an edit history revision with the pre-edit plaintext. This cannot be cleared via API.
+
+**Output to maintainer terminal only (never in public comments):**
+
+```
+⚠️ Issue/PR body edit history still contains plaintext secrets.
+Contact GitHub Support to purge: https://support.github.com/contact
+Request purge of issue/PR #{NUMBER} userContentEdits.
+```
+
+> **CRITICAL:** Do NOT mention edit history or the "edited" button in any public comment or resolution_comment.
+
+### Commits
+
+Commits cannot be cleaned by this workflow. Notify the author to delete the branch or force-push (for unmerged PRs).
+
+## Step 5: Notify
+
+```bash
+node secret-scanning.mjs notify <ISSUE_NUMBER> <AUTHOR> <LOCATION_TYPE> <SECRET_TYPES>
+```
+
+Secret types are comma-separated: `"Discord Bot Token,Feishu App Secret"`
+
+The script picks the right template:
+
+- **comment types**: "your comment … removed and replaced"
+- **body types**: "your issue/PR description … redacted in place"
+- **commit**: "code you committed"
+
+## Step 6: Resolve
+
+```bash
+node secret-scanning.mjs resolve <ALERT_NUMBER>
+# or with custom resolution:
+node secret-scanning.mjs resolve <ALERT_NUMBER> revoked "Custom comment"
+```
+
+Resolution is `revoked` by default. As maintainers we cannot control whether users rotate their credentials; our responsibility is to redact and notify. Here `revoked` means "this secret should be considered leaked", not "I confirmed it was revoked".
+
+## Step 7: Summary
+
+After processing, create a JSON results file and pass it to the summary command:
+
+```bash
+node secret-scanning.mjs summary /tmp/results.json
+```
+
+The script outputs a block delimited by `---BEGIN SUMMARY---` and `---END SUMMARY---`. **You MUST output the content between these markers verbatim to the user. Do NOT rephrase, reformat, abbreviate, or create your own summary.** The script already includes full URLs for every alert and location.
+
+The JSON format:
+
+```json
+[
+  {
+    "number": 72,
+    "secret_type": "Discord Bot Token",
+    "location_label": "Issue #63101 comment",
+    "location_url": "https://github.com/openclaw/openclaw/issues/63101#issuecomment-xxx",
+    "actions": "Deleted+Recreated+Notified",
+    "history_cleared": true
+  }
+]
+```
+
+For unsupported types, add `"skipped": true, "unsupported_type": "<type>"`.
+
+## Safety Rules
+
+- **Agent reads content, identifies secrets, produces redaction.** Script handles all API calls.
+- **Never include any portion of a secret** in public comments, redaction markers, or terminal output.
+- **Never include alert URLs or numbers** in public comments.
+- **For comments, skip PATCH — go directly to DELETE + recreate.**
+- **Never mention edit history, "edited" button, or commit SHAs** in any public content.
+- **Ask for confirmation** before deleting any comment.
+- **One alert at a time** unless user requests batch.
+- **All public comments in English.**
+- **Skip unsupported location types** and report in summary.
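The Step 2 redaction rule in the skill above (replace every secret with `[REDACTED <secret_type>]`, with no partial values) could be sketched roughly as follows. This is only an illustration: `redactSecrets` and the shape of `findings` are hypothetical and not part of `secret-scanning.mjs`; in the skill, the agent identifies the secrets itself.

```javascript
// Hypothetical helper, not part of secret-scanning.mjs.
// findings: [{ value: "<full secret string>", type: "<secret type label>" }]
function redactSecrets(body, findings) {
  // Drop empty values (splitting on "" would corrupt the text) and handle
  // longer secrets first so a shorter secret that is a substring of a longer
  // one cannot leave a partial value behind.
  const ordered = findings
    .filter((f) => f.value)
    .sort((a, b) => b.value.length - a.value.length);

  let redacted = body;
  for (const { value, type } of ordered) {
    // split/join replaces every occurrence without regex escaping concerns.
    redacted = redacted.split(value).join(`[REDACTED ${type}]`);
  }
  return redacted;
}

// Usage with an obviously fake value.
const sample = "token=abc123 and again abc123";
const out = redactSecrets(sample, [{ value: "abc123", type: "Discord Bot Token" }]);
console.log(out);
```

A plain string replacement like this only covers secrets that were already identified; finding secrets the alert did not flag still needs the semantic pass described in Step 2.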
--- a/.agents/skills/openclaw-secret-scanning-maintainer/scripts/secret-scanning.mjs
+++ b/.agents/skills/openclaw-secret-scanning-maintainer/scripts/secret-scanning.mjs
@@ -0,0 +1,531 @@
+#!/usr/bin/env node
+// Secret scanning alert handler for OpenClaw maintainers.
+// Usage: node secret-scanning.mjs <command> [options]
+
+import { execFileSync, spawnSync } from "node:child_process";
+import crypto from "node:crypto";
+import fs from "node:fs";
+import os from "node:os";
+import path from "node:path";
+
+const REPO = "openclaw/openclaw";
+const REPO_URL = `https://github.com/${REPO}`;
+
+// ─── Helpers ────────────────────────────────────────────────────────────────
+
+function fail(message) {
+  console.error(`error: ${message}`);
+  process.exit(1);
+}
+
+function tmpFile(purpose) {
+  const filePath = path.join(os.tmpdir(), `secretscan-${purpose}-${crypto.randomUUID()}`);
+  // Pre-create the file with owner-only (0600) permissions
+  fs.writeFileSync(filePath, "", { mode: 0o600 });
+  return filePath;
+}
+
+function gh(args, { json = true, allowFailure = false } = {}) {
+  const proc = spawnSync("gh", args, { encoding: "utf8", maxBuffer: 10 * 1024 * 1024 });
+  if (proc.status !== 0 && !allowFailure) {
+    fail(`gh ${args.slice(0, 3).join(" ")} failed:\n${(proc.stderr || proc.stdout || "").trim()}`);
+  }
+  if (!json) return proc.stdout;
+  try {
+    return JSON.parse(proc.stdout);
+  } catch {
+    return proc.stdout;
+  }
+}
+
+function ghGraphQL(query) {
+  return gh(["api", "graphql", "-f", `query=${query}`]);
+}
+
+// ─── Commands ───────────────────────────────────────────────────────────────
+
+/**
+ * fetch-alert <number>
+ * Fetch alert metadata + locations. Never exposes .secret.
+ */
+function cmdFetchAlert(alertNumber) {
+  if (!alertNumber) fail("Usage: fetch-alert <number>");
+
+  const alert = gh(["api", `repos/${REPO}/secret-scanning/alerts/${alertNumber}?hide_secret=true`]);
+
+  const locations = gh(["api", `repos/${REPO}/secret-scanning/alerts/${alertNumber}/locations`, "--paginate", "--slurp"]);
+  // --paginate + --slurp ensure multi-page results are merged into a single JSON array
+  const flatLocations = Array.isArray(locations?.[0]) ? locations.flat() : Array.isArray(locations) ? locations : [];
+
+  const result = {
+    number: alert.number,
+    state: alert.state,
+    secret_type: alert.secret_type,
+    secret_type_display_name: alert.secret_type_display_name,
+    validity: alert.validity,
+    html_url: alert.html_url,
+    locations: flatLocations.map((loc) => ({
+      type: loc.type,
+      details: loc.details,
+    })),
+  };
+
+  console.log(JSON.stringify(result, null, 2));
+}
+
+/**
+ * fetch-content <location-json>
+ * Fetch the content and metadata for a specific location.
+ * Saves full body to a temp file. Prints metadata + file path to stdout.
+ */
+function cmdFetchContent(locationJson) {
+  if (!locationJson) fail("Usage: fetch-content '<location-json>'");
+  const location = JSON.parse(locationJson);
+  const type = location.type;
+  const details = location.details;
+
+  if (
+    type === "issue_comment" ||
+    type === "pull_request_comment" ||
+    type === "pull_request_review_comment"
+  ) {
+    // Get the comment API URL from the location details
+    const commentUrl =
+      details.issue_comment_url ||
+      details.pull_request_comment_url ||
+      details.pull_request_review_comment_url;
+    if (!commentUrl) fail(`No comment URL in location details`);
+
+    const comment = gh(["api", commentUrl]);
+    const bodyFile = tmpFile("body.md");
+    fs.writeFileSync(bodyFile, comment.body || "");
+
+    // Fetch the edit history count
+    const nodeId = comment.node_id;
+    const typeName =
+      type === "pull_request_review_comment" ? "PullRequestReviewComment" : "IssueComment";
+    const gql = ghGraphQL(`{
+      node(id: "${nodeId}") {
+        ... on ${typeName} {
+          userContentEdits(first: 50) {
+            totalCount
+          }
+        }
+      }
+    }`);
+    const editCount = gql?.data?.node?.userContentEdits?.totalCount ?? 0;
+
+    // Extract the issue number from html_url
+    const htmlUrl = comment.html_url || details.html_url || "";
+    const issueMatch = htmlUrl.match(/\/(issues|pull)\/(\d+)/);
+    const issueNumber = issueMatch ? issueMatch[2] : null;
+
+    console.log(
+      JSON.stringify(
+        {
+          type,
+          comment_id: comment.id,
+          node_id: nodeId,
+          author: comment.user?.login,
+          issue_number: issueNumber,
+          html_url: htmlUrl,
+          edit_history_count: editCount,
+          body_file: bodyFile,
+        },
+        null,
+        2,
+      ),
+    );
+  } else if (type === "issue_body") {
+    const issueUrl = details.issue_body_url || details.issue_url;
+    if (!issueUrl) fail("No issue URL in location details");
+
+    const issue = gh(["api", issueUrl]);
+    const bodyFile = tmpFile("body.md");
+    fs.writeFileSync(bodyFile, issue.body || "");
+
+    const nodeId = issue.node_id;
+    const number = issue.number;
+    const gql = ghGraphQL(`{
+      node(id: "${nodeId}") {
+        ... on Issue {
+          userContentEdits(first: 50) {
+            totalCount
+          }
+        }
+      }
+    }`);
+    const editCount = gql?.data?.node?.userContentEdits?.totalCount ?? 0;
+
+    console.log(
+      JSON.stringify(
+        {
+          type,
+          issue_number: number,
+          node_id: nodeId,
+          author: issue.user?.login,
+          html_url: issue.html_url,
+          edit_history_count: editCount,
+          body_file: bodyFile,
+        },
+        null,
+        2,
+      ),
+    );
+  } else if (type === "pull_request_body") {
+    const prUrl = details.pull_request_body_url || details.pull_request_url;
+    if (!prUrl) fail("No PR URL in location details");
+
+    const pr = gh(["api", prUrl]);
+    const bodyFile = tmpFile("body.md");
+    fs.writeFileSync(bodyFile, pr.body || "");
+
+    const nodeId = pr.node_id;
+    const number = pr.number;
+    const gql = ghGraphQL(`{
+      node(id: "${nodeId}") {
+        ... on PullRequest {
+          userContentEdits(first: 50) {
+            totalCount
+          }
+        }
+      }
+    }`);
+    const editCount = gql?.data?.node?.userContentEdits?.totalCount ?? 0;
+
+    console.log(
+      JSON.stringify(
+        {
+          type,
+          pr_number: number,
+          node_id: nodeId,
+          author: pr.user?.login,
+          merged: pr.merged,
+          state: pr.state,
+          html_url: pr.html_url,
+          edit_history_count: editCount,
+          body_file: bodyFile,
+        },
+        null,
+        2,
+      ),
+    );
+  } else if (type === "commit") {
+    console.log(
+      JSON.stringify(
+        {
+          type,
+          commit_sha: details.commit_sha,
+          path: details.path,
+          start_line: details.start_line,
+          end_line: details.end_line,
+          html_url: details.html_url || details.commit_url || details.blob_url || null,
+          // Commits have no body file
+          body_file: null,
+        },
+        null,
+        2,
+      ),
+    );
+  } else {
+    console.log(
+      JSON.stringify(
+        {
+          type,
+          unsupported: true,
+          details,
+        },
+        null,
+        2,
+      ),
+    );
+  }
+}
+
+/**
+ * redact-body <issue|pr> <number> <redacted-body-file>
+ * PATCH the issue or PR body with redacted content from a file.
+ */
+function cmdRedactBody(kind, number, bodyFile) {
+  if (!kind || !number || !bodyFile) {
+    fail("Usage: redact-body <issue|pr> <number> <redacted-body-file>");
+  }
+  if (!fs.existsSync(bodyFile)) fail(`File not found: ${bodyFile}`);
+
+  const endpoint =
+    kind === "pr" ? `repos/${REPO}/pulls/${number}` : `repos/${REPO}/issues/${number}`;
+
+  gh(["api", endpoint, "-X", "PATCH", "-F", `body=@${bodyFile}`]);
+  console.log(JSON.stringify({ ok: true, kind, number: Number(number) }));
+}
+
+/**
+ * delete-comment <comment-id>
+ * Delete a comment (and all its edit history).
+ */
+function cmdDeleteComment(commentId) {
+  if (!commentId) fail("Usage: delete-comment <comment-id>");
+  gh(["api", `repos/${REPO}/issues/comments/${commentId}`, "-X", "DELETE"], { json: false });
+  console.log(JSON.stringify({ ok: true, deleted_comment_id: Number(commentId) }));
+}
+
+/**
+ * recreate-comment <issue-number> <body-file>
+ * Create a new comment from a file.
+ */
+function cmdRecreateComment(issueNumber, bodyFile) {
+  if (!issueNumber || !bodyFile) fail("Usage: recreate-comment <issue-number> <body-file>");
+  if (!fs.existsSync(bodyFile)) fail(`File not found: ${bodyFile}`);
+
+  const result = gh([
+    "api",
+    `repos/${REPO}/issues/${issueNumber}/comments`,
+    "-X",
+    "POST",
+    "-F",
+    `body=@${bodyFile}`,
+  ]);
+
+  console.log(
+    JSON.stringify({
+      ok: true,
+      comment_id: result.id,
+      html_url: result.html_url,
+    }),
+  );
+}
+
+/**
+ * notify <issue-or-pr-number> <author> <location-type> <secret-types>
+ * Post a notification comment with the correct template for the location type.
+ */
+function cmdNotify(issueNumber, author, locationType, secretTypes) {
+  if (!issueNumber || !author || !locationType || !secretTypes) {
+    fail("Usage: notify <issue-or-pr-number> <author> <location-type> <secret-types-comma-sep>");
+  }
+
+  const types = secretTypes.split(",").map((s) => s.trim());
+  const typeList = types.map((t, i) => `${i + 1}. **${t}**`).join("\n");
+
+  let locationDesc;
+  let actionDesc;
+  if (
+    locationType === "issue_comment" ||
+    locationType === "pull_request_comment" ||
+    locationType === "pull_request_review_comment"
+  ) {
+    locationDesc = "your comment";
+    actionDesc = "The affected comment has been removed and replaced with a redacted version.";
+  } else if (locationType === "issue_body") {
+    locationDesc = "your issue description";
+    actionDesc = "The affected content has been redacted in place.";
+  } else if (locationType === "pull_request_body") {
+    locationDesc = "your pull request description";
+    actionDesc = "The affected content has been redacted in place.";
+  } else if (locationType === "commit") {
+    locationDesc = "code you committed";
+    actionDesc = "";
+  } else {
+    locationDesc = "your content";
+    actionDesc = "";
+  }
+
+  const body = [
+    `@${author} :warning: **Security Notice: Secret Leakage Detected**`,
+    "",
+    `GitHub Secret Scanning detected the following exposed secret types in ${locationDesc}:`,
+    "",
+    typeList,
+    "",
+    actionDesc,
+    "",
+    "**Please rotate these credentials immediately.**",
+    "",
+    "These secrets were publicly exposed and should be considered compromised.",
+  ]
+    .filter((line) => line !== undefined)
+    .join("\n");
+
+  const bodyFile = tmpFile("notify.md");
+  fs.writeFileSync(bodyFile, body);
+
+  const result = gh([
+    "api",
+    `repos/${REPO}/issues/${issueNumber}/comments`,
+    "-X",
+    "POST",
+    "-F",
+    `body=@${bodyFile}`,
+  ]);
+
+  console.log(
+    JSON.stringify({
+      ok: true,
+      comment_id: result.id,
+      html_url: result.html_url,
+    }),
+  );
+}
+
+/**
+ * resolve <alert-number> [resolution] [comment]
+ * Close a secret scanning alert.
+ */
+function cmdResolve(alertNumber, resolution, comment) {
+  if (!alertNumber) fail("Usage: resolve <alert-number> [resolution] [comment]");
+
+  const res = resolution || "revoked";
+  const resComment = comment || "Content redacted and author notified to rotate credentials.";
+
+  const result = gh([
+    "api",
+    `repos/${REPO}/secret-scanning/alerts/${alertNumber}`,
+    "-X",
+    "PATCH",
+    "-f",
+    `state=resolved`,
+    "-f",
+    `resolution=${res}`,
+    "-f",
+    `resolution_comment=${resComment}`,
+  ]);
+
+  console.log(
+    JSON.stringify({
+      ok: true,
+      number: result.number,
+      state: result.state,
+      resolution: result.resolution,
+      resolved_at: result.resolved_at,
+    }),
+  );
+}
+
+/**
+ * list-open
+ * List all open secret scanning alerts.
+ */
+function cmdListOpen() {
+  const alerts = gh([
+    "api",
+    `repos/${REPO}/secret-scanning/alerts?hide_secret=true&state=open`,
+    "--paginate",
+    "--slurp",
+  ]);
+
+  // --slurp wraps paginated results as [[page1], [page2], ...], so flatten them
+  const flat = Array.isArray(alerts?.[0]) ? alerts.flat() : Array.isArray(alerts) ? alerts : [];
+  const rows = flat.map((a) => ({
+    number: a.number,
+    secret_type_display_name: a.secret_type_display_name,
+    html_url: a.html_url,
+    first_location_html_url: a.first_location_detected?.html_url || null,
+  }));
+
+  console.log(JSON.stringify(rows, null, 2));
+}
+
+/**
+ * summary <json-file>
+ * Print a formatted summary table from a JSON results file.
+ */
+function cmdSummary(jsonFile) {
+  if (!jsonFile) fail("Usage: summary <json-file>");
+  if (!fs.existsSync(jsonFile)) fail(`File not found: ${jsonFile}`);
+
+  const results = JSON.parse(fs.readFileSync(jsonFile, "utf8"));
+  const lines = [];
+
+  lines.push("---BEGIN SUMMARY---");
+  lines.push("");
+  lines.push("## Secret Scanning Results");
+  lines.push("");
+  lines.push("| Alert | Type | Location | Actions | Edit History |");
+  lines.push("|-------|------|----------|---------|--------------|");
+
+  const needsPurge = [];
+
+  for (const r of results) {
+    const alertLink = `#${r.number} ${REPO_URL}/security/secret-scanning/${r.number}`;
+    const locationLink = r.location_url
+      ? `${r.location_label} ${r.location_url}`
+      : r.location_label;
+    const history = r.history_cleared ? "Cleared" : "⚠️ History remains";
+
+    lines.push(
+      `| ${alertLink} | ${r.secret_type} | ${locationLink} | ${r.actions} | ${history} |`,
+    );
+
+    if (!r.history_cleared && r.location_url) {
+      needsPurge.push(r);
+    }
+  }
+
+  if (needsPurge.length > 0) {
+    lines.push("");
+    lines.push("Issues requiring GitHub Support to purge edit history:");
+    for (const r of needsPurge) {
+      lines.push(`- ${r.location_label} ${r.location_url} — ${r.secret_type}`);
+    }
+    lines.push(
+      `Contact: https://support.github.com/contact — request purge of userContentEdits for the above issues.`,
+    );
+  }
+
+  const skipped = results.filter((r) => r.skipped);
+  if (skipped.length > 0) {
+    lines.push("");
+    lines.push(
+      "⚠️ The following alerts were skipped because their location type is not supported:",
+    );
+    for (const r of skipped) {
+      lines.push(
+        `- Alert #${r.number}: unsupported type "${r.unsupported_type}" — ${REPO_URL}/security/secret-scanning/${r.number}`,
+      );
+    }
+    lines.push("Please update the skill to define handling for these types.");
+  }
+
+  lines.push("");
+  lines.push("---END SUMMARY---");
+
+  console.log(lines.join("\n"));
+}
+
+// ─── Dispatch ───────────────────────────────────────────────────────────────
+
+const [command, ...args] = process.argv.slice(2);
+
+const commands = {
+  "fetch-alert": () => cmdFetchAlert(args[0]),
+  "fetch-content": () => cmdFetchContent(args[0]),
+  "redact-body": () => cmdRedactBody(args[0], args[1], args[2]),
+  "delete-comment": () => cmdDeleteComment(args[0]),
+  "recreate-comment": () => cmdRecreateComment(args[0], args[1]),
+  notify: () => cmdNotify(args[0], args[1], args[2], args[3]),
+  resolve: () => cmdResolve(args[0], args[1], args[2]),
+  "list-open": () => cmdListOpen(),
+  summary: () => cmdSummary(args[0]),
+};
+
+if (!command || !commands[command]) {
+  console.error(
+    [
+      "Usage: node secret-scanning.mjs <command> [args]",
+      "",
+      "Commands:",
+      "  fetch-alert <number>             Fetch alert metadata + locations",
+      "  fetch-content '<location-json>'   Fetch content for a location",
+      "  redact-body <issue|pr> <n> <file> PATCH body with redacted file",
+      "  delete-comment <comment-id>       Delete a comment",
+      "  recreate-comment <issue-n> <file> Create replacement comment",
+      "  notify <n> <author> <type> <types> Post notification",
+      "  resolve <n> [resolution] [comment] Close alert",
+      "  list-open                          List open alerts",
+      "  summary <json-file>               Print formatted summary",
+    ].join("\n"),
+  );
+  process.exit(1);
+}
+
+commands[command]();
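Both `cmdFetchAlert` and `cmdListOpen` in the script above normalize the `gh api --paginate --slurp` result with the same inline expression. A minimal standalone sketch of that normalization is shown below; the helper name `flattenPages` is hypothetical and does not exist in the script.

```javascript
// Hypothetical helper mirroring the inline expression used in the script.
// --slurp yields an array of pages ([[...page1], [...page2]]); a non-paginated
// response may already be a flat array; anything else is treated as empty.
function flattenPages(pages) {
  if (Array.isArray(pages?.[0])) return pages.flat();
  return Array.isArray(pages) ? pages : [];
}

// Usage: two pages of alerts collapse into one flat list.
const merged = flattenPages([[{ number: 1 }], [{ number: 2 }]]);
console.log(merged.map((a) => a.number));
```

The fallback to an empty array matters because `gh()` returns raw stdout when JSON parsing fails, so the caller may receive a string rather than an array.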
--- a/.env.example
+++ b/.env.example
@@ -14,12 +14,15 @@
 # -----------------------------------------------------------------------------
 # Gateway auth + paths
 # -----------------------------------------------------------------------------
-# Recommended if the gateway binds beyond loopback.
-OPENCLAW_GATEWAY_TOKEN=change-me-to-a-long-random-token
-# Example generator: openssl rand -hex 32
+# Required if the gateway binds beyond loopback. Leave blank to have OpenClaw
+# auto-generate a token on first start, or provide your own using
+# `openssl rand -hex 32`. The gateway will refuse to start if this is set to
+# the documented example placeholder, so never copy-paste an example value
+# from docs or tutorials into this file verbatim.
+OPENCLAW_GATEWAY_TOKEN=

 # Optional alternative auth mode (use token OR password).
-# OPENCLAW_GATEWAY_PASSWORD=change-me-to-a-strong-password
+# OPENCLAW_GATEWAY_PASSWORD=

 # Optional path overrides (defaults shown for reference).
 # OPENCLAW_STATE_DIR=~/.openclaw
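For readers who prefer Node over `openssl rand -hex 32`, an equivalent 64-character hex token can be generated as sketched below. The placeholder check is purely illustrative: `isPlaceholderToken` and `PLACEHOLDER_TOKENS` are hypothetical names, and the gateway's real startup check is not shown in this diff. CommonJS `require` is used so the snippet runs as a standalone script.

```javascript
const crypto = require("node:crypto");

// The example value removed from .env.example in this change.
const PLACEHOLDER_TOKENS = new Set(["change-me-to-a-long-random-token"]);

// 32 random bytes rendered as hex, the same shape as `openssl rand -hex 32`.
function generateGatewayToken() {
  return crypto.randomBytes(32).toString("hex");
}

// Hypothetical guard: true when the configured token is a known example value.
function isPlaceholderToken(token) {
  return PLACEHOLDER_TOKENS.has(String(token).trim());
}

const token = generateGatewayToken();
console.log(token.length, isPlaceholderToken(token));
```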
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -694,7 +694,7 @@ jobs:
          EOF

  checks-node-core-test:
-    name: checks-node-core-test
+    name: checks-node-core
    needs: [preflight, checks-node-core-test-shard]
    if: always() && needs.preflight.outputs.run_checks == 'true'
    runs-on: blacksmith-16vcpu-ubuntu-2404
@@ -894,6 +894,11 @@ jobs:
        continue-on-error: true
        run: pnpm check:import-cycles

+      - name: Run madge import cycle guard
+        id: madge_import_cycles
+        continue-on-error: true
+        run: pnpm check:madge-import-cycles
+
      - name: Upload gateway watch regression artifacts
        if: always()
        uses: actions/upload-artifact@v7
@@ -927,6 +932,7 @@ jobs:
          CONTROL_UI_I18N_OUTCOME: ${{ steps.control_ui_i18n.outcome == 'skipped' && 'success' || steps.control_ui_i18n.outcome }}
          GATEWAY_WATCH_REGRESSION_OUTCOME: ${{ steps.gateway_watch_regression.outcome }}
          IMPORT_CYCLES_OUTCOME: ${{ steps.import_cycles.outcome }}
+          MADGE_IMPORT_CYCLES_OUTCOME: ${{ steps.madge_import_cycles.outcome }}
        run: |
          failures=0
          for result in \
@@ -951,7 +957,8 @@ jobs:
            "lint:ui:no-raw-window-open|$NO_RAW_WINDOW_OPEN_OUTCOME" \
            "ui:i18n:check|$CONTROL_UI_I18N_OUTCOME" \
            "gateway-watch-regression|$GATEWAY_WATCH_REGRESSION_OUTCOME" \
-            "check:import-cycles|$IMPORT_CYCLES_OUTCOME"; do
+            "check:import-cycles|$IMPORT_CYCLES_OUTCOME" \
+            "check:madge-import-cycles|$MADGE_IMPORT_CYCLES_OUTCOME"; do
            name="${result%%|*}"
            outcome="${result#*|}"
            if [ "$outcome" != "success" ]; then
--- a/.github/workflows/install-smoke.yml
+++ b/.github/workflows/install-smoke.yml
@@ -194,6 +194,13 @@ jobs:
          push: false
          provenance: false

+      - name: Setup Node environment for local pack smoke
+        uses: ./.github/actions/setup-node-env
+        with:
+          install-bun: "false"
+          install-deps: "true"
+          use-sticky-disk: "false"
+
      - name: Run installer docker tests
        env:
          OPENCLAW_INSTALL_URL: https://openclaw.ai/install.sh
--- a/.github/workflows/openclaw-npm-release.yml
+++ b/.github/workflows/openclaw-npm-release.yml
@@ -493,6 +493,7 @@ jobs:
          RELEASE_VERSION: ${{ env.RELEASE_VERSION }}
        run: |
          set -euo pipefail
+          printf '//registry.npmjs.org/:_authToken=%s\n' "${NODE_AUTH_TOKEN}" > "${HOME}/.npmrc"
          npm whoami >/dev/null
          npm dist-tag add "openclaw@${RELEASE_VERSION}" latest
          promoted_latest="$(npm view openclaw dist-tags.latest)"
--- a/.oxfmtrc.jsonc
+++ b/.oxfmtrc.jsonc
@@ -20,6 +20,7 @@
    "pnpm-lock.yaml/",
    "src/gateway/server-methods/CLAUDE.md",
    "src/auto-reply/reply/export-html/",
+    "src/canvas-host/a2ui/a2ui.bundle.js",
    "Swabble/",
    "vendor/",
  ],
--- a/.oxlintrc.json
+++ b/.oxlintrc.json
@@ -8,23 +8,23 @@
  },
  "rules": {
    "curly": "error",
-    "eslint-plugin-unicorn/prefer-array-find": "off",
+    "eslint-plugin-unicorn/prefer-array-find": "error",
    "eslint/no-await-in-loop": "off",
-    "eslint/no-new": "off",
+    "eslint/no-new": "error",
    "eslint/no-shadow": "off",
-    "eslint/no-unmodified-loop-condition": "off",
-    "eslint-plugin-unicorn/prefer-set-size": "off",
-    "oxc/no-accumulating-spread": "off",
+    "eslint/no-unmodified-loop-condition": "error",
+    "eslint-plugin-unicorn/prefer-set-size": "error",
+    "oxc/no-accumulating-spread": "error",
    "oxc/no-async-endpoint-handlers": "off",
    "oxc/no-map-spread": "off",
    "typescript/consistent-return": "error",
    "typescript/no-explicit-any": "error",
-    "typescript/no-extraneous-class": "off",
-    "typescript/no-unnecessary-type-conversion": "off",
+    "typescript/no-extraneous-class": "error",
+    "typescript/no-unnecessary-type-conversion": "error",
    "typescript/no-unsafe-type-assertion": "off",
    "unicorn/consistent-function-scoping": "off",
-    "unicorn/prefer-set-size": "off",
-    "unicorn/require-post-message-target-origin": "off"
+    "unicorn/prefer-set-size": "error",
+    "unicorn/require-post-message-target-origin": "error"
  },
  "ignorePatterns": [
    "assets/",
--- a/.vscode/settings.json
+++ b/.vscode/settings.json
@@ -17,6 +17,5 @@
  "typescript.preferences.importModuleSpecifierEnding": "js",
  "typescript.reportStyleChecksAsWarnings": false,
  "typescript.updateImportsOnFileMove.enabled": "always",
-  "typescript.tsdk": "node_modules/typescript/lib",
-  "typescript.experimental.useTsgo": true
+  "typescript.tsdk": "node_modules/typescript/lib"
 }
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -30,11 +30,16 @@
  - `src/plugins/*` = plugin discovery, manifest validation, loader, registry, and contract enforcement
  - `src/gateway/protocol/*` = typed Gateway control-plane and node wire protocol
 - Progressive disclosure lives in local boundary guides:
-  - bundled-plugin-tree `AGENTS.md`
+  - repo root `AGENTS.md`
+  - bundled-plugin-tree `extensions/AGENTS.md`
  - `src/plugin-sdk/AGENTS.md`
  - `src/channels/AGENTS.md`
  - `src/plugins/AGENTS.md`
  - `src/gateway/protocol/AGENTS.md`
+- Workflow hygiene:
+  - Do not grep or existence-check every `docs/*.md`, `AGENTS.md`, or guide path mentioned in this file before starting work.
+  - Read only the guides and docs that are directly relevant to the files or boundary you are touching.
+  - Only do full broken-link or missing-guide sweeps when the task is explicitly about docs or repo-instruction maintenance.
 - Plugin and extension boundary:
  - Public docs: `docs/plugins/building-plugins.md`, `docs/plugins/architecture.md`, `docs/plugins/sdk-overview.md`, `docs/plugins/sdk-entrypoints.md`, `docs/plugins/sdk-runtime.md`, `docs/plugins/manifest.md`, `docs/plugins/sdk-channel-plugins.md`, `docs/plugins/sdk-provider-plugins.md`
  - Definition files: `src/plugin-sdk/plugin-entry.ts`, `src/plugin-sdk/core.ts`, `src/plugin-sdk/provider-entry.ts`, `src/plugin-sdk/channel-contract.ts`, `scripts/lib/plugin-sdk-entrypoints.json`, `package.json`
@@ -68,7 +73,7 @@
  - `hooks.internal.entries` is the canonical public hook config model. `hooks.internal.handlers` is compatibility-only input and must not be re-exposed in public schema/help/baseline surfaces.
 - Bundled plugin contract boundary:
  - Public docs: `docs/plugins/architecture.md`, `docs/plugins/manifest.md`, `docs/plugins/sdk-overview.md`
- Definition files: `src/plugins/contracts/registry.ts`, `src/plugins/types.ts`, `src/plugins/public-artifacts.ts`
+  - Definition files: `src/plugins/contracts/registry.ts`, `src/plugins/types.ts`, `src/plugins/public-artifacts.ts`
  - Rule: keep manifest metadata, runtime registration, public SDK exports, and contract tests aligned. Do not create a hidden path around the declared plugin interfaces.
 - Extension test boundary:
  - Keep extension-owned onboarding/config/provider coverage under the owning bundled plugin package when feasible.
@@ -76,37 +81,25 @@
  - Shared helpers under `test/helpers/**` are part of that same boundary. Do not hardcode repo-relative `extensions/**` imports there, and do not keep plugin-local deep mocks in shared helpers just because multiple tests use them.
  - When core tests or shared helpers need bundled plugin public surfaces, use `src/test-utils/bundled-plugin-public-surface.ts` for `api.ts`, `runtime-api.ts`, `contract-api.ts`, `test-api.ts`, plugin entrypoint `index.js`, and resolved module ids for dynamic import or mocking.
  - If a core test is asserting extension-specific behavior instead of a generic contract, move it to the owning extension package.
+- Scoped guides still matter:
+  - `extensions/AGENTS.md` expands extension/plugin boundary rules.
+  - `src/channels/AGENTS.md` expands core channel boundary and hot-path rules.
+  - `src/plugin-sdk/AGENTS.md` expands public SDK contract rules.
+  - `src/plugins/AGENTS.md` expands plugin loading, registry, and manifest rules.
+  - `src/gateway/protocol/AGENTS.md` expands typed Gateway protocol rules.
+  - `test/helpers/AGENTS.md` and `test/helpers/channels/AGENTS.md` expand shared test helper boundary rules.
+- Plugin architecture direction:
+  - Keep a manifest-first control plane: discovery, validation, enablement, setup hints, and activation planning should stay metadata-driven by default.
+  - Keep runtime execution separate: actual provider/channel/tool execution should resolve through narrow targeted loaders, not broad registry materialization.
+  - Host loads plugins; plugins do not load host internals. Prefer a small versioned host/kernel seam plus documented SDK entrypoints over ambient reachability.
+  - Treat broad runtime registries and mutable global plugin state as transitional compatibility surfaces, not the target architecture.
+  - If a setup or config flow truly needs plugin runtime, make that explicit instead of silently importing runtime code on the cold path.

-## Docs Linking (Mintlify)
+## Scoped Workflow Guides

- Docs are hosted on Mintlify (docs.openclaw.ai).
- Internal doc links in `docs/**/*.md`: root-relative, no `.md`/`.mdx` (example: `[Config](/configuration)`).
- When working with documentation, read the mintlify skill.
- For docs, UI copy, and picker lists, order services/providers alphabetically unless the section is explicitly describing runtime behavior (for example auto-detection or execution order).
- Section cross-references: use anchors on root-relative paths (example: `[Hooks](/configuration#hooks)`).
- Doc headings and anchors: avoid em dashes and apostrophes in headings because they break Mintlify anchor links.
- When the user asks for links, reply with full `https://docs.openclaw.ai/...` URLs (not root-relative).
- When you touch docs, end the reply with the `https://docs.openclaw.ai/...` URLs you referenced.
- README (GitHub): keep absolute docs URLs (`https://docs.openclaw.ai/...`) so links work on GitHub.
- Docs content must be generic: no personal device names/hostnames/paths; use placeholders like `user@gateway-host` and “gateway host”.
-
-## Docs i18n (generated publish locales)
-
- Foreign-language docs are not maintained in this repo. The generated publish output lives in the separate `openclaw/docs` repo (often cloned locally as the sibling `openclaw-docs` directory); do not add or edit localized docs under `docs/<locale>/**` here.
- Those localized docs are autogenerated. Treat this repo's English docs plus glossary files as the source of truth, and let the publish/translation pipeline update `openclaw/docs`.
- Pipeline: update English docs here → adjust the matching `docs/.i18n/glossary.<locale>.json` entries → let the publish-repo sync + `scripts/docs-i18n` run in `openclaw/docs` / local `openclaw-docs` clone → apply targeted fixes only if instructed.
- Before rerunning `scripts/docs-i18n`, add glossary entries for any new technical terms, page titles, or short nav labels that must stay in English or use a fixed translation (for example `Doctor` or `Polls`).
- `pnpm docs:check-i18n-glossary` enforces glossary coverage for changed English doc titles and short internal doc labels before translation reruns.
- Translation memory lives in generated `docs/.i18n/*.tm.jsonl` files in the publish repo.
- See `docs/.i18n/README.md`.
- The pipeline can be slow/inefficient; if it’s dragging, ping @jospalmbier on Discord instead of hacking around it.
-
-## Control UI i18n (generated in repo)
-
- Control UI foreign-language locale bundles are generated in this repo; do not hand-edit `ui/src/i18n/locales/*.ts` for non-English locales or `ui/src/i18n/.i18n/*` unless a targeted generated-output fix is explicitly requested.
- Source of truth is `ui/src/i18n/locales/en.ts` plus the generator/runtime wiring in `scripts/control-ui-i18n.ts`, `ui/src/i18n/lib/types.ts`, and `ui/src/i18n/lib/registry.ts`.
- Pipeline: update English control UI strings and locale wiring here → run `pnpm ui:i18n:sync` (or let `Control UI Locale Refresh` do it) → commit the regenerated locale bundles and `.i18n` metadata.
- If the control UI locale outputs drift, regenerate them; do not manually translate or hand-maintain the generated locale files by default.
+- `docs/AGENTS.md` owns Mintlify docs, docs links, and docs i18n rules.
+- `ui/AGENTS.md` owns Control UI i18n and generated locale rules.
+- `scripts/AGENTS.md` owns script-runner, local-check lock, and test/lint wrapper rules.

 ## exe.dev VM ops (general)

@@ -186,6 +179,7 @@
 - New runtime control-flow code should not branch on `error: string` or `reason: string` when a closed code union would be reasonable.
 - Dynamic import guardrail: do not mix `await import("x")` and static `import ... from "x"` for the same module in production code paths. If you need lazy loading, create a dedicated `*.runtime.ts` boundary (that re-exports from `x`) and dynamically import that boundary from lazy callers only.
 - Dynamic import verification: after refactors that touch lazy-loading/module boundaries, run `pnpm build` and check for `[INEFFECTIVE_DYNAMIC_IMPORT]` warnings before submitting.
+- Circular dependencies: keep both `pnpm check:import-cycles` and `pnpm check:madge-import-cycles` green; do not reintroduce runtime import cycles or madge-detected import loops.
 - Extension SDK self-import guardrail: inside an extension package, do not import that same extension via `openclaw/plugin-sdk/<extension>` from production files. Route internal imports through a local barrel such as `./api.ts` or `./runtime-api.ts`, and keep the `plugin-sdk/<extension>` path as the external contract only.
 - Extension package boundary guardrail: inside a bundled plugin package, do not use relative imports/exports that resolve outside that same package root. If shared code belongs in the plugin SDK, import `openclaw/plugin-sdk/<subpath>` instead of reaching into `src/plugin-sdk/**` or other repo paths via `../`.
 - Extension API surface rule: `openclaw/plugin-sdk/<subpath>` is the only public cross-package contract for extension-facing SDK code. If an extension needs a new seam, add a public subpath first; do not reach into `src/plugin-sdk/**` by relative path.
@@ -315,7 +309,7 @@
  - Only ask when changes are semantic (logic/data/behavior).
 - **Multi-agent safety:** focus reports on your edits; avoid guard-rail disclaimers unless truly blocked; when multiple agents touch the same file, continue if safe; end with a brief “other files present” note only if relevant.
 - Bug investigations: read source code of relevant npm dependencies and all related local code before concluding; aim for high-confidence root cause.
- Code style: add brief comments for tricky logic; keep files under ~500 LOC when feasible (split/refactor as needed).
+- Code style: add brief comments for tricky logic; keep files under ~700 LOC when feasible (split/refactor as needed).
 - Tool schema guardrails (google-antigravity): avoid `Type.Union` in tool input schemas; no `anyOf`/`oneOf`/`allOf`. Use `stringEnum`/`optionalStringEnum` (Type.Unsafe enum) for string lists, and `Type.Optional(...)` instead of `... | null`. Keep top-level tool schema as `type: "object"` with `properties`.
 - Tool schema guardrails: avoid raw `format` property names in tool schemas; some validators treat `format` as a reserved keyword and reject the schema.
 - Never send streaming/partial replies to external messaging surfaces (WhatsApp, Telegram); only final replies should be delivered there. Streaming/tool events may still go to internal UIs/control channel.
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -2,48 +2,146 @@

 Docs: https://docs.openclaw.ai

-## Unreleased
+## 2026.4.12

 ### Changes

- Memory/Active Memory: add a new optional Active Memory plugin that gives OpenClaw a dedicated memory sub-agent right before the main reply, so ongoing chats can automatically pull in relevant preferences, context, and past details without making users remember to manually say "remember this" or "search memory" first. Includes configurable message/recent/full context modes, live `/verbose` inspection, advanced prompt/thinking overrides for tuning, and opt-in transcript persistence for debugging. Docs: https://docs.openclaw.ai/concepts/active-memory. (#63286) Thanks @Takhoffman.
- macOS/Talk: add an experimental local MLX speech provider for Talk Mode, with explicit provider selection, local utterance playback, interruption handling, and system-voice fallback. (#63539) Thanks @ImLukeF.
- CLI/exec policy: add a local `openclaw exec-policy` command with `show`, `preset`, and `set` subcommands for synchronizing requested `tools.exec.*` config with the local exec approvals file, plus follow-up hardening for node-host rejection, rollback safety, and sync conflict detection. (#64050)
- Gateway: add a `commands.list` RPC so remote gateway clients can discover runtime-native, text, skill, and plugin commands with surface-aware naming and serialized argument metadata. (#62656) Thanks @samzong.
- Models/providers: add per-provider `models.providers.*.request.allowPrivateNetwork` for trusted self-hosted OpenAI-compatible endpoints, keep the opt-in scoped to model request surfaces, and refresh cached WebSocket managers when request transport overrides change. (#63671) Thanks @qas.
- QA/testing: add a `--runner multipass` lane for `openclaw qa suite` so repo-backed QA scenarios can run inside a disposable Linux VM and write back the usual report, summary, and VM logs. (#63426) Thanks @shakkernerd.
- Docs i18n: chunk raw doc translation, reject truncated tagged outputs, avoid ambiguous body-only wrapper unwrapping, and recover from terminated Pi translation sessions without changing the default `openai/gpt-5.4` path. (#62969, #63808) Thanks @hxy91819.
- Control UI/dreaming: simplify the Scene and Diary surfaces, preserve unknown phase state for partial status payloads, and stabilize waiting-entry recency ordering so Dreaming status and review lists stay clear and deterministic. (#64035) Thanks @davemorin.
- Gateway: split startup and runtime seams so gateway lifecycle sequencing, reload state, and shutdown behavior stay easier to maintain without changing observed behavior. (#63975) Thanks @gumadeiras.
- Matrix/partial streaming: add MSC4357 live markers to draft preview sends and edits so supporting Matrix clients can render a live/typewriter animation and stop it when the final edit lands. (#63513) Thanks @TigerInYourDream.
- QA/Telegram: add a live `openclaw qa telegram` lane for private-group bot-to-bot checks, harden its artifact handling, and preserve native Telegram command reply threading for QA verification. (#64303) Thanks @obviyus.
- Models/Codex: add the bundled Codex provider and plugin-owned app-server harness so `codex/gpt-*` models use Codex-managed auth, native threads, model discovery, and compaction while `openai/gpt-*` stays on the normal OpenAI provider path. (#64298) Thanks @steipete.
- Agents: add an opt-in strict-agentic embedded Pi execution contract for GPT-5-family runs so plan-only or filler turns keep acting until they hit a real blocker. (#64241) Thanks @100yenadmin.
+- Plugins/loading: narrow CLI, provider, and channel activation to manifest-declared needs so startup, command discovery, and runtime activation avoid loading unrelated plugin runtime. (#65120, #65259, #65429)
+- Memory/active-memory: default QMD recall to search and surface better search-path telemetry so memory-backed recall works more predictably out of the box. (#65068) Thanks @Takhoffman.
+- Docs/providers: expand bundled provider docs with richer capability, env-var, and setup guidance across provider pages.
+- Docs/memory-wiki: add the recommended QMD + bridge-mode hybrid recipe plus zero-artifact troubleshooting guidance for `memory-wiki` bridge setups. (#63165) Thanks @sercada and @vincentkoc.

 ### Fixes

- CLI/WhatsApp media sends: route gateway-mode outbound sends with `--media` through the channel `sendMedia` path and preserve media access context, so WhatsApp document and attachment sends stop silently dropping the file while still delivering the caption. (#64478) Thanks @ShionEria.
- fix(nostr): require operator.admin scope for profile mutation routes [AI]. (#63553) Thanks @pgondhi987.
- Gateway/startup: keep WebSocket RPC available while channels and plugin sidecars start, hold `chat.history` unavailable until startup sidecars finish so synchronous history reads cannot stall startup (reported in #63450), refresh advertised gateway methods after deferred plugin reloads, and enforce the pre-auth WebSocket upgrade budget before the no-handler 503 path so upgrade floods cannot bypass connection limits during that window. (#63480) Thanks @neeravmakwana.
+- CLI/update: respawn tracked plugin refresh from the updated entrypoint after package self-updates so `openclaw update` stops failing on stale hashed `dist/install.runtime-*.js` chunk imports. (#65471)
+- Memory/active-memory: keep recall runs on the resolved channel when wrappers like `mx-claw` are enabled, improve lexical fallback ranking, and keep lexical boosts out of hybrid search so recall finds the right memories more consistently. (#65049, #65395) Thanks @Takhoffman.
+- Dreaming: consume managed heartbeat events exactly once, stage light-sleep confidence from all recorded short-term signals, wake scheduled jobs immediately, raise dreaming-only promotion enough to cross the durable-memory gate, and stop dreaming from re-ingesting its own narrative transcripts.
+- Dreaming/narrative: harden transient narrative cleanup by retrying timed-out deletes, scrubbing stale dreaming session artifacts through the lock-aware session-store path, and isolating transient narrative session keys per workspace. (#65320, #61674)
+- Memory/wiki: preserve Unicode letters, digits, and combining marks in wiki slugs and contradiction clustering, and cap Unicode filename segments to safe byte lengths so non-ASCII titles stop collapsing or overflowing path limits. (#64742) Thanks @zhouhe-xydt.
+- UI/WebChat: hide synthetic transcript-repair tool results from chat history reloads so internal recovery markers do not leak into visible chat after reconnects. (#65247) Thanks @wangwllu.
+- WhatsApp/outbound: fall back to the first `mediaUrls` entry when `mediaUrl` is empty so gateway media sends stop silently dropping attachments that already have a resolved media list. (#64394) Thanks @eric-fr4 and @vincentkoc.
+- Doctor/Discord: stop `openclaw doctor --fix` from rewriting legacy Discord preview-streaming config into the nested modern shape, so downgrades can still recover without hand-editing `channels.discord.streaming`. (#65035) Thanks @vincentkoc.
+- Gateway/auth: blank the shipped example gateway credential in `.env.example` and fail startup when a copied placeholder token or password is still configured, so operators cannot accidentally launch with a publicly known secret. (#64586) Thanks @navarrotech and @vincentkoc.
+- Memory/active-memory+dreaming: keep active-memory recall runs on the strongest resolved channel, consume managed dreaming heartbeat events exactly once, stop dreaming from re-ingesting its own narrative transcripts, and add explicit repair/dedupe recovery flows in CLI, doctor, and the Dreams UI.
+- Agents/queueing: carry orphaned active-turn user text into the next prompt before repairing transcript ordering, so follow-up messages that arrive mid-run are no longer silently dropped. (#65388) Thanks @adminfedres and @vincentkoc.
+- Gateway/keepalive: stop marking WebSocket tick broadcasts as droppable so slow or backpressured clients do not self-disconnect with `tick timeout` while long-running work is still alive. (#65256) Thanks @100yenadmin and @vincentkoc.
+- Matrix/mentions: keep room mention gating strict while accepting visible `@displayName` Matrix URI labels, so `requireMention` works for non-OpenClaw Matrix clients again. (#64796) Thanks @hclsys.
+- Doctor: warn when on-disk agent directories still exist under `~/.openclaw/agents/<id>/agent` but the matching `agents.list[]` entries are missing from config. (#65113) Thanks @neeravmakwana.
+- Telegram: route approval button callback queries onto a separate sequentializer lane so plugin approval clicks can resolve immediately instead of deadlocking behind the blocked agent turn. (#64979) Thanks @nk3750.
+- Telegram/direct sessions: keep commentary-only assistant fallback payloads out of visible direct delivery, so Codex planning chatter cannot leak into Telegram DMs when a run has no `final_answer` text.
+- Gateway/keepalive: stop marking WebSocket tick broadcasts as droppable so slow or backpressured clients do not self-disconnect with `tick timeout` while long-running work is still alive. (#65436)
+- Gateway/plugins: always send a non-empty `idempotencyKey` for plugin subagent runs, so dreaming narrative jobs stop failing gateway schema validation. (#65354) Thanks @CodeForgeNet.
+- Gateway/auth: blank the shipped example gateway credential in `.env.example` and fail startup when a copied placeholder token or password is still configured, so operators cannot accidentally launch with a publicly known secret. (#64586) Thanks @navarrotech.
+- Plugins/memory-core dreaming: keep bundled `memory-core` loaded alongside an explicit external memory slot owner only when that owner enables dreaming, while preserving `plugins.slots.memory = "none"` disable semantics. (#65411) Thanks @pradeep7127.
+- Doctor/Discord: stop `openclaw doctor --fix` from rewriting legacy Discord preview-streaming config into the nested modern shape, so downgrades can still recover without hand-editing `channels.discord.streaming`.
+- Doctor: warn when on-disk agent directories still exist under `~/.openclaw/agents/<id>/agent` but the matching `agents.list[]` entries are missing from config. (#65113) Thanks @neeravmakwana.
+- CLI/plugins: honor `memory-wiki` when `plugins.allow` is set for `openclaw wiki`, and pass the active app config into the metadata registrar so plugin-owned wiki commands resolve the live plugin config instead of falling back to defaults. (#64779, #65012)
+- QA/packaging: stop packaged QA helpers from crashing when optional scenario execution config is unavailable, so npm distributions can skip the repo-only scenario pack without breaking completion-cache and startup paths. (#65118) Thanks @EdderTalmor.
+- Media/audio transcription: surface the real provider failure when every audio transcription attempt fails, so status output and the CLI stop collapsing those errors into generic skips. (#65096) Thanks @l0cka.
+- Infra/net: fix multipart FormData fields (including `model`) being silently dropped when a guarded runtime fetch body crosses a FormData implementation boundary, restoring OpenAI audio transcription requests that failed with HTTP 400. (#64349) Thanks @petr-sloup.
+- Dreaming/diary: use the host local timezone for diary timestamps when `dreaming.timezone` is unset, and include the timezone abbreviation so `DREAMS.md` and the UI make local or UTC time explicit. (#65034, #65057)
+- Dreaming/promotion: raise phase reinforcement enough for repeated dreaming-only revisits to clear the default durable-memory gate after multiple days, instead of stalling just below the score threshold. (#64068) Thanks @vincentkoc.
+- Dreaming/light-sleep: compute staged candidate confidence from all recorded short-term signals instead of recall-only counts, so dreaming-only entries stop rendering as `confidence: 0.00`. (#64599) Thanks @vincentkoc.
+- Plugins/memory: restore cached memory capability public artifacts on plugin-registry cache hits so memory-backed artifact surfaces stay visible after warm loads.
+- Gateway/cron: preserve requested isolated-agent config across runtime reloads so subagent jobs and heartbeat overrides keep the right workspace and heartbeat settings when the hot-loaded snapshot is stale.
+- Cron/isolated sessions: persist the right transcript path for each isolated run, including fresh session rollovers, so cron runs stop appending to stale session files.
+- Discord/gateway: clear stale heartbeat timers before reconnecting so zombie gateway callbacks cannot crash the process and drop in-flight replies. (#65009) Thanks @SARAMALI15792.
+- Matrix/mentions: keep room mention gating strict while accepting visible `@displayName` Matrix URI labels, so `requireMention` works for non-OpenClaw Matrix clients again. (#64796) Thanks @hclsys.
+- Agents/Anthropic replay: preserve immutable signed-thinking replay safety across stored and live reruns, keep non-thinking embedded `tool_result` user blocks intact, and drop conflicting preserved tool IDs before validation so retries stop degrading into omitted tool calls. (#65126) Thanks @shakkernerd.
+- Memory/QMD: allow channel sessions in the shipped default QMD scope, while still denying groups.
+- Memory/QMD: stop registering the legacy lowercase root memory file as a separate default collection, so QMD now prefers `MEMORY.md` and the `memory/` tree without duplicate collection-add warnings.
+- Memory/memory-core: watch the `memory` directory directly and ignore non-markdown churn so nested note changes still sync on macOS + Node 25 environments where recursive `memory/**/*.md` glob watching fails. (#64711) Thanks @jasonxargs-boop and @vincentkoc.
+
+## 2026.4.11
+
+### Changes
+
+- Dreaming/memory-wiki: add ChatGPT import ingestion plus new `Imported Insights` and `Memory Palace` diary subtabs so Dreaming can inspect imported source chats, compiled wiki pages, and full source pages directly from the UI. (#64505)
+- Control UI/webchat: render assistant media/reply/voice directives as structured chat bubbles, add the `[embed ...]` rich output tag, and gate external embed URLs behind config. (#64104)
+- Tools/video_generate: add URL-only generated asset delivery, typed `providerOptions`, reference audio inputs, per-asset role hints, `adaptive` aspect-ratio support, and a higher image-input cap so video providers can expose richer generation modes without forcing large files into memory. (#61987, #61988) Thanks @xieyongliang.
+- Feishu: improve document comment sessions with richer context parsing, comment reactions, and typing feedback so document-thread conversations behave more like chat conversations. (#63785)
+- Microsoft Teams: add reaction support, reaction listing, Graph pagination, and delegated OAuth setup for sending reactions while preserving application-auth read paths. (#51646)
+- Plugins: allow plugin manifests to declare activation and setup descriptors so plugin setup flows can describe required auth, pairing, and configuration steps without hardcoded core special cases. (#64780)
+- Ollama: cache `/api/show` context-window and capability metadata during model discovery so repeated picker refreshes stop refetching unchanged models, while still retrying after empty responses and invalidating on digest changes. (#64753) Thanks @ImLukeF.
+- Models/providers: surface how configured OpenAI-compatible endpoints are classified in embedded-agent debug logs, so local and proxy routing issues are easier to diagnose. (#64754) Thanks @ImLukeF.
+- QA/parity: add the GPT-5.4 vs Opus 4.6 agentic parity report gate with shared scenario coverage checks, stricter evidence heuristics, and skipped-scenario accounting for maintainer review. (#64441) Thanks @100yenadmin.
+
+### Fixes
+
+- Windows/onboarding: open provider OAuth and sign-in URLs with `explorer.exe` instead of routing them through `cmd /c start`, so quoted provider URLs cannot break out into host command execution. (#64161) Thanks @coygeek and @vincentkoc.
+- OpenAI/Codex OAuth: stop rewriting the upstream authorize URL scopes so new Codex sign-ins do not fail with `invalid_scope` before returning an authorization code. (#64713) Thanks @fuller-stack-dev.
+- Audio transcription: disable pinned DNS only for OpenAI-compatible multipart requests, while still validating hostnames, so OpenAI, Groq, and Mistral transcription works again without weakening other request paths. (#64766) Thanks @GodsBoy.
+- macOS/Talk Mode: after granting microphone permission on first enable, continue starting Talk Mode instead of requiring a second toggle. (#62459) Thanks @ggarber.
+- Control UI/webchat: persist agent-run TTS audio replies into webchat history and preserve interleaved tool card pairing so generated audio and mixed tool output stay attached to the right messages. (#63514) Thanks @bittoby.
+- WhatsApp: honor the configured default account when the active listener helper is used without an explicit account id, so named default accounts do not get registered under `default`. (#53918) Thanks @yhyatt.
+- ACP/agents: suppress commentary-phase child assistant relay text in ACP parent stream updates, so spawned child runs stop leaking internal progress chatter into the parent session. Thanks @vincentkoc.
+- Agents/timeouts: honor explicit run timeouts in the LLM idle watchdog and align default timeout config so slow models can keep working until the configured limit instead of using the wrong idle window.
+- Config: include `asyncCompletion` in the generated zod schema so documented async completion config no longer fails with an unrecognized-key error. (#63618)
+- Google/Veo: stop sending the unsupported `numberOfVideos` request field so Gemini Developer API Veo runs do not fail before OpenClaw can complete the intended Google video generation path. (#64723) Thanks @velvet-shark.
+- QA/packaging: stop packaged CLI startup and completion cache generation from reading repo-only QA scenario markdown, ship the bundled QA scenario pack in npm releases, and keep `openclaw completion --write-state` working even if QA setup is broken. (#64648) Thanks @obviyus.
+- Codex/QA: keep Codex app-server coordination chatter out of visible replies, add a live QA leak scenario, and classify leaked harness meta text as a QA failure instead of a successful reply. Thanks @vincentkoc.
+- WhatsApp: route `message react` through the gateway-owned action path so reactions use the live WhatsApp listener in both DM and group chats, matching `message send` and `message poll`. Thanks @mcaxtr.
+- Auto-reply/WhatsApp: preserve inbound image attachment notes after media understanding so image edits keep the real saved media path instead of hallucinating a missing local path. (#64918) Thanks @ngutman.
+- Telegram/sessions: keep topic-scoped session initialization on the canonical topic transcript path when inbound turns omit `MessageThreadId`, so one topic session no longer alternates between bare and topic-qualified transcript files. (#64869) Thanks @jalehman.
+- Agents/failover: scope assistant-side fallback classification and surfaced provider errors to the current attempt instead of stale session history, so cross-provider fallback runs stop inheriting the previous provider's failure. (#62907) Thanks @stainlu.
+- MiniMax/OAuth: write `api: "anthropic-messages"` and `authHeader: true` into the `minimax-portal` config patch during `openclaw configure`, so re-authenticated portal setups keep Bearer auth routing working. (#64964) Thanks @ryanlee666.
+
+## 2026.4.10
+
+### Changes
+
+- Models/Codex: add the bundled Codex provider and plugin-owned app-server harness so `codex/gpt-*` models use Codex-managed auth, native threads, model discovery, and compaction while `openai/gpt-*` stays on the normal OpenAI provider path. (#64298)
+- Memory/Active Memory: add a new optional Active Memory plugin that gives OpenClaw a dedicated memory sub-agent right before the main reply, so ongoing chats can automatically pull in relevant preferences, context, and past details without making users remember to manually say "remember this" or "search memory" first. Includes configurable message/recent/full context modes, live `/verbose` inspection, advanced prompt/thinking overrides for tuning, and opt-in transcript persistence for debugging. Docs: https://docs.openclaw.ai/concepts/active-memory. (#63286) Thanks @Takhoffman.
+- macOS/Talk: add an experimental local MLX speech provider for Talk Mode, with explicit provider selection, local utterance playback, interruption handling, and system-voice fallback. (#63539) Thanks @ImLukeF.
+- Tools/video generation: add Seedance 2.0 model refs to the bundled fal provider and submit the provider-specific duration, resolution, audio, and seed metadata fields needed for live Seedance 2.0 runs.
+- Microsoft Teams: add message actions for pin, unpin, read, react, and listing reactions. (#53432) Thanks @sudie-codes.
+- QA/Matrix: add a live `openclaw qa matrix` lane backed by a disposable Matrix homeserver, shared live-transport seams, and Matrix-specific transport coverage for threading, reactions, restart, and allowlist behavior. (#64489) Thanks @gumadeiras.
+- QA/Telegram: add a live `openclaw qa telegram` lane for private-group bot-to-bot checks, harden its artifact handling, and preserve native Telegram command reply threading for QA verification. (#64303) Thanks @obviyus.
+- QA/testing: add a `--runner multipass` lane for `openclaw qa suite` so repo-backed QA scenarios can run inside a disposable Linux VM and write back the usual report, summary, and VM logs. (#63426) Thanks @shakkernerd.
+- CLI/exec policy: add a local `openclaw exec-policy` command with `show`, `preset`, and `set` subcommands for synchronizing requested `tools.exec.*` config with the local exec approvals file, plus follow-up hardening for node-host rejection, rollback safety, and sync conflict detection. (#64050)
+- Gateway: add a `commands.list` RPC so remote gateway clients can discover runtime-native, text, skill, and plugin commands with surface-aware naming and serialized argument metadata. (#62656) Thanks @samzong.
+- Models/providers: add per-provider `models.providers.*.request.allowPrivateNetwork` for trusted self-hosted OpenAI-compatible endpoints, keep the opt-in scoped to model request surfaces, and refresh cached WebSocket managers when request transport overrides change. (#63671) Thanks @qas.
+- Feishu: standardize request user agents and register the bot as an AI agent so Feishu deployments identify OpenClaw consistently. (#63835) Thanks @evandance.
+- Docs i18n: chunk raw doc translation, reject truncated tagged outputs, avoid ambiguous body-only wrapper unwrapping, and recover from terminated Pi translation sessions without changing the default `openai/gpt-5.4` path. (#62969, #63808) Thanks @hxy91819.
+- Gateway: split startup and runtime seams so gateway lifecycle sequencing, reload state, and shutdown behavior stay easier to maintain without changing observed behavior. (#63975) Thanks @gumadeiras.
+- Control UI/webchat: normalize assistant `MEDIA:`/reply/voice directives into structured bubble rendering, rename the unreleased rich web shortcode to `[embed ...]`, and surface session runtime roots so hosted web content is written to the correct document path instead of guessed local files.
+- Matrix/partial streaming: add MSC4357 live markers to draft preview sends and edits so supporting Matrix clients can render a live/typewriter animation and stop it when the final edit lands. (#63513) Thanks @TigerInYourDream.
+- Control UI/dreaming: simplify the Scene and Diary surfaces, preserve unknown phase state for partial status payloads, and stabilize waiting-entry recency ordering so Dreaming status and review lists stay clear and deterministic. (#64035) Thanks @davemorin.
+- Agents: add an opt-in strict-agentic embedded Pi execution contract for GPT-5-family runs so plan-only or filler turns keep acting until they hit a real blocker. (#64241) Thanks @100yenadmin.
+- Agents/OpenAI: add provider-owned OpenAI/Codex tool schema compatibility and surface embedded-run replay/liveness state for long-running runs. (#64300) Thanks @100yenadmin.
+- Dreaming/memory-wiki: add ChatGPT import ingestion plus new `Imported Insights` and `Memory Palace` diary subtabs so Dreaming can inspect imported source chats, compiled wiki pages, and full source pages directly from the UI. (#64505)
+
+### Fixes
+
+- Browser/security: tighten browser and sandbox navigation defenses across strict SSRF defaults, hostname allowlists, interaction-driven redirects, subframes, CDP discovery, existing sessions, tab actions, noVNC, marker-span sanitization, and Docker CDP source-range enforcement. (#61404, #63332, #63882, #63885, #63889, #64367, #64370, #64371)
+- Security/tools: harden exec preflight reads, host env denylisting, node output boundaries, outbound host-media reads, profile-mutation authorization, plugin install dependency scanning, ACPX tool hooks, Gmail watcher token redaction, and oversized realtime WebSocket frame handling. (#62333, #62661, #62662, #63277, #63551, #63553, #63886, #63890, #63891, #64459)
+- OpenAI/Codex: add required Codex OAuth scopes, classify provider/runtime failures more clearly, stop suggesting `/elevated full` when auto-approved host exec is unavailable, add OpenAI/Codex tool-schema compatibility, and preserve embedded-run replay/liveness truth across compaction retries and mutating side effects. (#64300, #64439) Thanks @100yenadmin.
+- CLI/WhatsApp media sends: route gateway-mode outbound sends with `--media` through the channel `sendMedia` path and preserve media access context, so WhatsApp document and attachment sends stop silently dropping the file while still delivering the caption. (#64478, #64492) Thanks @ShionEria.
+- Microsoft Teams: restore media downloads for personal DMs, Bot Framework `a:` conversations, OneDrive/SharePoint shared files, and Graph-backed chat IDs; accept Bot Framework audience tokens; prevent feedback-learning filename collisions; keep long tool chains alive with typing indicators; add SSO sign-in callbacks; inject parent context for thread replies; and deliver cron announcements to Teams conversation IDs. (#54932, #55383, #55386, #58001, #58249, #58774, #59731, #60956, #62219, #62674, #63063, #63942, #63945, #63949, #63951, #63953, #64087, #64088, #64089)
 - Gateway/tailscale: start Tailscale exposure and the gateway update check before awaiting channel and plugin sidecar startup so remote operators are not locked out when startup sidecars stall.
+- Gateway/startup: keep WebSocket RPC available while channels and plugin sidecars start, hold `chat.history` unavailable until startup sidecars finish so synchronous history reads cannot stall startup (reported in #63450), refresh advertised gateway methods after deferred plugin reloads, and enforce the pre-auth WebSocket upgrade budget before the no-handler 503 path so upgrade floods cannot bypass connection limits during that window. (#63480) Thanks @neeravmakwana.
 - WhatsApp: keep inbound replies, media, composing indicators, and queued outbound deliveries attached to the current socket across reconnect gaps, including fresh retry-eligible sends after the listener comes back. (#30806, #46299, #62892, #63916) Thanks @mcaxtr.
- Microsoft Teams: restore media downloads for personal DMs, Bot Framework `a:` conversations, OneDrive/SharePoint shared files, and Graph-backed chat IDs; accept Bot Framework audience tokens; and deliver cron announcements to Teams conversation IDs. (#55383, #58001, #58249, #62219, #62674, #63063, #63942, #63951, #63953) Thanks @obviyus.
- Gateway/thread routing: preserve Slack, Telegram, Mattermost, and ACP parent-thread delivery targets so subagent, cron, and stream-relay completion messages land back in the originating thread or topic. (#54840, #57056, #63228, #63506) Thanks @yzzymt.
+- Gateway/thread routing: preserve Slack, Telegram, Mattermost, Matrix, ACP, restart-sentinel, and agent announce delivery targets so subagent, cron, stream-relay, session fallback, and restart messages land back in the originating thread, topic, or room, with canonical room casing preserved. (#54840, #57056, #63143, #63228, #63506, #64343, #64391)
+- Models/fallback: preserve `/models` selection across transient primary-model failures and config reloads, allow timeout cooldown probes, classify OpenRouter no-endpoints responses, detect llama.cpp context overflows, and keep provider/runtime context metadata stable through reloads. (#61472, #64196, #64471)
+- Agents/BTW: keep `/btw` side questions working after tool-use turns by stripping replayed tool blocks, hidden reasoning, and malformed image payloads, omitting empty tool arrays, allowing Bedrock `auth: "aws-sdk"`, and routing Feishu `/btw` plus `/stop` through bounded out-of-band lanes. (#64218, #64219, #64225, #64324) Thanks @ngutman.
+- Control UI/BTW: render `/btw` side results as dismissible ephemeral cards in the browser, send `/btw` immediately during active runs, and clear stale BTW cards on reset flows so webchat matches the intended detached side-question behavior. (#64290) Thanks @ngutman.
+- Commands/targeting: use the selected agent or session for command output, send policy, usage/cost, context reports, model lists, bash sandbox hints, BTW/compact working directories, plugin commands, and session exports so multi-agent commands describe and mutate the intended target instead of the requester.
+- Conversation bindings: normalize focused/current conversation ids, preserve binding metadata on account and Discord rebinds, avoid stale Discord lifecycle windows, and keep generic activity touches persisted so reply routing survives rebinds and restarts.
+- iMessage/self-chat: distinguish normal DM outbound rows from true self-chat using `destination_caller_id` plus chat participants, preserve multi-handle self-chat aliases, drop ambiguous reflected echoes, and strip wrapped imsg RPC text fields. (#61619, #63868, #63980, #63989, #64000) Thanks @neeravmakwana.
+- Matrix: keep multi-account room scoping consistent, keep packaged crypto migrations warning-only when appropriate, preserve ordered block streaming, add explicit Matrix block-streaming opt-in, and resolve verification/bootstrap from the packaged runtime entry. (#58449, #59249, #59266, #64373) Thanks @gumadeiras.
+- Telegram/security: tighten Telegram `allowFrom` sender validation and keep `/whoami` allowlist reporting in sync with command auth checks.
 - Agents/timeouts: extend the default LLM idle window to 120s and keep silent no-token idle timeouts on recovery paths, so slow models can retry or fall back before users see an error.
 - Gateway/agents: preserve configured model selection and richer `IDENTITY.md` content across agent create/update flows and workspace moves, and fail safely instead of silently overwriting unreadable identity files. (#61577) Thanks @samzong.
- Skills/TaskFlow: restore valid frontmatter fences for the bundled `taskflow` and `taskflow-inbox-triage` skills so they stay discoverable and loadable after updates. (#64469) Thanks @extrasmall0.
+- Skills/TaskFlow: restore valid frontmatter fences for the bundled `taskflow` and `taskflow-inbox-triage` skills and copy bundled `SKILL.md` files as hard dist-runtime copies so skills stay discoverable and loadable after updates. (#64166, #64469) Thanks @extrasmall0.
+- Skills: respect overridden home directories when loading personal skills so service, test, and custom launch environments read the intended user skill directory instead of the process home.
 - Windows/exec: settle supervisor waits from child exit state after stdout and stderr drain even when `close` never arrives, so CLI commands stop hanging or dying with forced `SIGKILL` on Windows. (#64072) Thanks @obviyus.
 - Browser/sandbox: prevent sandbox browser CDP startup hangs by recreating containers when the browser security hash changes and by waiting on the correct sandbox browser lifecycle. (#62873) Thanks @Syysean.
- iMessage/self-chat: distinguish normal DM outbound rows from true self-chat using `destination_caller_id` plus chat participants, while preserving multi-handle self-chat aliases so outbound DM replies stop looping back as inbound messages. (#61619) Thanks @neeravmakwana.
 - QQBot/streaming: make block streaming configurable per QQ bot account via `streaming.mode` (`"partial"` | `"off"`, default `"partial"`) instead of hardcoding it off, so responses can be delivered incrementally. (#63746)
 - QQBot/config: allow extra fields in `channels.qqbot` and `channels.qqbot.accounts.*` so extended qqbot builds can add new config options without gateway startup failing on schema validation. (#64075) Thanks @WideLee.
 - Dreaming/gateway: require `operator.admin` for persistent `/dreaming on|off` changes and treat missing gateway client scopes as unprivileged instead of silently allowing config writes. (#63872) Thanks @mbelinky.
- Matrix/multi-account: keep room-level `account` scoping, inherited room overrides, and implicit account selection consistent across top-level default auth, named accounts, and cached-credential env setups. (#58449) thanks @Daanvdplas and @gumadeiras.
 - Gateway/pairing: prefer explicit QR bootstrap auth over earlier Tailscale auth classification so iOS `/pair qr` silent bootstrap pairing does not fall through to `pairing required`. (#59232) Thanks @ngutman.
 - Browser/control: auto-generate browser-control auth tokens for `none` and `trusted-proxy` modes, and route browser auth/profile/doctor helpers through the public browser plugin facades. (#63280, #63957) Thanks @pgondhi987.
 - Browser/act: centralize `/act` request normalization and execution dispatch while adding stable machine-readable route-level error codes for invalid requests, selector misuse, evaluate-disabled gating, target mismatch, and existing-session unsupported actions. (#63977) Thanks @joshavant.
- Security/exec: replace script-preflight check-then-read logic with an atomic pinned-file-descriptor open, and expand the host environment denylist for dangerous runtime-control variables. (#62333, #63277) Thanks @pgondhi987.
- Security/nodes: keep `nodes` tool output paths inside the workspace boundary so model-driven node writes cannot escape the intended workspace. (#63551) Thanks @pgondhi987.
 - Security/QQBot: enforce media storage boundaries for all outbound local file paths and route image-size probes through SSRF-guarded media fetching instead of raw `fetch()`. (#63271, #63495) Thanks @pgondhi987.
 - Channel setup: ignore workspace plugin shadows when resolving trusted channel setup catalog entries so onboarding and setup flows keep using the bundled, trusted setup contract.
 - Gateway/memory startup: load the explicitly selected memory-slot plugin during gateway startup, while keeping restrictive allowlists and implicit default memory slots from auto-starting unrelated memory plugins. (#64423) Thanks @EronFan.
@@ -64,70 +162,46 @@ Docs: https://docs.openclaw.ai
 - Claude CLI: clear inherited Anthropic auth/header environment aliases before spawning Claude Code and add sanitized CLI backend auth-env diagnostics for debugging gateway-run provider selection.
 - Agents/failover: classify AbortError and stream-abort messages as timeout so Ollama NDJSON stream aborts stop showing `reason=unknown` in model fallback logs. (#58324) Thanks @yelog.
 - Fireworks/FirePass: disable Kimi K2.5 Turbo reasoning output by forcing thinking off on the FirePass path and hardening the provider wrapper so hidden reasoning no longer leaks into visible replies. (#63607) Thanks @frankekn.
- Matrix/multi-account: keep room-level `account` scoping, inherited room overrides, and implicit account selection consistent across top-level default auth, named accounts, and cached-credential env setups. (#58449) Thanks @gumadeiras.
- Matrix/runtime: resolve the verification/bootstrap runtime from a distinct packaged Matrix entry so global npm installs stop failing on crypto bootstrap with missing-module or recursive runtime alias errors. (#59249) Thanks @gumadeiras.
- Matrix/streaming: preserve ordered block flushes before tool, message, and agent boundaries, add explicit `channels.matrix.blockStreaming` opt-in so Matrix `streaming: "off"` stays final-only by default, and move MiniMax plain-text final handling into the MiniMax provider runtime instead of the shared core heuristic. (#59266) Thanks @gumadeiras.
- QQBot/streaming: make block streaming configurable per QQ bot account via `streaming.mode` (`"partial"` | `"off"`, default `"partial"`) instead of hardcoding it off, so responses can be delivered incrementally. (#63746)
 - Discord: update Carbon to v0.15.0. Thanks @thewilloftheshadow.
 - Config/Discord: coerce safe integer numeric Discord IDs to strings during config validation, keep unsafe or precision-losing numeric snowflakes rejected, and align `openclaw doctor` repair guidance with the same fail-closed behavior. (#45125) Thanks @moliendocode.
 - BlueBubbles/config: accept `enrichGroupParticipantsFromContacts` in the core strict config schema so gateways no longer fail validation or startup when the BlueBubbles plugin writes that field. (#56889) Thanks @zqchris.
 - Feishu/webhooks: read webhook bodies through the pre-auth guard so unauthenticated webhook traffic stays under the same body budget as other protected channel ingress paths.
 - Tools/web_fetch: add an opt-in `tools.web.fetch.ssrfPolicy.allowRfc2544BenchmarkRange` config so fake-IP proxy environments that resolve public sites into `198.18.0.0/15` can use `web_fetch` without weakening the default SSRF block. (#61830) Thanks @xing-xing-coder.
- Dreaming/gateway: require `operator.admin` for persistent `/dreaming on|off` changes and treat missing gateway client scopes as unprivileged instead of silently allowing config writes. (#63872) Thanks @mbelinky.
 - Dreaming/cron: reconcile managed dreaming cron from startup config and runtime lifecycle changes, but only recover managed dreaming cron state during heartbeat-triggered dreaming checks so ordinary chat traffic does not recreate removed jobs. (#63873, #63929, #63938) Thanks @mbelinky.
 - Memory/lancedb: accept `dreaming` config when `memory-lancedb` owns the memory slot so Dreaming surfaces can read slot-owner settings without schema rejection. (#63874) Thanks @mbelinky.
 - Control UI/dreaming: keep the Dreaming trace area contained and scrollable so overlays no longer cover tabs or blow out the page layout. (#63875) Thanks @mbelinky.
+- Dreaming/narrative: harden request-scoped diary fallback so scheduled dreaming only falls back on the dedicated subagent-runtime error, stop trusting spoofable raw error-code objects, and avoid leaking workspace paths when local fallback writes fail. (#64156) Thanks @mbelinky.
 - Dreaming/diary: add idempotent narrative subagent runs, preserve restrictive `DREAMS.md` permissions during atomic writes, and surface temp cleanup failures so repeated sweeps do not double-run the same narrative request or silently weaken diary safety. (#63876) Thanks @mbelinky.
 - Heartbeats/sessions: remove stale accumulated isolated heartbeat session keys when the next tick converges them back to the canonical sibling, so repaired sessions stop showing orphaned `:heartbeat:heartbeat` variants in session listings. (#59606) Thanks @rogerdigital.
 - Gateway/run cleanup: fix stale run-context TTL cleanup so the new maintenance sweep resets orphaned run sequence state and prevents unbounded run-context growth. (#52731) Thanks @artwalker.
 - UI/compaction: keep the compaction indicator in a retry-pending state until the run actually finishes, so the UI does not show `Context compacted` before compaction actually finishes. (#55132) Thanks @mpz4life.
 - Cron/tool schemas: keep cron tool schemas strict-model-friendly while still preserving `failureAlert=false`, nullable `agentId`/`sessionKey`, and flattened add/update recovery for the newly exposed cron job fields. (#55043) Thanks @brunolorente.
 - Git metadata: read commit ids from packed refs as well as loose refs so version and status metadata stay accurate after repository maintenance. (#63943)
- Gateway: keep `commands.list` skill entries categorized under tools and include provider-aware plugin `nativeName` metadata even when `scope=text`, so remote clients can group skills correctly and map text-surface plugin commands back to native aliases.
+- Gateway: keep `commands.list` skill entries categorized under tools and include provider-aware plugin `nativeName` metadata even when `scope=text`, so remote clients can group skills correctly and map text-surface plugin commands back to native aliases. (#64147)
 - TUI: reset footer activity to idle when switching sessions so a stale streaming indicator cannot persist after the selection changes. (#63988) Thanks @neeravmakwana.
- iMessage: treat `sender === chat_identifier` as self-chat only when `destination_caller_id` is present and matches the sender, fixing DM outbound rows that omit destination from being run through self-chat echo handling. (#63980) Thanks @neeravmakwana.
- Cron/Telegram: collapse isolated announce delivery to the final assistant-visible text only for Telegram targets, while preserving existing multi-message direct delivery semantics for other channels. (#63228) Thanks @welfo-beo.
- Gateway/thread routing: preserve Slack, Telegram, and Mattermost thread-child delivery targets so bound subagent completion messages land in the originating thread instead of top-level channels. (#54840) Thanks @yzzymt.
- ACP/stream relay: pass parent delivery context to ACP stream relay system events so `streamTo="parent"` updates route to the correct thread or topic instead of falling back to the main DM. (#57056) Thanks @pingren.
- Agents/sessions: preserve announce `threadId` when `sessions.list` fallback rehydrates agent-to-agent announce targets so final announce messages stay in the originating thread/topic. (#63506) Thanks @SnowSky1.
- iMessage/self-chat: remember ambiguous `sender === chat_identifier` outbound rows with missing `destination_caller_id` in self-chat dedupe state so the later reflected inbound copy still drops instead of re-entering inbound handling when the echo cache misses. Thanks @neeravmakwana.
 - Claude CLI: stop marking spawned Claude Code runs as host-managed so they keep using normal CLI subscription behavior. (#64023) Thanks @Alex-Alaniz.
- Agents/failover: classify OpenRouter `404 No endpoints found for <model>` responses as `model_not_found` so fallback chains continue past retired OpenRouter candidates. (#61472) Thanks @MonkeyLeeT.
- Browser/plugin SDK: route browser auth, profile, host-inspection, and doctor readiness helpers through browser plugin public facades so core compatibility helpers stop carrying duplicate runtime implementations. (#63957) Thanks @joshavant.
- Agents/failover: allow cooldown probes for `timeout` (including network outage classifications) so the primary model can recover after failover without a gateway restart. (#63996) Thanks @neeravmakwana.
- iMessage (imsg): strip an accidental protobuf length-delimited UTF-8 field wrapper from inbound `text` and `reply_to_text` when it fully consumes the field, fixing leading garbage before the real message. (#63868) Thanks @neeravmakwana.
+- Codex auth: brand Codex OAuth flows as OpenClaw in user-visible auth prompts and diagnostics.
 - Gateway/pairing: fail closed for paired device records that have no device tokens, and reject pairing approvals whose requested scopes do not match the requested device roles.
 - ACP/gateway chat: classify lifecycle errors before forwarding them to ACP clients so refusals use ACP's refusal stop reason while transient backend errors continue to finish as normal turns.
- Agents/BTW: strip replayed tool blocks, hidden reasoning, and malformed image payloads from `/btw` side-question context so Bedrock no-tools side questions keep working after tool-use turns. (#64225) Thanks @ngutman.
- Commands/btw: keep tool-less side questions from sending injected empty `tools` arrays on strict OpenAI-compatible providers, so `/btw` continues working after prior tool-call history. (#64219) Thanks @ngutman.
- Agents/Bedrock: let `/btw` side questions use `auth: "aws-sdk"` without a static API key so Bedrock IAM and instance-role sessions stop failing before the side question runs. (#64218) Thanks @SnowSky1.
- Feishu: route `/btw` side questions and `/stop` onto bounded out-of-band lanes so BTW no longer waits behind a busy normal chat turn while ordinary same-chat traffic stays FIFO. (#64324) Thanks @ngutman.
- Agents/failover: detect llama.cpp slot context overflows as context-overflow errors so compaction can retry self-hosted OpenAI-compatible runs instead of surfacing the raw upstream 400. (#64196) Thanks @alexander-applyinnovations.
 - Claude CLI/skills: pass eligible OpenClaw skills into CLI runs, including native Claude Code skill resolution via a temporary plugin plus per-run skill env/API key injection. (#62686, #62723) Thanks @zomars.
 - Discord: keep generated auto-thread names working with reasoning models by giving title generation enough output budget for thinking plus visible title text. (#64172) Thanks @hanamizuki.
- Heartbeat: ignore doc-only Markdown fence markers in the default `HEARTBEAT.md` template so comment-only heartbeat scaffolds skip API calls again. (#63434) Thanks @ravyg.
- Control UI/BTW: render `/btw` side results as dismissible ephemeral cards in the browser, send `/btw` immediately during active runs, and clear stale BTW cards on reset flows so webchat matches the intended detached side-question behavior. (#64290) Thanks @ngutman.
+- Heartbeat: ignore doc-only Markdown fence markers in the default `HEARTBEAT.md` template so comment-only heartbeat scaffolds skip API calls again. (#61690, #63434) Thanks @ravyg.
 - Reply/skills: keep resolved skill and memory secret config stable through embedded reply runs so raw SecretRefs in secondary skill settings no longer crash replies when the gateway already has the live env. (#64249) Thanks @mbelinky.
- Dreaming/startup: keep plugin-registered startup hooks alive across workspace hook reloads and include dreaming startup owners in the gateway startup plugin scope, so managed Dreaming cron registration comes back reliably after gateway boot. (#62327) Thanks @mbelinky.
+- Dreaming/startup: keep plugin-registered startup hooks alive across workspace hook reloads and include dreaming startup owners in the gateway startup plugin scope, so managed Dreaming cron registration comes back reliably after gateway boot. (#62327, #64258) Thanks @mbelinky.
 - Plugins: treat duplicate `registerService` calls from the same plugin id as idempotent so snapshot and activation loads no longer emit spurious `service already registered` diagnostics. (#62033, #64128) Thanks @ly85206559.
 - Discord/TTS: route auto voice replies through the native voice-note path so Discord receives Opus voice messages instead of regular audio attachments. (#64096) Thanks @LiuHuaize.
- Config/plugins: use plugin-owned command alias metadata when `plugins.allow` contains runtime command names like `dreaming`, and point users at the owning plugin instead of stale plugin-not-found guidance. (#64242) Thanks @feiskyer.
+- Config/plugins: use plugin-owned command alias metadata when `plugins.allow` contains runtime command names like `dreaming`, and point users at the owning plugin instead of stale plugin-not-found guidance. (#64191, #64242) Thanks @feiskyer.
 - Agents/Gemini: strip orphaned `required` entries from Gemini tool schemas so provider validation no longer rejects tools after schema cleanup or union flattening. (#64284) Thanks @xxxxxmax.
- Assistant text: strip Qwen-style XML tool call payloads from visible replies so web and channel messages no longer show raw `<tool_call><function=...>` output. (#64214) Thanks @MoerAI.
+- Assistant text: strip Qwen-style XML tool call payloads from visible replies so web and channel messages no longer show raw `<tool_call><function=...>` output. (#63999, #64214) Thanks @MoerAI.
 - Daemon/gateway: prevent systemd restart storms on configuration errors by exiting with `EX_CONFIG` and adding generated unit restart-prevention guards. (#63913) Thanks @neo1027144-creator.
 - Agents/exec: prevent gateway crash ("Agent listener invoked outside active run") when a subagent exec tool produces stdout/stderr after the agent run has ended or been aborted. (#62821) Thanks @openperf.
- Browser/tabs: route `/tabs/action` close/select through the same browser endpoint reachability and policy checks as list/new (including Playwright-backed remote tab operations), reject CDP HTTP redirects on probe requests, and sanitize blocked-endpoint error responses so tab list/focus/close flows fail closed without echoing raw policy details back to callers. (#63332)
 - Gateway/OpenAI compat: return real `usage` for non-stream `/v1/chat/completions` responses, emit the final usage chunk when `stream_options.include_usage=true`, and bound usage-gated stream finalization after lifecycle end. (#62986) Thanks @Lellansin.
 - Matrix/migration: keep packaged warning-only crypto migrations from being misclassified as actionable when only helper chunks are present, so startup and doctor stay on the warning-only path instead of creating unnecessary migration snapshots. (#64373) Thanks @gumadeiras.
 - Matrix/ACP thread bindings: preserve canonical room casing and parent conversation routing during ACP session spawn so mixed-case room ids bind correctly from top-level rooms and existing Matrix threads. (#64343) Thanks @gumadeiras.
 - Agents/subagents: deduplicate delivered completion announces so retry or re-entry cleanup does not inject duplicate internal-context completion turns into the parent session. (#61525) Thanks @100yenadmin.
 - Agents/exec: keep sandboxed `tools.exec.host=auto` sessions from honoring per-call `host=node` or `host=gateway` overrides while a sandbox runtime is active, and stop advertising node routing in that state so exec stays on the sandbox host. (#63880)
- Gateway/restart sentinel: route restart notices only from stored canonical delivery metadata and skip outbound guessing from lossy session keys, avoiding misdelivery on case-sensitive channels like Matrix. (#64391) Thanks @gumadeiras.
-
+- Agents/subagents: preserve archived delete-mode runs until `sessions.delete` succeeds and prevent overlapping archive sweeps from duplicating in-flight cleanup attempts. (#61801) Thanks @100yenadmin.
 - Cron/isolated agent: run scheduled agent turns as non-owner senders so owner-only tools stay unavailable during cron execution. (#63878)
- Voice Call/realtime: reject oversized realtime WebSocket frames before bridge setup so large pre-start payloads cannot crash the gateway. (#63890) Thanks @mmaps.
- Browser/sandbox: gate `/sandbox/novnc` behind bridge auth and stop surfacing sandbox observer URLs in model-visible prompt context. (#63882) Thanks @eleqtrizit.
-
 - Discord/sandbox: include `image` in sandbox media param normalization so Discord event cover images cannot bypass sandbox path rewriting. (#64377) Thanks @mmaps.
 - Agents/exec: extend exec completion detection to cover local background exec formats so the owner-downgrade fires correctly for all exec paths. (#64376) Thanks @mmaps.
 - Security/dependencies: pin axios to 1.15.0 and add a plugin install dependency denylist that blocks known malicious packages before install. (#63891) Thanks @mmaps.
@@ -137,18 +211,22 @@ Docs: https://docs.openclaw.ai
 - Browser/security: default browser SSRF policy to strict mode so unconfigured installs block private-network navigation, and align external-content marker span mapping so ZWS-injected boundary spoofs are fully sanitized. (#63885) Thanks @eleqtrizit.
 - Browser/security: apply SSRF navigation policy to subframe document navigations so iframe-targeted private-network hops are blocked without quarantining the parent page. (#64371) Thanks @eleqtrizit.
 - Hooks/security: mark agent hook system events as untrusted and sanitize hook display names before cron metadata reuse. (#64372) Thanks @eleqtrizit.
- Media/security: honor sender-scoped `toolsBySender` policy for outbound host-media reads so denied senders cannot trigger host file disclosure via attachment hydration. (#64459) Thanks @eleqtrizit.
- Browser/security: reject strict-policy hostname navigation unless the hostname is an explicit allowlist exception or IP literal, and route CDP HTTP discovery through the pinned SSRF fetch path. (#64367) Thanks @eleqtrizit.
- Plugins/ACPX: wrap plugin tools on the MCP bridge with the shared `before_tool_call` handler so block and approval hooks fire consistently across all execution paths. (#63886) Thanks @eleqtrizit.
-
- Logging/security: redact Gmail watcher `--hook-token` values from startup logging and `logs.tail` output. (#62661) Thanks @eleqtrizit.
- Models/fallback: preserve `/models` selection across transient primary-model failures and config reloads so the fallback chain no longer permanently clobbers a user-chosen model. (#64471) Thanks @hoyyeva.
-
- Sandbox/security: auto-derive CDP source-range from Docker network gateway and refuse to start the socat relay without one, so peer containers cannot reach CDP unauthenticated. (#61404) Thanks @dims.
 - Daemon/launchd: keep `openclaw gateway stop` persistent without uninstalling the macOS LaunchAgent, re-enable it on explicit restart or repair, and harden launchd label handling. (#64447) Thanks @ngutman.
- Agents/Slack: preserve threaded announce delivery when `sessions.list` rows lack stored thread metadata by falling back to the thread id encoded in the session key. (#63143) Thanks @mariosousa-finn.
 - Plugins/context engines: preserve `plugins.slots.contextEngine` through normalization and keep explicitly selected workspace context-engine plugins enabled, so loader diagnostics and plugin activation stop dropping that slot selection. (#64192) Thanks @hclsys.
 - Heartbeat: stop top-level `interval:` and `prompt:` fields outside the `tasks:` block from bleeding into the last parsed heartbeat task. (#64488) Thanks @Rahulkumar070.
+- Agents/OpenAI replay: preserve malformed function-call arguments in stored assistant history, avoid double-encoding preserved raw strings on replay, and coerce replayed string args back to objects at Anthropic and Google provider boundaries. (#61956) Thanks @100yenadmin.
+- Heartbeat/config: accept and honor `agents.defaults.heartbeat.timeoutSeconds` and per-agent heartbeat timeout overrides for heartbeat agent turns. (#64491) Thanks @cedillarack.
+- CLI/devices: make implicit `openclaw devices approve` selection preview-only and require approving the exact request ID, preventing latest-request races during device pairing. (#64160) Thanks @coygeek.
+- Media/security: honor sender-scoped `toolsBySender` policy for outbound host-media reads so denied senders cannot trigger host file disclosure via attachment hydration. (#64459) Thanks @eleqtrizit.
+- Browser/security: reject strict-policy hostname navigation unless the hostname is an explicit allowlist exception or IP literal, and route CDP HTTP discovery through the pinned SSRF fetch path. (#64367) Thanks @eleqtrizit.
+- Models/vLLM: ignore empty `tool_calls` arrays from reasoning-model OpenAI-compatible replies, reset false `toolUse` stop reasons when no actual tool calls were parsed, and stop sending `tool_choice` unless tools are present so vLLM reasoning responses no longer hang indefinitely. (#61197, #61534) Thanks @balajisiva.
+- Heartbeat/scheduling: spread interval heartbeats across stable per-agent phases derived from gateway identity, so provider traffic is distributed more uniformly across the configured interval instead of clustering around startup-relative times. (#64560) Thanks @odysseus0.
+- Config/media: accept `tools.media.asyncCompletion.directSend` in strict config validation so gateways no longer reject the generated-schema-backed async media completion setting at startup. (#63618) Thanks @qiziAI.
+- Telegram/exec: preserve delayed exec completion routing for forum topics by pinning background exec completions to the topic where the run started even if the session route later drifts. (#64580) Thanks @jalehman.
+- Agents/locks: unregister the session write-lock `exit` cleanup handler during teardown so repeated lock lifecycle resets stop stacking process listeners in long-running gateway processes. (#65391) Thanks @adminfedres and @vincentkoc.
+- CLI/Claude: rename the trusted inbound metadata schema to `openclaw.inbound_meta.v2` so Claude CLI no longer trips Anthropic's blocked `openclaw.inbound_meta.v1` filter on channel-originated turns. (#65399) Thanks @SzyMig and @vincentkoc.
+- Agents/inbound metadata: strip NUL bytes from serialized inbound context blocks before they reach backend spawn args, so malformed message metadata cannot crash agent spawn with `ERR_INVALID_ARG_VALUE`. (#65389) Thanks @adminfedres and @vincentkoc.
+
 ## 2026.4.9

 ### Changes
@@ -158,6 +236,7 @@ Docs: https://docs.openclaw.ai
 - QA/lab: add character-vibes evaluation reports with model selection and parallel runs so live QA can compare candidate behavior faster.
 - Plugins/provider-auth: let provider manifests declare `providerAuthAliases` so provider variants can share env vars, auth profiles, config-backed auth, and API-key onboarding choices without core-specific wiring.
 - iOS: pin release versioning to an explicit CalVer in `apps/ios/version.json`, keep TestFlight iteration on the same short version until maintainers intentionally promote the next gateway version, and add the documented `pnpm ios:version:pin -- --from-gateway` workflow for release trains. (#63001) Thanks @ngutman.
+- Tools/video_generate: extend the tool and the Plugin SDK with `providerOptions` (vendor-specific options forwarded as a JSON object), `inputAudios` / `audioRef` / `audioRefs` reference audio inputs, per-asset semantic role hints (`imageRoles` / `videoRoles` / `audioRoles`) using a typed `VideoGenerationAssetRole` union, a new `"adaptive"` aspect-ratio sentinel, and `maxInputAudios` provider capability declarations. Providers opt into `providerOptions` by declaring a typed `capabilities.providerOptions` schema (`{ seed: "number", draft: "boolean", ... }`); unknown keys and type mismatches cause the runtime fallback loop to skip the candidate with a visible warning and an `attempts` entry, so vendor-specific options never silently reach the wrong provider. Also raises the in-tool image input cap to 9 and updates the docs table to list all new parameters. (#61987) Thanks @xieyongliang.

 ### Fixes

@@ -196,7 +275,6 @@ Docs: https://docs.openclaw.ai
 - Control UI/models: preserve provider-qualified refs for OpenRouter catalog models whose ids already contain slashes so picker selections submit allowlist-compatible model refs instead of dropping the `openrouter/` prefix. (#63416) Thanks @sallyom.
 - Plugin SDK/command auth: split command status builders onto the lightweight `openclaw/plugin-sdk/command-status` subpath while preserving deprecated `command-auth` compatibility exports, so auth-only plugin imports no longer pull status/context warmup into CLI onboarding paths. (#63174) Thanks @hxy91819.
 - Wizard/plugin config: coerce integer-typed plugin config fields from interactive text input so integer schema values persist as numbers instead of failing validation. (#63346) Thanks @jalehman.
- Dreaming/narrative: harden request-scoped diary fallback so scheduled dreaming only falls back on the dedicated subagent-runtime error, stop trusting spoofable raw error-code objects, and avoid leaking workspace paths when local fallback writes fail. (#64156) Thanks @mbelinky.

 ## 2026.4.8

@@ -320,6 +398,9 @@ Docs: https://docs.openclaw.ai
 - Reply execution: prefer the active runtime snapshot over stale queued reply config during embedded reply and follow-up execution so SecretRef-backed reply turns stop crashing after secrets have already resolved. (#62693) Thanks @mbelinky.
 - Android/manual connect: allow blank port input only for TLS manual gateway endpoints so standard HTTPS Tailscale hosts default to `443` without silently changing cleartext manual connects. (#63134) Thanks @Tyler-RNG.
 - Matrix/agents: hide owner-only `set-profile` from embedded agent channel-action discovery so non-owner runs stop advertising profile updates they cannot execute. (#62662) Thanks @eleqtrizit.
+- iOS/gateway: replace string-matched connection error UI with structured gateway connection problems, preserve actionable pairing/auth failures over later generic disconnect noise, and surface reusable problem banners and details across onboarding, settings, and root status surfaces. (#62650) Thanks @ngutman.
+- Git/env sanitization: block additional Git repository-plumbing env variables such as `GIT_DIR`, `GIT_WORK_TREE`, `GIT_COMMON_DIR`, `GIT_INDEX_FILE`, `GIT_OBJECT_DIRECTORY`, `GIT_ALTERNATE_OBJECT_DIRECTORIES`, and `GIT_NAMESPACE` so host-run Git commands cannot be redirected to attacker-chosen repository state through inherited or request-scoped env. (#62002) Thanks @eleqtrizit.
+- Host exec/env sanitization: block additional request-scoped credential and config-path overrides such as `KUBECONFIG`, cloud credential-path env, `CARGO_HOME`, and `HELM_HOME` so host-run tools can no longer be redirected to attacker-chosen config or state. (#59119) Thanks @eleqtrizit.

 ## 2026.4.5

@@ -539,7 +620,7 @@ Docs: https://docs.openclaw.ai
 - Agents/scheduling: steer background-now work toward automatic completion wake and treat `process` polling as on-demand inspection or intervention instead of default completion handling. (#60877) Thanks @vincentkoc.
 - Agents/skills: skip `.git` and `node_modules` when mirroring skills into sandbox workspaces so read-only sandboxes do not copy repo history or dependency trees. (#61090) Thanks @joelnishanth.
 - ACP/agents: inherit the target agent workspace for cross-agent ACP spawns and fall back safely when the inherited workspace no longer exists. (#58438) Thanks @zssggle-rgb.
- ACPX/Windows: preserve backslashes and absolute `.exe` paths in Claude CLI parsing, and fail fast on wrapper-script targets with guidance to use `cmd.exe /c`, `powershell.exe -File`, or `node <script>`. (#60689) Thanks @steipete.
+- ACPX/Windows: preserve backslashes and absolute `.exe` paths in Claude CLI parsing, and fail fast on wrapper-script targets with guidance to use `cmd.exe /c`, `powershell.exe -File`, or `node <script>`. (#60689)
 - Auth/failover: persist selected fallback overrides before retrying, shorten `auth_permanent` lockouts, and refresh websocket/shared-auth sessions only when real auth changes occur so retries and secret rotations behave predictably. (#60404, #60323, #60387) Thanks @extrasmall0 and @mappel-nv.
 - Gateway/channels: pin the initial startup channel registry before later plugin-registry churn so configured channels stay visible and `channels.status` stops falling back to empty `channelOrder` / `channels` payloads after runtime plugin loads.
 - Prompt caching: order stable workspace project-context files before `HEARTBEAT.md` and keep `HEARTBEAT.md` below the system-prompt cache boundary so heartbeat churn does not invalidate the stable project-context prefix. (#58979) Thanks @yozu and @vincentkoc.
@@ -649,6 +730,7 @@ Docs: https://docs.openclaw.ai
 - Gateway/OpenAI HTTP: restore default operator scopes for bearer-authenticated requests that omit `x-openclaw-scopes`, so headless `/v1/chat/completions` and session-history callers work again after the recent method-scope hardening. (#57596) Thanks @openperf.
 - Gateway/attachments: offload large inbound images without leaking `media://` markers into text-only runs, preserve mixed attachment order for model input/transcripts, and fail closed when model image capability cannot be resolved. (#55513) Thanks @Syysean.
 - Telegram/outbound chunking: use static markdown chunking when Telegram runtime state is unavailable so long outbound Telegram messages still split correctly after cold starts. (#57816) Thanks @ForestDengHK.
+- Update/Corepack: disable interactive Corepack download prompts during update preflight install unless `COREPACK_ENABLE_DOWNLOAD_PROMPT` is already explicitly set, so `openclaw update` can fetch the repo-pinned pnpm version non-interactively. (#61456) Thanks @p6l-richard.

 ## 2026.4.2

@@ -1510,7 +1592,7 @@ Docs: https://docs.openclaw.ai
 - Gateway/status: tolerate network interface discovery failures in status, onboarding control-UI links, and self-presence display paths so those surfaces fall back cleanly instead of crashing. (#52195) Thanks @meng-clb.
 - Gateway/Linux: auto-detect nvm-managed Node TLS CA bundle needs before CLI startup and refresh installed services that are missing `NODE_EXTRA_CA_CERTS`. (#51146) Thanks @GodsBoy.
 - Google auth/Node 25: patch `gaxios` to use native fetch without injecting `globalThis.window`, while translating proxy and mTLS transport settings so Google Vertex and Google Chat auth keep working on Node 25. (#47914) Thanks @pdd-cli.
- Gateway/plugins: pin runtime webhook routes to the gateway startup registry so channel webhooks keep working across plugin-registry churn, and make plugin auth + dispatch resolve routes from the same live HTTP-route registry. (#47902) Fixes #46924 and #47041. Thanks @steipete.
+- Gateway/plugins: pin runtime webhook routes to the gateway startup registry so channel webhooks keep working across plugin-registry churn, and make plugin auth + dispatch resolve routes from the same live HTTP-route registry. (#47902) Fixes #46924 and #47041.
 - Gateway/restart: defer externally signaled unmanaged restarts through the in-process idle drain, and preserve the restored subagent run as remap fallback during orphan recovery so resumed sessions do not duplicate work. (#47719) Thanks @joeykrug.
 - Telegram/setup: seed fresh setups with `channels.telegram.groups["*"].requireMention=true` so new bots stay mention-gated in groups unless you explicitly open them up. Thanks @vincentkoc.
 - Inbound policy hardening: tighten callback and webhook sender checks across Mattermost and Google Chat, match Nextcloud Talk rooms by stable room token, and treat explicit empty Twitch allowlists as deny-all. (#46787) Thanks @zpbrent, @ijxpwastaken and @vincentkoc.
@@ -2923,7 +3005,7 @@ Docs: https://docs.openclaw.ai
 - Gemini OAuth/Auth flow: align OAuth project discovery metadata and endpoint fallback handling for Gemini CLI auth, including fallback coverage for environment-provided project IDs. (#16684) Thanks @vincentkoc.
 - Google Chat/Lifecycle: keep Google Chat `startAccount` pending until abort in webhook mode so startup is no longer interpreted as immediate exit, preventing auto-restart loops and webhook-target churn. (#27384) thanks @junsuwhy.
 - Temp dirs/Linux umask: force `0700` permissions after temp-dir creation and self-heal existing writable temp dirs before trust checks so `umask 0002` installs no longer crash-loop on startup. Landed from contributor PR #27860. (#27853) Thanks @stakeswky.
- Nextcloud Talk/Lifecycle: keep `startAccount` pending until abort and stop the webhook monitor on shutdown, preventing `EADDRINUSE` restart loops when the gateway manages account lifecycle. (#27897) Thanks @steipete.
+- Nextcloud Talk/Lifecycle: keep `startAccount` pending until abort and stop the webhook monitor on shutdown, preventing `EADDRINUSE` restart loops when the gateway manages account lifecycle. (#27897)
 - Microsoft Teams/File uploads: acknowledge `fileConsent/invoke` immediately (`invokeResponse` before upload + file card send) so Teams no longer shows false "Something went wrong" timeout banners while upload completion continues asynchronously; includes updated async regression coverage. Landed from contributor PR #27641 by @scz2011.
 - Queue/Drain/Cron reliability: harden lane draining with guaranteed `draining` flag reset on synchronous pump failures, reject new queue enqueues during gateway restart drain windows (instead of silently killing accepted tasks), add `/stop` queued-backlog cutoff metadata with stale-message skipping (while avoiding cross-session native-stop cutoff bleed), and raise isolated cron `agentTurn` outer safety timeout to avoid false 10-minute timeout races against longer agent session timeouts. (#27407, #27332, #27427)
 - Typing/Main reply pipeline: always mark dispatch idle in `agent-runner` finalization so typing cleanup runs even when dispatcher `onIdle` does not fire, preventing stuck typing indicators after run completion. (#27250) Thanks @Sid-Qin.
@@ -2940,7 +3022,7 @@ Docs: https://docs.openclaw.ai
 - Agents/Canvas default node resolution: when multiple connected canvas-capable nodes exist and no single `mac-*` candidate is selected, default to the first connected candidate instead of failing with `node required` for implicit-node canvas tool calls. Landed from contributor PR #27444. Thanks @carbaj03.
 - TUI/stream assembly: preserve streamed text across real tool-boundary drops without keeping stale streamed text when non-text blocks appear only in the final payload. Landed from contributor PR #27711 by @scz2011. (#27674)
 - Hooks/Internal `message:sent`: forward `sessionKey` on outbound sends from agent delivery, cron isolated delivery, gateway receipt acks, heartbeat sends, session-maintenance warnings, and restart-sentinel recovery so internal `message:sent` hooks consistently dispatch with session context, including `openclaw agent --deliver` runs resumed via `--session-id` (without explicit `--session-key`). Landed from contributor PR #27584. Thanks @qualiobra.
- Pi image-token usage: stop re-injecting history image blocks each turn, process image references from the current prompt only, and prune already-answered user-image blocks in stored history to prevent runaway token growth. (#27602) Thanks @steipete.
+- Pi image-token usage: stop re-injecting history image blocks each turn, process image references from the current prompt only, and prune already-answered user-image blocks in stored history to prevent runaway token growth. (#27602)
 - BlueBubbles/SSRF: auto-allowlist the configured `serverUrl` hostname for attachment fetches so localhost/private-IP BlueBubbles setups are no longer false-blocked by default SSRF checks. Landed from contributor PR #27648 by @lailoo. (#27599) Thanks @taylorhou for reporting.
 - Agents/Compaction + onboarding safety: prevent destructive double-compaction by stripping stale assistant usage around compaction boundaries, skipping post-compaction custom metadata writes in the same attempt, and cancelling safeguard compaction when there are no real conversation messages to summarize; harden workspace/bootstrap detection for memory-backed workspaces; and change `openclaw onboard --reset` default scope to `config+creds+sessions` (workspace deletion now requires `--reset-scope full`). (#26458, #27314) Thanks @jaden-clovervnd, @Sid-Qin, and @widingmarcus-cyber for fix direction in #26502, #26529, and #27492.
 - NO_REPLY suppression: suppress `NO_REPLY` before Slack API send and in sub-agent announce completion flow so sentinel text no longer leaks into user channels. Landed from contributor PRs #27529 (by @Sid-Qin) and #27535 (rewritten minimal landing by maintainers). (#27387, #27531)
@@ -2962,7 +3044,7 @@ Docs: https://docs.openclaw.ai
 - LINE/Inline directives auth: gate directive parsing (`/model`, `/think`, `/verbose`, `/reasoning`, `/queue`) on resolved authorization (`command.isAuthorizedSender`) so `commands.allowFrom`-authorized LINE senders are not silently stripped when raw `CommandAuthorized` is unset. Landed from contributor PR #27248 by @kevinWangSheng. (#27240)
 - Onboarding/Gateway: seed default Control UI `allowedOrigins` for non-loopback binds during onboarding (`localhost`/`127.0.0.1` plus custom bind host) so fresh non-loopback setups do not fail startup due to missing origin policy. (#26157) thanks @stakeswky.
 - Docker/GCP onboarding: reduce first-build OOM risk by capping Node heap during `pnpm install`, reuse existing gateway token during `docker-setup.sh` reruns so `.env` stays aligned with config, auto-bootstrap Control UI allowed origins for non-loopback Docker binds, and add GCP docs guidance for tokenized dashboard links + pairing recovery commands. (#26253) Thanks @pandego.
- CLI/Gateway `--force` in non-root Docker: recover from `lsof` permission failures (`EACCES`/`EPERM`) by falling back to `fuser` kill + probe-based port checks, so `openclaw gateway --force` works for default container `node` user flows. (#27941) Thanks @steipete.
+- CLI/Gateway `--force` in non-root Docker: recover from `lsof` permission failures (`EACCES`/`EPERM`) by falling back to `fuser` kill + probe-based port checks, so `openclaw gateway --force` works for default container `node` user flows. (#27941)
 - Gateway/Bind visibility: emit a startup warning when binding to non-loopback addresses so operators get explicit exposure guidance in runtime logs. (#25397) thanks @let5sne.
 - Sessions cleanup/Doctor: add `openclaw sessions cleanup --fix-missing` to prune store entries whose transcript files are missing, including doctor guidance and CLI coverage. Landed from contributor PR #27508 by @Sid-Qin. (#27422)
 - Doctor/State integrity: ignore metadata-only slash routing sessions when checking recent missing transcripts so `openclaw doctor` no longer reports false-positive transcript-missing warnings for `*:slash:*` keys. (#27375) thanks @gumadeiras.
@@ -3024,24 +3106,24 @@ Docs: https://docs.openclaw.ai
 - Slack/Threading: stop forcing tool-call reply mode to `all` based on `ThreadLabel` alone; now force thread reply mode only when an explicit thread target exists (`MessageThreadId`/`ReplyToId`), so DM `replyToModeByChatType.direct` overrides are honored outside real thread replies. (#26251) Thanks @dbachelder.
 - Slack/Threading: when `replyToMode="all"` auto-threads top-level Slack DMs, seed the thread session key from the message `ts` so the initial message and later replies share the same isolated `:thread:` session instead of falling back to base DM context. (#26849) Thanks @calder-sandy.
 - Agents/Subagents delivery: refactor subagent completion announce dispatch into an explicit queue/direct/fallback state machine, recover outbound channel-plugin resolution in cold/stale plugin-registry states across announce/message/gateway send paths, finalize cleanup bookkeeping when announce flow rejects, and treat Telegram sends without `message_id` as delivery failures (instead of false-success `"unknown"` IDs). (#26867, #25961, #26803, #25069, #26741) Thanks @SmithLabsLLC and @docaohieu2808.
- Telegram/Webhook: pre-initialize webhook bots, switch webhook processing to callback-mode JSON handling, and preserve full near-limit payload reads under delayed handlers to prevent webhook request hangs and dropped updates. (#26156) Thanks @steipete.
+- Telegram/Webhook: pre-initialize webhook bots, switch webhook processing to callback-mode JSON handling, and preserve full near-limit payload reads under delayed handlers to prevent webhook request hangs and dropped updates. (#26156)
 - Slack/Session threads: prevent oversized parent-session inheritance from silently bricking new thread sessions, surface embedded context-overflow empty-result failures to users, and add configurable `session.parentForkMaxTokens` (default `100000`, `0` disables). (#26912) Thanks @markshields-tl.
 - Cron/Message multi-account routing: honor explicit `delivery.accountId` for isolated cron delivery resolution, and when `message.send` omits `accountId`, fall back to the sending agent's bound channel account instead of defaulting to the global account. (#27015, #26975) Thanks @lbo728 and @stakeswky.
 - Gateway/Message media roots: thread `agentId` through gateway `send` RPC and prefer explicit `agentId` over session/default resolution so non-default agent workspace media sends no longer fail with `LocalMediaAccessError`; added regression coverage for agent precedence and blank-agent fallback. (#23249) Thanks @Sid-Qin.
 - Followups/Routing: when explicit origin routing fails, allow same-channel fallback dispatch (while still blocking cross-channel fallback) so followup replies do not get dropped on transient origin-adapter failures. (#26109) Thanks @Sid-Qin.
- Cron/Announce duplicate guard: track attempted announce/direct delivery separately from confirmed `delivered`, and suppress fallback main-session cron summaries when delivery was already attempted to avoid duplicate end-user sends in uncertain-ack paths. (#27018) Thanks @steipete.
+- Cron/Announce duplicate guard: track attempted announce/direct delivery separately from confirmed `delivered`, and suppress fallback main-session cron summaries when delivery was already attempted to avoid duplicate end-user sends in uncertain-ack paths. (#27018)
 - LINE/Lifecycle: keep LINE `startAccount` pending until abort so webhook startup is no longer misread as immediate channel exit, preventing restart-loop storms on LINE provider boot. (#26528) Thanks @Sid-Qin.
 - Discord/Gateway: capture and drain startup-time gateway `error` events before lifecycle listeners attach so early `Fatal Gateway error: 4014` closes surface as actionable intent guidance instead of uncaught gateway crashes. (#23832) Thanks @theotarr.
 - Discord/Inbound text: preserve embed `title` + `description` fallback text in message and forwarded snapshot parsing so embed titles are not silently dropped from agent input. (#26946) Thanks @stakeswky.
 - Slack/Inbound media fallback: deliver file-only messages even when Slack media downloads fail by adding a filename placeholder fallback, capping fallback names to the shared media-file limit, and normalizing empty filenames to `file` so attachment-only messages are not silently dropped. (#25181) Thanks @justinhuangcode.
- Telegram/Preview cleanup: keep finalized text previews when a later assistant message is media-only (for example mixed text plus voice turns) by skipping finalized preview archival at assistant-message boundaries, preventing cleanup from deleting already-visible final text messages. (#27042) Thanks @steipete.
+- Telegram/Preview cleanup: keep finalized text previews when a later assistant message is media-only (for example mixed text plus voice turns) by skipping finalized preview archival at assistant-message boundaries, preventing cleanup from deleting already-visible final text messages. (#27042)
 - Telegram/Markdown spoilers: keep valid `||spoiler||` pairs while leaving unmatched trailing `||` delimiters as literal text, avoiding false all-or-nothing spoiler suppression. (#26105) Thanks @Sid-Qin.
 - Slack/Allowlist channels: match channel IDs case-insensitively during channel allowlist resolution so lowercase config keys (for example `c0abc12345`) correctly match Slack runtime IDs (`C0ABC12345`) under `groupPolicy: "allowlist"`, preventing silent channel-event drops. (#26878) Thanks @lbo728.
 - Discord/Typing indicator: prevent stuck typing indicators by sealing channel typing keepalive callbacks after idle/cleanup and ensuring Discord dispatch always marks typing idle even if preview-stream cleanup fails. (#26295) Thanks @ngutman.
 - Channels/Typing indicator: guard typing keepalive start callbacks after idle/cleanup close so post-close ticks cannot re-trigger stale typing indicators. (#26325) Thanks @win4r.
 - Followups/Typing indicator: ensure followup turns mark dispatch idle on every exit path (including `NO_REPLY`, empty payloads, and agent errors) so typing keepalive cleanup always runs and channel typing indicators do not get stuck after queued/silent followups. (#26881) Thanks @codexGW.
- Voice-call/TTS tools: hide the `tts` tool when the message provider is `voice`, preventing voice-call runs from selecting self-playback TTS and falling into silent no-output loops. (#27025) Thanks @steipete.
- Agents/Tools: normalize non-standard plugin tool results that omit `content` so embedded runs no longer crash with `Cannot read properties of undefined (reading 'filter')` after tool completion (including `tesseramemo_query`). (#27007) Thanks @steipete.
+- Voice-call/TTS tools: hide the `tts` tool when the message provider is `voice`, preventing voice-call runs from selecting self-playback TTS and falling into silent no-output loops. (#27025)
+- Agents/Tools: normalize non-standard plugin tool results that omit `content` so embedded runs no longer crash with `Cannot read properties of undefined (reading 'filter')` after tool completion (including `tesseramemo_query`). (#27007)
 - Agents/Tool-call dispatch: trim whitespace-padded tool names in both transcript repair and live streamed embedded-runner responses so exact-match tool lookup no longer fails with `Tool ... not found` for model outputs like `" read "`. (#27094) Thanks @openperf and @Sid-Qin.
 - Cron/Model overrides: when isolated `payload.model` is no longer allowlisted, fall back to default model selection instead of failing the job, while still returning explicit errors for invalid model strings. (#26717) Thanks @Youyou972.
 - Agents/Model fallback: keep explicit text + image fallback chains reachable even when `agents.defaults.models` allowlists are present, prefer explicit run `agentId` over session-key parsing for followup fallback override resolution (with session-key fallback), treat agent-level fallback overrides as configured in embedded runner preflight, and classify `model_cooldown` / `cooling down` errors as `rate_limit` so failover continues. (#11972, #24137, #17231)
@@ -3087,7 +3169,7 @@ Docs: https://docs.openclaw.ai

 ### Changes

- Auto-reply/Abort shortcuts: expand standalone stop phrases (`stop openclaw`, `stop action`, `stop run`, `stop agent`, `please stop`, and related variants), accept trailing punctuation (for example `STOP OPENCLAW!!!`), add multilingual stop keywords (including ES/FR/ZH/HI/AR/JP/DE/PT/RU forms), and treat exact `do not do that` as a stop trigger while preserving strict standalone matching. (#25103) Thanks @steipete and @vincentkoc.
+- Auto-reply/Abort shortcuts: expand standalone stop phrases (`stop openclaw`, `stop action`, `stop run`, `stop agent`, `please stop`, and related variants), accept trailing punctuation (for example `STOP OPENCLAW!!!`), add multilingual stop keywords (including ES/FR/ZH/HI/AR/JP/DE/PT/RU forms), and treat exact `do not do that` as a stop trigger while preserving strict standalone matching. (#25103) Thanks @vincentkoc.
 - Android/App UX: ship a native four-step onboarding flow, move post-onboarding into a five-tab shell (Connect, Chat, Voice, Screen, Settings), add a full Connect setup/manual mode screen, and refresh Android chat/settings surfaces for the new navigation model.
 - Talk/Gateway config: add provider-agnostic Talk configuration with legacy compatibility, and expose gateway Talk ElevenLabs config metadata for setup/status surfaces.
 - Security/Audit: add `security.trust_model.multi_user_heuristic` to flag likely shared-user ingress and clarify the personal-assistant trust model, with hardening guidance for intentional multi-user setups (`sandbox.mode="all"`, workspace-scoped FS, reduced tool surface, no personal/private identities on shared runtimes).
@@ -3097,7 +3179,7 @@ Docs: https://docs.openclaw.ai

 - Routing/Session isolation: harden followup routing so explicit cross-channel origin replies never fall back to the active dispatcher on route failure, preserve queued overflow summary routing metadata (`channel`/`to`/`thread`) across followup drain, and prefer originating channel context over internal provider tags for embedded followup runs. This prevents webchat/control-ui context from hijacking Discord-targeted replies in shared sessions. (#25864) Thanks @Gamedesigner.
 - Security/Routing: fail closed for shared-session cross-channel replies by binding outbound target resolution to the current turn's source channel metadata (instead of stale session route fallbacks), and wire those turn-source fields through gateway + command delivery planners with regression coverage. (#24571) Thanks @brandonwise.
- Heartbeat routing: prevent heartbeat leakage/spam into Discord and other direct-message destinations by blocking direct-chat heartbeat delivery targets and keeping blocked-delivery cron/exec prompts internal-only. (#25871) Thanks @steipete.
+- Heartbeat routing: prevent heartbeat leakage/spam into Discord and other direct-message destinations by blocking direct-chat heartbeat delivery targets and keeping blocked-delivery cron/exec prompts internal-only. (#25871)
 - Heartbeat defaults/prompts: switch the implicit heartbeat delivery target from `last` to `none` (opt-in for external delivery), and use internal-only cron/exec heartbeat prompt wording when delivery is disabled so background checks do not nudge user-facing relay behavior. (#25871, #24638, #25851)
 - Auto-reply/Heartbeat queueing: drop heartbeat runs when a session already has an active run instead of enqueueing a stale followup, preventing duplicate heartbeat response branches after queue drain. (#25610, #25606) Thanks @mcaxtr.
 - Cron/Heartbeat delivery: stop inheriting cached session `lastThreadId` for heartbeat-mode target resolution unless a thread/topic is explicitly requested, so announce-mode cron and heartbeat deliveries stay on top-level destinations instead of leaking into active conversation threads. (#25730) Thanks @markshields-tl.
@@ -3130,7 +3212,7 @@ Docs: https://docs.openclaw.ai
 - Windows/Media safety checks: align async local-file identity validation with sync-safe-open behavior by treating win32 `dev=0` stats as unknown-device fallbacks (while keeping strict dev checks when both sides are non-zero), fixing false `Local media path is not safe to read` drops for local attachments/TTS/images. (#25708, #21989, #25699, #25878) Thanks @kevinWangSheng.
 - iMessage/Reasoning safety: harden iMessage echo suppression with outbound `messageId` matching (plus scoped text fallback), and enforce reasoning-payload suppression on routed outbound delivery paths to prevent hidden thinking text from being sent as user-visible channel messages. (#25897, #1649, #25757) Thanks @rmarr and @Iranb.
 - Providers/OpenRouter/Auth profiles: bypass auth-profile cooldown/disable windows for OpenRouter, so provider failures no longer put OpenRouter profiles into local cooldown and stale legacy cooldown markers are ignored in fallback and status selection paths. (#25892) Thanks @alexanderatallah for raising this and @vincentkoc for the fix.
- Providers/Google reasoning: sanitize invalid negative `thinkingBudget` payloads for Gemini 3.1 requests by dropping `-1` budgets and mapping configured reasoning effort to `thinkingLevel`, preventing malformed reasoning payloads on `google-generative-ai`. (#25900) Thanks @steipete.
+- Providers/Google reasoning: sanitize invalid negative `thinkingBudget` payloads for Gemini 3.1 requests by dropping `-1` budgets and mapping configured reasoning effort to `thinkingLevel`, preventing malformed reasoning payloads on `google-generative-ai`. (#25900)
 - Providers/SiliconFlow: normalize `thinking="off"` to `thinking: null` for `Pro/*` model payloads to avoid provider-side 400 loops and misleading compaction retries. (#25435) Thanks @Zjianru.
 - Models/Bedrock auth: normalize additional Bedrock provider aliases (`bedrock`, `aws-bedrock`, `aws_bedrock`, `amazon bedrock`) to canonical `amazon-bedrock`, ensuring auth-mode resolution consistently selects AWS SDK fallback. (#25756) Thanks @fwhite13.
 - Models/Providers: preserve explicit user `reasoning` overrides when merging provider model config with built-in catalog metadata, so `reasoning: false` is no longer overwritten by catalog defaults. (#25314) Thanks @lbo728.
@@ -3226,7 +3308,7 @@ Docs: https://docs.openclaw.ai
 - Providers/Groq: avoid classifying Groq TPM limit errors as context overflow so throttling paths no longer trigger overflow recovery logic. (#16176) Thanks @dddabtc.
 - Gateway/Restart: treat child listener PIDs as owned by the service runtime PID during restart health checks to avoid false stale-process kills and restart timeouts on launchd/systemd. (#24696) Thanks @gumadeiras.
 - Config/Write: apply `unsetPaths` with immutable path-copy updates so config writes never mutate caller-provided objects, and harden `openclaw config get/set/unset` path traversal by rejecting prototype-key segments and inherited-property traversal. (#24134) thanks @frankekn.
- Channels/WhatsApp: accept `channels.whatsapp.enabled` in config validation to match built-in channel auto-enable behavior, preventing `Unrecognized key: "enabled"` failures during channel setup. (#24263) Thanks @steipete.
+- Channels/WhatsApp: accept `channels.whatsapp.enabled` in config validation to match built-in channel auto-enable behavior, preventing `Unrecognized key: "enabled"` failures during channel setup. (#24263)
 - Security/Exec: detect obfuscated commands before exec allowlist decisions and require explicit approval for obfuscation patterns. (#8592) Thanks @CornBrother0x and @vincentkoc.
 - Security/ACP: harden ACP client permission auto-approval to require trusted core tool IDs, ignore untrusted `toolCall.kind` hints, and scope `read` auto-approval to the active working directory so unknown tool names and out-of-scope file reads always prompt. Thanks @nedlir for reporting.
 - Security/Skills: escape user-controlled prompt, filename, and output-path values in `openai-image-gen` HTML gallery generation to prevent stored XSS in generated `index.html` output. (#12538) Thanks @CornBrother0x.
@@ -3251,7 +3333,7 @@ Docs: https://docs.openclaw.ai
 - Update/Core: add an optional built-in auto-updater for package installs (`update.auto.*`), default-off, with stable rollout delay+jitter and beta hourly cadence.
 - CLI/Update: add `openclaw update --dry-run` to preview channel/tag/target/restart actions without mutating config, installing, syncing plugins, or restarting.
 - Config/UI: add tag-aware settings filtering and broaden config labels/help copy so fields are easier to discover and understand in the dashboard config screen.
- Channels/Synology Chat: add a native Synology Chat channel plugin with webhook ingress, direct-message routing, outbound send/media support, per-account config, and DM policy controls. (#23012) Thanks @steipete.
+- Channels/Synology Chat: add a native Synology Chat channel plugin with webhook ingress, direct-message routing, outbound send/media support, per-account config, and DM policy controls. (#23012)
 - iOS/Talk: prefetch TTS segments and suppress expected speech-cancellation errors for smoother talk playback. (#22833) Thanks @ngutman.
 - Memory/FTS: add Spanish and Portuguese stop-word filtering for query expansion in FTS-only search mode, improving conversational recall for both languages. Thanks @vincentkoc.
 - Memory/FTS: add Japanese-aware query expansion tokenization and stop-word filtering (including mixed-script terms like ASCII + katakana) for FTS-only search mode. Thanks @vincentkoc.
@@ -3273,10 +3355,10 @@ Docs: https://docs.openclaw.ai
 - Agents/Moonshot: force `supportsDeveloperRole=false` for Moonshot-compatible `openai-completions` models (provider `moonshot` and Moonshot base URLs), so initial runs no longer send unsupported `developer` roles that trigger `ROLE_UNSPECIFIED` errors. (#21060, #22194) Thanks @ShengFuC.
 - Agents/Kimi: classify Moonshot `Your request exceeded model token limit` failures as context overflows so auto-compaction and user-facing overflow recovery trigger correctly instead of surfacing raw invalid-request errors. (#9562) Thanks @danilofalcao.
 - Providers/Moonshot: mark Kimi K2.5 as image-capable in implicit + onboarding model definitions, and refresh stale explicit provider capability fields (`input`/`reasoning`/context limits) from implicit catalogs so existing configs pick up Moonshot vision support without manual model rewrites. (#13135, #4459) Thanks @manikv12.
- Agents/Transcript: enable consecutive-user turn merging for strict non-OpenAI `openai-completions` providers (for example Moonshot/Kimi), reducing `roles must alternate` ordering failures on OpenAI-compatible endpoints while preserving current OpenRouter/Opencode behavior. (#7693) Thanks @steipete.
+- Agents/Transcript: enable consecutive-user turn merging for strict non-OpenAI `openai-completions` providers (for example Moonshot/Kimi), reducing `roles must alternate` ordering failures on OpenAI-compatible endpoints while preserving current OpenRouter/Opencode behavior. (#7693)
 - Install/Discord Voice: make the native Opus decoder optional so `openclaw` install/update no longer hard-fails when native builds fail, while keeping `opusscript` as the runtime fallback decoder for Discord voice flows. (#23737, #23733, #23703) Thanks @jeadland, @Sheetaa, and @Breakyman.
 - Docker/Setup: precreate `$OPENCLAW_CONFIG_DIR/identity` during `docker-setup.sh` so CLI commands that need device identity (for example `devices list`) avoid `EACCES ... /home/node/.openclaw/identity` failures on restrictive bind mounts. (#23948) Thanks @ackson-beep.
- Exec/Background: stop applying the default exec timeout to background sessions (`background: true` or explicit `yieldMs`) when no explicit timeout is set, so long-running background jobs are no longer terminated at the default timeout boundary. (#23303) Thanks @steipete.
+- Exec/Background: stop applying the default exec timeout to background sessions (`background: true` or explicit `yieldMs`) when no explicit timeout is set, so long-running background jobs are no longer terminated at the default timeout boundary. (#23303)
 - Slack/Threading: sessions: keep parent-session forking and thread-history context active beyond first turn by removing first-turn-only gates in session init, thread-history fetch, and reply prompt context injection. (#23843, #23090) Thanks @vincentkoc and @Taskle.
 - Slack/Threading: respect `replyToMode` when Slack auto-populates top-level `thread_ts`, and ignore inline `replyToId` directive tags when `replyToMode` is `off` so thread forcing stays disabled unless explicitly configured. (#23839, #23320, #23513) Thanks @vincentkoc and @dorukardahan.
 - Slack/Extension: forward `message read` `threadId` to `readMessages` and use delivery-context `threadId` as outbound `thread_ts` fallback so extension replies/reads stay in the correct Slack thread. (#22216, #22485, #23836) Thanks @vincentkoc, @lan17 and @dorukardahan.
@@ -3296,7 +3378,7 @@ Docs: https://docs.openclaw.ai
 - Telegram/Webhook: add `channels.telegram.webhookPort` config support and pass it through plugin startup wiring to the monitor listener.
 - Browser/Extension Relay: refactor the MV3 worker to preserve debugger attachments across relay drops, auto-reconnect with bounded backoff+jitter, persist and rehydrate attached tab state via `chrome.storage.session`, recover from `target_closed` navigation detaches, guard stale socket handlers, enforce per-tab operation locks and per-request timeouts, and add lifecycle keepalive/badge refresh hooks (`alarms`, `webNavigation`). (#15099, #6175, #8468, #9807)
 - Browser/Relay: treat extension websocket as connected only when `OPEN`, allow reconnect when a stale `CLOSING/CLOSED` extension socket lingers, and guard stale socket message/close handlers so late events cannot clear active relay state; includes regression coverage for live-duplicate `409` rejection and immediate reconnect-after-close races. (#15099, #18698, #20688)
- Browser/Remote CDP: extend stale-target recovery so `ensureTabAvailable()` now reuses the sole available tab for remote CDP profiles (same behavior as extension profiles) while preserving strict `tab not found` errors when multiple tabs exist; includes remote-profile regression tests. (#15989) Thanks @steipete.
+- Browser/Remote CDP: extend stale-target recovery so `ensureTabAvailable()` now reuses the sole available tab for remote CDP profiles (same behavior as extension profiles) while preserving strict `tab not found` errors when multiple tabs exist; includes remote-profile regression tests. (#15989)
 - Gateway/Pairing: treat `operator.admin` as satisfying other `operator.*` scope checks during device-auth verification so local CLI/TUI sessions stop entering pairing-required loops for pairing/approval-scoped commands. (#22062, #22193, #21191) Thanks @Botaccess, @jhartshorn, and @ctbritt.
 - Gateway/Pairing: auto-approve loopback `scope-upgrade` pairing requests (including device-token reconnects) so local clients do not disconnect on pairing-required scope elevation. (#23708) Thanks @widingmarcus-cyber.
 - Gateway/Scopes: include `operator.read` and `operator.write` in default operator connect scope bundles across CLI, Control UI, and macOS clients so write-scoped announce/sub-agent follow-up calls no longer hit `pairing required` disconnects on loopback gateways. (#22582) thanks @YuzuruS.
@@ -3334,25 +3416,25 @@ Docs: https://docs.openclaw.ai
 - Security/Group policy: harden `channels.*.groups.*.toolsBySender` matching by requiring explicit sender-key types (`id:`, `e164:`, `username:`, `name:`), preventing cross-identifier collisions across mutable/display-name fields while keeping legacy untyped keys on a deprecated ID-only path. Thanks @jiseoung for reporting.
 - Channels/Group policy: fail closed when `groupPolicy: "allowlist"` is set without explicit `groups`, honor account-level `groupPolicy` overrides, and enforce `groupPolicy: "disabled"` as a hard group block. (#22215) Thanks @etereo.
 - Telegram/Discord extensions: propagate trusted `mediaLocalRoots` through extension outbound `sendMedia` options so extension direct-send media paths honor agent-scoped local-media allowlists. (#20029, #21903, #23227)
- Agents/Exec: honor explicit agent context when resolving `tools.exec` defaults for runs with opaque/non-agent session keys, so per-agent `host/security/ask` policies are applied consistently. (#11832) Thanks @steipete.
+- Agents/Exec: honor explicit agent context when resolving `tools.exec` defaults for runs with opaque/non-agent session keys, so per-agent `host/security/ask` policies are applied consistently. (#11832)
 - CLI/Sessions: resolve implicit session-store path templates with the configured default agent ID so named-agent setups do not silently read/write stale `agent:main` session/auth stores. (#22685) Thanks @sene1337.
- Doctor/Security: add an explicit warning that `approvals.exec.enabled=false` disables forwarding only, while enforcement remains driven by host-local `exec-approvals.json` policy. (#15047) Thanks @steipete.
- Sandbox/Docker: default sandbox container user to the workspace owner `uid:gid` when `agents.*.sandbox.docker.user` is unset, fixing non-root gateway file-tool permissions under capability-dropped containers. (#20979) Thanks @steipete.
+- Doctor/Security: add an explicit warning that `approvals.exec.enabled=false` disables forwarding only, while enforcement remains driven by host-local `exec-approvals.json` policy. (#15047)
+- Sandbox/Docker: default sandbox container user to the workspace owner `uid:gid` when `agents.*.sandbox.docker.user` is unset, fixing non-root gateway file-tool permissions under capability-dropped containers. (#20979)
 - Plugins/Media sandbox: propagate trusted `mediaLocalRoots` through plugin action dispatch (including Discord/Telegram action adapters) so plugin send paths enforce the same agent-scoped local-media sandbox roots as core outbound sends. (#20258, #22718)
- Agents/Workspace guard: map sandbox container-workdir file-tool paths (for example `/workspace/...` and `file:///workspace/...`) to host workspace roots before workspace-only validation, preventing false `Path escapes sandbox root` rejections for sandbox file tools. (#9560) Thanks @steipete.
- Gateway/Exec approvals: expire approval requests immediately when no approval-capable gateway clients are connected and no forwarding targets are available, avoiding delayed approvals after restarts/offline approver windows. (#22144) Thanks @steipete.
+- Agents/Workspace guard: map sandbox container-workdir file-tool paths (for example `/workspace/...` and `file:///workspace/...`) to host workspace roots before workspace-only validation, preventing false `Path escapes sandbox root` rejections for sandbox file tools. (#9560)
+- Gateway/Exec approvals: expire approval requests immediately when no approval-capable gateway clients are connected and no forwarding targets are available, avoiding delayed approvals after restarts/offline approver windows. (#22144)
 - Security/Exec approvals: when approving wrapper commands with allow-always in allowlist mode, persist inner executable paths for known dispatch wrappers (`env`, `nice`, `nohup`, `stdbuf`, `timeout`) and fail closed (no persisted entry) when wrapper unwrapping is not safe, preventing wrapper-path approval bypasses. Thanks @tdjackey for reporting.
- Node/macOS exec host: default headless macOS node `system.run` to local execution and only route through the companion app when `OPENCLAW_NODE_EXEC_HOST=app` is explicitly set, avoiding companion-app filesystem namespace mismatches during exec. (#23547) Thanks @steipete.
+- Node/macOS exec host: default headless macOS node `system.run` to local execution and only route through the companion app when `OPENCLAW_NODE_EXEC_HOST=app` is explicitly set, avoiding companion-app filesystem namespace mismatches during exec. (#23547)
 - Sandbox/Media: map container workspace paths (`/workspace/...` and `file:///workspace/...`) back to the host sandbox root for outbound media validation, preventing false deny errors for sandbox-generated local media. (#23083) Thanks @echo931.
 - Sandbox/Docker: apply custom bind mounts after workspace mounts and prioritize bind-source resolution on overlapping paths, so explicit workspace binds are no longer ignored. (#22669) Thanks @tasaankaeris.
 - Exec approvals/Forwarding: restore Discord text forwarding when component approvals are not configured, and carry request snapshots through resolve events so resolved notices still forward after cache misses/restarts. (#22988) Thanks @bubmiller.
 - Control UI/WebSocket: stop and clear the browser gateway client on UI teardown so remounts cannot leave orphan websocket clients that create duplicate active connections. (#23422) Thanks @floatinggball-design.
 - Control UI/WebSocket: send a stable per-tab `instanceId` in websocket connect frames so reconnect cycles keep a consistent client identity for diagnostics and presence tracking. (#23616) Thanks @zq58855371-ui.
 - Config/Memory: allow `"mistral"` in `agents.defaults.memorySearch.provider` and `agents.defaults.memorySearch.fallback` schema validation. (#14934) Thanks @ThomsenDrake.
- Feishu/Commands: in group chats, command authorization now falls back to top-level `channels.feishu.allowFrom` when per-group `allowFrom` is not set, so `/command` no longer gets blocked by an unintended empty allowlist. (#23756) Thanks @steipete.
+- Feishu/Commands: in group chats, command authorization now falls back to top-level `channels.feishu.allowFrom` when per-group `allowFrom` is not set, so `/command` no longer gets blocked by an unintended empty allowlist. (#23756)
 - Dev tooling: prevent `CLAUDE.md` symlink target regressions by excluding CLAUDE symlink sentinels from `oxfmt` and marking them `-text` in `.gitattributes`, so formatter/EOL normalization cannot reintroduce trailing-newline targets. Thanks @vincentkoc.
 - Agents/Compaction: restore embedded compaction safeguard/context-pruning extension loading in production by wiring bundled extension factories into the resource loader instead of runtime file-path resolution. (#22349; landed from contributor PR #5005 by @Diaspar4u) Thanks @Diaspar4u.
- Feishu/Media: for inbound video messages that include both `file_key` (video) and `image_key` (thumbnail), prefer `file_key` when downloading media so video attachments are saved instead of silently failing on thumbnail keys. (#23633) Thanks @steipete.
+- Feishu/Media: for inbound video messages that include both `file_key` (video) and `image_key` (thumbnail), prefer `file_key` when downloading media so video attachments are saved instead of silently failing on thumbnail keys. (#23633)
 - Hooks/Loader: avoid redundant hook-module recompilation on gateway restart by skipping cache-busting for bundled hooks and using stable file metadata keys (`mtime+size`) for mutable workspace/managed/plugin hook imports. (#16953) Thanks @mudrii.
 - Hooks/Cron: suppress duplicate main-session events for delivered hook turns and mark `SILENT_REPLY_TOKEN` (`NO_REPLY`) early exits as delivered to prevent hook context pollution. (#20678) Thanks @JonathanWorks.
 - Providers/OpenRouter: inject `cache_control` on system prompts for OpenRouter Anthropic models to improve prompt-cache reuse. (#17473) Thanks @rrenamed.
@@ -3831,7 +3913,7 @@ Docs: https://docs.openclaw.ai
 - Ollama/Qwen: handle Qwen 3 reasoning field format in Ollama responses. (#18631) Thanks @mr-sk.
 - OpenAI/Transcripts: always drop orphaned reasoning blocks from transcript repair. (#18632) Thanks @TySabs.
 - Fix types in all tests. Typecheck the whole repository.
- Gateway/Channels: wire `gateway.channelHealthCheckMinutes` into strict config validation, treat implicit account status as managed for health checks, and harden channel auto-restart flow (preserve restart-attempt caps across crash loops, propagate enabled/configured runtime flags, and stop pending restart backoff after manual stop). Thanks @steipete.
+- Gateway/Channels: wire `gateway.channelHealthCheckMinutes` into strict config validation, treat implicit account status as managed for health checks, and harden channel auto-restart flow (preserve restart-attempt caps across crash loops, propagate enabled/configured runtime flags, and stop pending restart backoff after manual stop).
 - Gateway/WebChat: hard-cap `chat.history` oversized payloads by truncating high-cost fields and replacing over-budget entries with placeholders, so history fetches stay within configured byte limits and avoid chat UI freezes. (#18505)
 - UI/Usage: replace lingering undefined `var(--text-muted)` usage with `var(--muted)` in usage date-range and chart styles to keep muted text visible across themes. (#17975) Thanks @jogelin.
 - UI/Usage: preserve selected-range totals when timeline data is downsampled by bucket-aggregating timeseries points (instead of dropping intermediate points), so filtered tokens/cost stay accurate. (#17959) Thanks @jogelin.
@@ -4841,21 +4923,21 @@ Docs: https://docs.openclaw.ai

 - Providers: Ollama discovery + docs; Venice guide upgrades + cross-links. (#1606) Thanks @abhaymundhara. https://docs.openclaw.ai/providers/ollama https://docs.openclaw.ai/providers/venice
 - Channels: LINE plugin (Messaging API) with rich replies + quick replies. (#1630) Thanks @plum-dawg.
- TTS: Edge fallback (keyless) + `/tts` auto modes. (#1668, #1667) Thanks @steipete, @sebslight. https://docs.openclaw.ai/tts
+- TTS: Edge fallback (keyless) + `/tts` auto modes. (#1668, #1667) Thanks @sebslight. https://docs.openclaw.ai/tts
 - Exec approvals: approve in-chat via `/approve` across all channels (including plugins). (#1621) Thanks @czekaj. https://docs.openclaw.ai/tools/exec-approvals https://docs.openclaw.ai/tools/slash-commands
 - Telegram: DM topics as separate sessions + outbound link preview toggle. (#1597, #1700) Thanks @rohannagpal, @zerone0x. https://docs.openclaw.ai/channels/telegram

 ### Changes

 - Channels: add LINE plugin (Messaging API) with rich replies, quick replies, and plugin HTTP registry. (#1630) Thanks @plum-dawg.
- TTS: add Edge TTS provider fallback, defaulting to keyless Edge with MP3 retry on format failures. (#1668) Thanks @steipete. https://docs.openclaw.ai/tts
+- TTS: add Edge TTS provider fallback, defaulting to keyless Edge with MP3 retry on format failures. (#1668) https://docs.openclaw.ai/tts
 - TTS: add auto mode enum (off/always/inbound/tagged) with per-session `/tts` override. (#1667) Thanks @sebslight. https://docs.openclaw.ai/tts
 - Telegram: treat DM topics as separate sessions and keep DM history limits stable with thread suffixes. (#1597) Thanks @rohannagpal.
 - Telegram: add `channels.telegram.linkPreview` to toggle outbound link previews. (#1700) Thanks @zerone0x. https://docs.openclaw.ai/channels/telegram
 - Web search: add Brave freshness filter parameter for time-scoped results. (#1688) Thanks @JonUleis. https://docs.openclaw.ai/tools/web
 - UI: refresh Control UI dashboard design system (colors, icons, typography). (#1745, #1786) Thanks @EnzeD, @mousberg.
 - Exec approvals: forward approval prompts to chat with `/approve` for all channels (including plugins). (#1621) Thanks @czekaj. https://docs.openclaw.ai/tools/exec-approvals https://docs.openclaw.ai/tools/slash-commands
- Gateway: expose config.patch in the gateway tool with safe partial updates + restart sentinel. (#1653) Thanks @steipete.
+- Gateway: expose config.patch in the gateway tool with safe partial updates + restart sentinel. (#1653)
 - Diagnostics: add diagnostic flags for targeted debug logs (config + env override). https://docs.openclaw.ai/diagnostics/flags
 - Docs: expand FAQ (migration, scheduling, concurrency, model recommendations, OpenAI subscription auth, Pi sizing, hackable install, docs SSL workaround).
 - Docs: add verbose installer troubleshooting guidance.
@@ -4868,9 +4950,9 @@ Docs: https://docs.openclaw.ai

 - Web UI: fix config/debug layout overflow, scrolling, and code block sizing. (#1715) Thanks @saipreetham589.
 - Web UI: show Stop button during active runs, swap back to New session when idle. (#1664) Thanks @ndbroadbent.
- Web UI: clear stale disconnect banners on reconnect; allow form saves with unsupported schema paths but block missing schema. (#1707) Thanks @steipete.
+- Web UI: clear stale disconnect banners on reconnect; allow form saves with unsupported schema paths but block missing schema. (#1707)
 - Web UI: hide internal `message_id` hints in chat bubbles.
- Gateway: allow Control UI token-only auth to skip device pairing even when device identity is present (`gateway.controlUi.allowInsecureAuth`). (#1679) Thanks @steipete.
+- Gateway: allow Control UI token-only auth to skip device pairing even when device identity is present (`gateway.controlUi.allowInsecureAuth`). (#1679)
 - Matrix: decrypt E2EE media attachments with preflight size guard. (#1744) Thanks @araa47.
 - BlueBubbles: route phone-number targets to DMs, avoid leaking routing IDs, and auto-create missing DMs (Private API required). (#1751) Thanks @tyler6204. https://docs.openclaw.ai/channels/bluebubbles
 - BlueBubbles: keep part-index GUIDs in reply tags when short IDs are missing.
@@ -4943,7 +5025,7 @@ Docs: https://docs.openclaw.ai
 - Heartbeat: accept plugin channel ids for heartbeat target validation + UI hints.
 - Messaging/Sessions: mirror outbound sends into target session keys (threads + dmScope), create session entries on send, and normalize session key casing. (#1520, commit 4b6cdd1d3)
 - Sessions: reject array-backed session stores to prevent silent wipes. (#1469)
- Gateway: compare Linux process start time to avoid PID recycling lock loops; keep locks unless stale. (#1572) Thanks @steipete.
+- Gateway: compare Linux process start time to avoid PID recycling lock loops; keep locks unless stale. (#1572)
 - Gateway: accept null optional fields in exec approval requests. (#1511) Thanks @pvoo.
 - Exec approvals: persist allowlist entry ids to keep macOS allowlist rows stable. (#1521) Thanks @ngutman.
 - Exec: honor tools.exec ask/security defaults for elevated approvals (avoid unwanted prompts). (commit 5662a9cdf)
@@ -5249,7 +5331,7 @@ Docs: https://docs.openclaw.ai
 - macOS: bundle Textual resources in packaged app builds to avoid code block crashes. (#1006)
 - Daemon: include HOME in service environments to avoid missing HOME errors. (#1214)

-Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @NicholaiVogel, @RyanLisse, @ThePickle31, @VACInc, @Whoaa512, @YuriNachos, @aaronveklabs, @abdaraxus, @alauppe, @ameno-, @artuskg, @austinm911, @bradleypriest, @cheeeee, @dougvk, @fogboots, @gnarco, @gumadeiras, @jdrhyne, @joelklabo, @longmaba, @mukhtharcm, @odysseus0, @oscargavin, @rhjoh, @sebslight, @sibbl, @sleontenko, @steipete, @suminhthanh, @thewilloftheshadow, @tyler6204, @vignesh07, @visionik, @ysqander, @zerone0x.
+Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @NicholaiVogel, @RyanLisse, @ThePickle31, @VACInc, @Whoaa512, @YuriNachos, @aaronveklabs, @abdaraxus, @alauppe, @ameno-, @artuskg, @austinm911, @bradleypriest, @cheeeee, @dougvk, @fogboots, @gnarco, @gumadeiras, @jdrhyne, @joelklabo, @longmaba, @mukhtharcm, @odysseus0, @oscargavin, @rhjoh, @sebslight, @sibbl, @sleontenko, @suminhthanh, @thewilloftheshadow, @tyler6204, @vignesh07, @visionik, @ysqander, @zerone0x.

 ### Breaking

@@ -5618,7 +5700,7 @@ Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @Nic
 - Gateway/CLI: honor `CLAWDBOT_LAUNCHD_LABEL` / `CLAWDBOT_SYSTEMD_UNIT` overrides; `agents.list` respects explicit config; reduce noisy loopback WS logs during tests; run `openclaw doctor --non-interactive` during updates. (#781) - thanks @ronyrus.
 - Onboarding/Control UI: refuse invalid configs (run doctor first); quote Windows browser URLs for OAuth; keep chat scroll position unless the user is near the bottom. (#764) - thanks @mukhtharcm; (#794) - thanks @roshanasingh4; (#217) - thanks @thewilloftheshadow.
 - Tools/UI: harden tool input schemas for strict providers; drop null-only union variants for Gemini schema cleanup; treat `maxChars: 0` as unlimited; keep TUI last streamed response instead of "(no output)". (#782) - thanks @AbhisekBasu1; (#796) - thanks @gabriel-trigo; (#747) - thanks @thewilloftheshadow.
- Connections UI: polish multi-account account cards. (#816) - thanks @steipete.
+- Connections UI: polish multi-account account cards. (#816)

 ### Installer

@@ -5658,7 +5740,7 @@ Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @Nic
 - Tests: add Docker plugin loader + tgz-install smoke test.
 - Tests: extend Docker plugin E2E to cover installing from local folders (`plugins.load.paths`) and `file:` npm specs.
 - Tests: add coverage for pre-compaction memory flush settings.
- Tests: modernize live model smoke selection for current releases and enforce tools/images/thinking-high coverage. (#769) - thanks @steipete.
+- Tests: modernize live model smoke selection for current releases and enforce tools/images/thinking-high coverage. (#769)
 - Agents/Tools: add `apply_patch` tool for multi-file edits (experimental; gated by tools.exec.applyPatch; OpenAI-only).
 - Agents/Tools: rename the bash tool to exec (config alias maintained). (#748) - thanks @myfunc.
 - Agents: add pre-compaction memory flush config (`agents.defaults.compaction.*`) with a soft threshold + system prompt.
@@ -5678,8 +5760,8 @@ Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @Nic
 ### Fixes

 - Models/Onboarding: configure MiniMax (minimax.io) via Anthropic-compatible `/anthropic` endpoint by default (keep `minimax-api` as a legacy alias).
- Models: normalize Gemini 3 Pro/Flash IDs to preview names for live model lookups. (#769) - thanks @steipete.
- CLI: fix guardCancel typing for configure prompts. (#769) - thanks @steipete.
+- Models: normalize Gemini 3 Pro/Flash IDs to preview names for live model lookups. (#769)
+- CLI: fix guardCancel typing for configure prompts. (#769)
 - Gateway/WebChat: include handshake validation details in the WebSocket close reason for easier debugging; preserve close codes.
 - Gateway/Auth: send invalid connect responses before closing the handshake; stabilize invalid-connect auth test.
 - Gateway: tighten gateway listener detection.
@@ -5696,7 +5778,7 @@ Thanks @AlexMikhalev, @CoreyH, @John-Rood, @KrauseFx, @MaudeBot, @Nachx639, @Nic
 - Auto-reply: align `/think` default display with model reasoning defaults. (#751) - thanks @gabriel-trigo.
 - Auto-reply: flush block reply buffers on tool boundaries. (#750) - thanks @sebslight.
 - Auto-reply: allow sender fallback for command authorization when `SenderId` is empty (WhatsApp self-chat). (#755) - thanks @juanpablodlc.
- Auto-reply: treat whitespace-only sender ids as missing for command authorization (WhatsApp self-chat). (#766) - thanks @steipete.
+- Auto-reply: treat whitespace-only sender ids as missing for command authorization (WhatsApp self-chat). (#766)
 - Heartbeat: refresh prompt text for updated defaults.
 - Memory/QMD: prefer `qmd collection add --glob` for current QMD releases and fall back to legacy `--mask` when older builds reject it. (#55123) Thanks @ForceConstant and @vincentkoc.
 - Agents/Tools: use PowerShell on Windows to capture system utility output. (#748) - thanks @myfunc.
--- a/CONTRIBUTING.md
+++ b/CONTRIBUTING.md
@@ -95,6 +95,7 @@ For coordinated change sets that genuinely need more than 10 PRs, join the **#cl

 - Test locally with your OpenClaw instance
 - Run tests: `pnpm build && pnpm check && pnpm test`
+- For iterative local commits, `scripts/committer --fast "message" <files...>` passes `FAST_COMMIT=1` through to the pre-commit hook so it skips the repo-wide `pnpm check`. Only use it when you've already run equivalent targeted validation for the touched surface.
 - For extension/plugin changes, run the fast local lane first:
  - `pnpm test:extension <extension-name>`
  - `pnpm test:extension --list` to see valid extension ids
--- a/appcast.xml
+++ b/appcast.xml
@@ -2,6 +2,193 @@
 <rss xmlns:sparkle="http://www.andymatuschak.org/xml-namespaces/sparkle" version="2.0">
    <channel>
        <title>OpenClaw</title>
+        <item>
+            <title>2026.4.12</title>
+            <pubDate>Sun, 12 Apr 2026 12:00:00 +0000</pubDate>
+            <link>https://raw.githubusercontent.com/openclaw/openclaw/main/appcast.xml</link>
+            <sparkle:version>2026041290</sparkle:version>
+            <sparkle:shortVersionString>2026.4.12</sparkle:shortVersionString>
+            <sparkle:minimumSystemVersion>15.0</sparkle:minimumSystemVersion>
+            <description><![CDATA[<h2>OpenClaw 2026.4.12</h2>
+<h3>Changes</h3>
+<ul>
+<li>Dreaming/memory-wiki: add ChatGPT import ingestion plus new <code>Imported Insights</code> and <code>Memory Palace</code> diary subtabs so Dreaming can inspect imported source chats, compiled wiki pages, and full source pages directly from the UI. (#64505)</li>
+<li>Control UI/webchat: render assistant media/reply/voice directives as structured chat bubbles, add the <code>[embed ...]</code> rich output tag, and gate external embed URLs behind config. (#64104)</li>
+<li>Tools/video_generate: add URL-only generated asset delivery, typed <code>providerOptions</code>, reference audio inputs, per-asset role hints, <code>adaptive</code> aspect-ratio support, and a higher image-input cap so video providers can expose richer generation modes without forcing large files into memory. (#61987, #61988) Thanks @xieyongliang.</li>
+<li>Feishu: improve document comment sessions with richer context parsing, comment reactions, and typing feedback so document-thread conversations behave more like chat conversations. (#63785)</li>
+<li>Microsoft Teams: add reaction support, reaction listing, Graph pagination, and delegated OAuth setup for sending reactions while preserving application-auth read paths. (#51646)</li>
+<li>Plugins: allow plugin manifests to declare activation and setup descriptors so plugin setup flows can describe required auth, pairing, and configuration steps without hardcoded core special cases. (#64780)</li>
+<li>Ollama: cache <code>/api/show</code> context-window and capability metadata during model discovery so repeated picker refreshes stop refetching unchanged models, while still retrying after empty responses and invalidating on digest changes. (#64753) Thanks @ImLukeF.</li>
+<li>Models/providers: surface how configured OpenAI-compatible endpoints are classified in embedded-agent debug logs, so local and proxy routing issues are easier to diagnose. (#64754) Thanks @ImLukeF.</li>
+<li>QA/parity: add the GPT-5.4 vs Opus 4.6 agentic parity report gate with shared scenario coverage checks, stricter evidence heuristics, and skipped-scenario accounting for maintainer review. (#64441) Thanks @100yenadmin.</li>
+</ul>
+<h3>Fixes</h3>
+<ul>
+<li>OpenAI/Codex OAuth: stop rewriting the upstream authorize URL scopes so new Codex sign-ins do not fail with <code>invalid_scope</code> before returning an authorization code. (#64713) Thanks @fuller-stack-dev.</li>
+<li>Audio transcription: disable pinned DNS only for OpenAI-compatible multipart requests, while still validating hostnames, so OpenAI, Groq, and Mistral transcription works again without weakening other request paths. (#64766) Thanks @GodsBoy.</li>
+<li>macOS/Talk Mode: after granting microphone permission on first enable, continue starting Talk Mode instead of requiring a second toggle. (#62459) Thanks @ggarber.</li>
+<li>Control UI/webchat: persist agent-run TTS audio replies into webchat history and preserve interleaved tool card pairing so generated audio and mixed tool output stay attached to the right messages. (#63514) Thanks @bittoby.</li>
+<li>WhatsApp: honor the configured default account when the active listener helper is used without an explicit account id, so named default accounts do not get registered under <code>default</code>. (#53918) Thanks @yhyatt.</li>
+<li>ACP/agents: suppress commentary-phase child assistant relay text in ACP parent stream updates, so spawned child runs stop leaking internal progress chatter into the parent session. Thanks @vincentkoc.</li>
+<li>Agents/timeouts: honor explicit run timeouts in the LLM idle watchdog and align default timeout config so slow models can keep working until the configured limit instead of using the wrong idle window.</li>
+<li>Config: include <code>asyncCompletion</code> in the generated zod schema so documented async completion config no longer fails with an unrecognized-key error. (#63618)</li>
+<li>Google/Veo: stop sending the unsupported <code>numberOfVideos</code> request field so Gemini Developer API Veo runs do not fail before OpenClaw can complete the intended Google video generation path. (#64723) Thanks @velvet-shark.</li>
+<li>QA/packaging: stop packaged CLI startup and completion cache generation from reading repo-only QA scenario markdown, ship the bundled QA scenario pack in npm releases, and keep <code>openclaw completion --write-state</code> working even if QA setup is broken. (#64648) Thanks @obviyus.</li>
+<li>Codex/QA: keep Codex app-server coordination chatter out of visible replies, add a live QA leak scenario, and classify leaked harness meta text as a QA failure instead of a successful reply. Thanks @vincentkoc.</li>
+<li>WhatsApp: route <code>message react</code> through the gateway-owned action path so reactions use the live WhatsApp listener in both DM and group chats, matching <code>message send</code> and <code>message poll</code>. Thanks @mcaxtr.</li>
+<li>Auto-reply/WhatsApp: preserve inbound image attachment notes after media understanding so image edits keep the real saved media path instead of hallucinating a missing local path. (#64918) Thanks @ngutman.</li>
+<li>Telegram/sessions: keep topic-scoped session initialization on the canonical topic transcript path when inbound turns omit <code>MessageThreadId</code>, so one topic session no longer alternates between bare and topic-qualified transcript files. (#64869) Thanks @jalehman.</li>
+<li>Agents/failover: scope assistant-side fallback classification and surfaced provider errors to the current attempt instead of stale session history, so cross-provider fallback runs stop inheriting the previous provider's failure. (#62907) Thanks @stainlu.</li>
+<li>MiniMax/OAuth: write <code>api: "anthropic-messages"</code> and <code>authHeader: true</code> into the <code>minimax-portal</code> config patch during <code>openclaw configure</code>, so re-authenticated portal setups keep Bearer auth routing working. (#64964) Thanks @ryanlee666.</li>
+</ul>
+<p><a href="https://github.com/openclaw/openclaw/blob/main/CHANGELOG.md">View full changelog</a></p>
+]]></description>
+            <enclosure url="https://github.com/openclaw/openclaw/releases/download/v2026.4.12/OpenClaw-2026.4.12.zip" length="47317969" type="application/octet-stream" sparkle:edSignature="v9bUsh1mBBPtpMn7kKYAvO8MNJHAeMj7UkmkkuDSC8NvwPx2Fo3+NEeyAyA9s9Vax6L7i+eHSpwzAmtwpnHcCA=="/>
+        </item>
+        <item>
+            <title>2026.4.10</title>
+            <pubDate>Sat, 11 Apr 2026 03:17:02 +0000</pubDate>
+            <link>https://raw.githubusercontent.com/openclaw/openclaw/main/appcast.xml</link>
+            <sparkle:version>2026041090</sparkle:version>
+            <sparkle:shortVersionString>2026.4.10</sparkle:shortVersionString>
+            <sparkle:minimumSystemVersion>15.0</sparkle:minimumSystemVersion>
+            <description><![CDATA[<h2>OpenClaw 2026.4.10</h2>
+<h3>Changes</h3>
+<ul>
+<li>Models/Codex: add the bundled Codex provider and plugin-owned app-server harness so <code>codex/gpt-*</code> models use Codex-managed auth, native threads, model discovery, and compaction while <code>openai/gpt-*</code> stays on the normal OpenAI provider path. (#64298)</li>
+<li>Memory/Active Memory: add a new optional Active Memory plugin that gives OpenClaw a dedicated memory sub-agent right before the main reply, so ongoing chats can automatically pull in relevant preferences, context, and past details without making users remember to manually say "remember this" or "search memory" first. Includes configurable message/recent/full context modes, live <code>/verbose</code> inspection, advanced prompt/thinking overrides for tuning, and opt-in transcript persistence for debugging. Docs: https://docs.openclaw.ai/concepts/active-memory. (#63286) Thanks @Takhoffman.</li>
+<li>macOS/Talk: add an experimental local MLX speech provider for Talk Mode, with explicit provider selection, local utterance playback, interruption handling, and system-voice fallback. (#63539) Thanks @ImLukeF.</li>
+<li>Tools/video generation: add Seedance 2.0 model refs to the bundled fal provider and submit the provider-specific duration, resolution, audio, and seed metadata fields needed for live Seedance 2.0 runs.</li>
+<li>Microsoft Teams: add message actions for pin, unpin, read, react, and listing reactions. (#53432) Thanks @sudie-codes.</li>
+<li>QA/Matrix: add a live <code>openclaw qa matrix</code> lane backed by a disposable Matrix homeserver, shared live-transport seams, and Matrix-specific transport coverage for threading, reactions, restart, and allowlist behavior. (#64489) Thanks @gumadeiras.</li>
+<li>QA/Telegram: add a live <code>openclaw qa telegram</code> lane for private-group bot-to-bot checks, harden its artifact handling, and preserve native Telegram command reply threading for QA verification. (#64303) Thanks @obviyus.</li>
+<li>QA/testing: add a <code>--runner multipass</code> lane for <code>openclaw qa suite</code> so repo-backed QA scenarios can run inside a disposable Linux VM and write back the usual report, summary, and VM logs. (#63426) Thanks @shakkernerd.</li>
+<li>CLI/exec policy: add a local <code>openclaw exec-policy</code> command with <code>show</code>, <code>preset</code>, and <code>set</code> subcommands for synchronizing requested <code>tools.exec.*</code> config with the local exec approvals file, plus follow-up hardening for node-host rejection, rollback safety, and sync conflict detection. (#64050)</li>
+<li>Gateway: add a <code>commands.list</code> RPC so remote gateway clients can discover runtime-native, text, skill, and plugin commands with surface-aware naming and serialized argument metadata. (#62656) Thanks @samzong.</li>
+<li>Models/providers: add per-provider <code>models.providers.*.request.allowPrivateNetwork</code> for trusted self-hosted OpenAI-compatible endpoints, keep the opt-in scoped to model request surfaces, and refresh cached WebSocket managers when request transport overrides change. (#63671) Thanks @qas.</li>
+<li>Feishu: standardize request user agents and register the bot as an AI agent so Feishu deployments identify OpenClaw consistently. (#63835) Thanks @evandance.</li>
+<li>Matrix/partial streaming: add MSC4357 live markers to draft preview sends and edits so supporting Matrix clients can render a live/typewriter animation and stop it when the final edit lands. (#63513) Thanks @TigerInYourDream.</li>
+<li>Control UI/dreaming: simplify the Scene and Diary surfaces, preserve unknown phase state for partial status payloads, and stabilize waiting-entry recency ordering so Dreaming status and review lists stay clear and deterministic. (#64035) Thanks @davemorin.</li>
+<li>Agents: add an opt-in strict-agentic embedded Pi execution contract for GPT-5-family runs so plan-only or filler turns keep acting until they hit a real blocker. (#64241) Thanks @100yenadmin.</li>
+<li>Agents/OpenAI: add provider-owned OpenAI/Codex tool schema compatibility and surface embedded-run replay/liveness state for long-running runs. (#64300) Thanks @100yenadmin.</li>
+<li>Docs i18n: chunk raw doc translation, reject truncated tagged outputs, avoid ambiguous body-only wrapper unwrapping, and recover from terminated Pi translation sessions without changing the default <code>openai/gpt-5.4</code> path. (#62969, #63808) Thanks @hxy91819.</li>
+</ul>
+<h3>Fixes</h3>
+<ul>
+<li>Browser/security: tighten browser and sandbox navigation defenses across strict SSRF defaults, hostname allowlists, interaction-driven redirects, subframes, CDP discovery, existing sessions, tab actions, noVNC, marker-span sanitization, and Docker CDP source-range enforcement. (#61404, #63332, #63882, #63885, #63889, #64367, #64370, #64371)</li>
+<li>Security/tools: harden exec preflight reads, host env denylisting, node output boundaries, outbound host-media reads, profile-mutation authorization, plugin install dependency scanning, ACPX tool hooks, Gmail watcher token redaction, and oversized realtime WebSocket frame handling. (#62333, #62661, #62662, #63277, #63551, #63553, #63886, #63890, #63891, #64459)</li>
+<li>OpenAI/Codex: add required Codex OAuth scopes, classify provider/runtime failures more clearly, stop suggesting <code>/elevated full</code> when auto-approved host exec is unavailable, add OpenAI/Codex tool-schema compatibility, and preserve embedded-run replay/liveness truth across compaction retries and mutating side effects. (#64300, #64439) Thanks @100yenadmin.</li>
+<li>CLI/WhatsApp media sends: route gateway-mode outbound sends with <code>--media</code> through the channel <code>sendMedia</code> path and preserve media access context, so WhatsApp document and attachment sends stop silently dropping the file while still delivering the caption. (#64478, #64492) Thanks @ShionEria.</li>
+<li>Microsoft Teams: restore media downloads for personal DMs, Bot Framework <code>a:</code> conversations, OneDrive/SharePoint shared files, and Graph-backed chat IDs; accept Bot Framework audience tokens; prevent feedback-learning filename collisions; keep long tool chains alive with typing indicators; add SSO sign-in callbacks; inject parent context for thread replies; and deliver cron announcements to Teams conversation IDs. (#54932, #55383, #55386, #58001, #58249, #58774, #59731, #60956, #62219, #62674, #63063, #63942, #63945, #63949, #63951, #63953, #64087, #64088, #64089)</li>
+<li>Gateway/tailscale: start Tailscale exposure and the gateway update check before awaiting channel and plugin sidecar startup so remote operators are not locked out when startup sidecars stall.</li>
+<li>Gateway/startup: keep WebSocket RPC available while channels and plugin sidecars start, hold <code>chat.history</code> unavailable until startup sidecars finish so synchronous history reads cannot stall startup (reported in #63450), refresh advertised gateway methods after deferred plugin reloads, and enforce the pre-auth WebSocket upgrade budget before the no-handler 503 path so upgrade floods cannot bypass connection limits during that window. (#63480) Thanks @neeravmakwana.</li>
+<li>WhatsApp: keep inbound replies, media, composing indicators, and queued outbound deliveries attached to the current socket across reconnect gaps, including fresh retry-eligible sends after the listener comes back. (#30806, #46299, #62892, #63916) Thanks @mcaxtr.</li>
+<li>Gateway/thread routing: preserve Slack, Telegram, Mattermost, Matrix, ACP, restart-sentinel, and agent announce delivery targets so subagent, cron, stream-relay, session fallback, and restart messages land back in the originating thread, topic, or room casing. (#54840, #57056, #63143, #63228, #63506, #64343, #64391)</li>
+<li>Models/fallback: preserve <code>/models</code> selection across transient primary-model failures and config reloads, allow timeout cooldown probes, classify OpenRouter no-endpoints responses, detect llama.cpp context overflows, and keep provider/runtime context metadata stable through reloads. (#61472, #64196, #64471)</li>
+<li>Agents/BTW: keep <code>/btw</code> side questions working after tool-use turns by stripping replayed tool blocks, hidden reasoning, and malformed image payloads, omitting empty tool arrays, allowing Bedrock <code>auth: "aws-sdk"</code>, and routing Feishu <code>/btw</code> plus <code>/stop</code> through bounded out-of-band lanes. (#64218, #64219, #64225, #64324) Thanks @ngutman.</li>
+<li>Control UI/BTW: render <code>/btw</code> side results as dismissible ephemeral cards in the browser, send <code>/btw</code> immediately during active runs, and clear stale BTW cards on reset flows so webchat matches the intended detached side-question behavior. (#64290) Thanks @ngutman.</li>
+<li>Commands/targeting: use the selected agent or session for command output, send policy, usage/cost, context reports, model lists, bash sandbox hints, BTW/compact working directories, plugin commands, and session exports so multi-agent commands describe and mutate the intended target instead of the requester.</li>
+<li>Conversation bindings: normalize focused/current conversation ids, preserve binding metadata on account and Discord rebinds, avoid stale Discord lifecycle windows, and keep generic activity touches persisted so reply routing survives rebinds and restarts.</li>
+<li>iMessage/self-chat: distinguish normal DM outbound rows from true self-chat using <code>destination_caller_id</code> plus chat participants, preserve multi-handle self-chat aliases, drop ambiguous reflected echoes, and strip wrapped imsg RPC text fields. (#61619, #63868, #63980, #63989, #64000) Thanks @neeravmakwana.</li>
+<li>Matrix: keep multi-account room scoping consistent, keep packaged crypto migrations warning-only when appropriate, preserve ordered block streaming, add explicit Matrix block-streaming opt-in, and resolve verification/bootstrap from the packaged runtime entry. (#58449, #59249, #59266, #64373) Thanks @gumadeiras.</li>
+<li>Telegram/security: tighten Telegram <code>allowFrom</code> sender validation and keep <code>/whoami</code> allowlist reporting in sync with command auth checks.</li>
+<li>Agents/timeouts: extend the default LLM idle window to 120s and keep silent no-token idle timeouts on recovery paths, so slow models can retry or fall back before users see an error.</li>
+<li>Gateway/agents: preserve configured model selection and richer <code>IDENTITY.md</code> content across agent create/update flows and workspace moves, and fail safely instead of silently overwriting unreadable identity files. (#61577) Thanks @samzong.</li>
+<li>Skills/TaskFlow: restore valid frontmatter fences for the bundled <code>taskflow</code> and <code>taskflow-inbox-triage</code> skills and copy bundled <code>SKILL.md</code> files as hard dist-runtime copies so skills stay discoverable and loadable after updates. (#64166, #64469) Thanks @extrasmall0.</li>
+<li>Skills: respect overridden home directories when loading personal skills so service, test, and custom launch environments read the intended user skill directory instead of the process home.</li>
+<li>Windows/exec: settle supervisor waits from child exit state after stdout and stderr drain even when <code>close</code> never arrives, so CLI commands stop hanging or dying with forced <code>SIGKILL</code> on Windows. (#64072) Thanks @obviyus.</li>
+<li>Browser/sandbox: prevent sandbox browser CDP startup hangs by recreating containers when the browser security hash changes and by waiting on the correct sandbox browser lifecycle. (#62873) Thanks @Syysean.</li>
+<li>QQBot/streaming: make block streaming configurable per QQ bot account via <code>streaming.mode</code> (<code>"partial"</code> | <code>"off"</code>, default <code>"partial"</code>) instead of hardcoding it off, so responses can be delivered incrementally. (#63746)</li>
+<li>QQBot/config: allow extra fields in <code>channels.qqbot</code> and <code>channels.qqbot.accounts.*</code> so extended qqbot builds can add new config options without gateway startup failing on schema validation. (#64075) Thanks @WideLee.</li>
+<li>Dreaming/gateway: require <code>operator.admin</code> for persistent <code>/dreaming on|off</code> changes and treat missing gateway client scopes as unprivileged instead of silently allowing config writes. (#63872) Thanks @mbelinky.</li>
+<li>Gateway/pairing: prefer explicit QR bootstrap auth over earlier Tailscale auth classification so iOS <code>/pair qr</code> silent bootstrap pairing does not fall through to <code>pairing required</code>. (#59232) Thanks @ngutman.</li>
+<li>Browser/control: auto-generate browser-control auth tokens for <code>none</code> and <code>trusted-proxy</code> modes, and route browser auth/profile/doctor helpers through the public browser plugin facades. (#63280, #63957) Thanks @pgondhi987.</li>
+<li>Browser/act: centralize <code>/act</code> request normalization and execution dispatch while adding stable machine-readable route-level error codes for invalid requests, selector misuse, evaluate-disabled gating, target mismatch, and existing-session unsupported actions. (#63977) Thanks @joshavant.</li>
+<li>Security/QQBot: enforce media storage boundaries for all outbound local file paths and route image-size probes through SSRF-guarded media fetching instead of raw <code>fetch()</code>. (#63271, #63495) Thanks @pgondhi987.</li>
+<li>Channel setup: ignore workspace plugin shadows when resolving trusted channel setup catalog entries so onboarding and setup flows keep using the bundled, trusted setup contract.</li>
+<li>Gateway/memory startup: load the explicitly selected memory-slot plugin during gateway startup, while keeping restrictive allowlists and implicit default memory slots from auto-starting unrelated memory plugins. (#64423) Thanks @EronFan.</li>
+<li>Config/plugins: let config writes keep disabled plugin entries without forcing required plugin config schemas or crashing raw plugin validation, and avoid re-activating plugin registry state during schema checks. (#54971, #63296) Thanks @fuller-stack-dev.</li>
+<li>Config validation: surface the actual offending field for strict-schema union failures in bindings, including top-level unexpected keys on the matching ACP branch. (#40841) Thanks @Hollychou924.</li>
+<li>Wizard/plugin config: coerce integer-typed plugin config fields from interactive text input so integer schema values persist as numbers instead of failing validation. (#63346) Thanks @jalehman.</li>
+<li>Daemon/gateway install: preserve safe custom service env vars on forced reinstall, merge prior custom PATH segments behind the managed service PATH, and stop removed managed env keys from persisting as custom carryover. (#63136) Thanks @WarrenJones.</li>
+<li>Cron/scheduling: treat <code>nextRunAtMs &lt;= 0</code> as invalid across cron update, maintenance, timer, and stale-delivery paths so corrupted zero timestamps self-heal instead of causing immediate runs or skipped deliveries. (#63507) Thanks @WarrenJones.</li>
+<li>Cron/auth: resolve auth profiles consistently for isolated cron jobs so scheduled runs use the same configured provider credentials as interactive sessions. (#62797) Thanks @neeravmakwana.</li>
+<li>Tasks: let <code>openclaw tasks cancel</code> cancel stuck background tasks that never reached a normal terminal state. (#62506) Thanks @neeravmakwana.</li>
+<li>Sessions/model selection: preserve catalog-backed session model labels, provider-qualified context limits, and already-qualified session model refs when catalog metadata is unavailable, so model selection and memory/context budgets survive reloads without bogus provider prefixes. (#61382, #62493) Thanks @Mule-ME.</li>
+<li>Status: show configured fallback models in <code>/status</code> and shared session status cards so per-agent fallback configuration is visible before a live failover happens. (#33111) Thanks @AnCoSONG.</li>
+<li><code>/context detail</code> now compares the tracked prompt estimate with cached context usage and surfaces untracked provider/runtime overhead when present. (#28391) Thanks @ImLukeF.</li>
+<li>Gateway/sessions: scope bare <code>sessions.create</code> aliases like <code>main</code> to the requested agent while preserving the canonical <code>global</code> and <code>unknown</code> sentinel keys. (#58207) Thanks @jalehman.</li>
+<li>Gateway/session reset: emit the typed <code>before_reset</code> hook for gateway <code>/new</code> and <code>/reset</code>, preserving reset-hook behavior even when the previous transcript has already been archived. (#53872) Thanks @VACInc.</li>
+<li>Plugins/commands: pass the active host <code>sessionKey</code> into plugin command contexts, and include <code>sessionId</code> when it is already available from the active session entry, so bundled and third-party commands can resolve the current conversation reliably. (#59044) Thanks @jalehman.</li>
+<li>Agents/auth: honor <code>models.providers.*.authHeader</code> for pi embedded runner model requests by injecting <code>Authorization: Bearer &lt;apiKey&gt;</code> when requested. (#54390) Thanks @lndyzwdxhs.</li>
+<li>Claude CLI: clear inherited Anthropic auth/header environment aliases before spawning Claude Code and add sanitized CLI backend auth-env diagnostics for debugging gateway-run provider selection.</li>
+<li>Agents/failover: classify AbortError and stream-abort messages as timeout so Ollama NDJSON stream aborts stop showing <code>reason=unknown</code> in model fallback logs. (#58324) Thanks @yelog.</li>
+<li>Fireworks/FirePass: disable Kimi K2.5 Turbo reasoning output by forcing thinking off on the FirePass path and hardening the provider wrapper so hidden reasoning no longer leaks into visible replies. (#63607) Thanks @frankekn.</li>
+<li>Discord: update Carbon to v0.15.0. Thanks @thewilloftheshadow.</li>
+<li>Config/Discord: coerce safe integer numeric Discord IDs to strings during config validation, keep unsafe or precision-losing numeric snowflakes rejected, and align <code>openclaw doctor</code> repair guidance with the same fail-closed behavior. (#45125) Thanks @moliendocode.</li>
+<li>BlueBubbles/config: accept <code>enrichGroupParticipantsFromContacts</code> in the core strict config schema so gateways no longer fail validation or startup when the BlueBubbles plugin writes that field. (#56889) Thanks @zqchris.</li>
+<li>Feishu/webhooks: read webhook bodies through the pre-auth guard so unauthenticated webhook traffic stays under the same body budget as other protected channel ingress paths.</li>
+<li>Tools/web_fetch: add an opt-in <code>tools.web.fetch.ssrfPolicy.allowRfc2544BenchmarkRange</code> config so fake-IP proxy environments that resolve public sites into <code>198.18.0.0/15</code> can use <code>web_fetch</code> without weakening the default SSRF block. (#61830) Thanks @xing-xing-coder.</li>
+<li>Dreaming/cron: reconcile managed dreaming cron from startup config and runtime lifecycle changes, but only recover managed dreaming cron state during heartbeat-triggered dreaming checks so ordinary chat traffic does not recreate removed jobs. (#63873, #63929, #63938) Thanks @mbelinky.</li>
+<li>Memory/lancedb: accept <code>dreaming</code> config when <code>memory-lancedb</code> owns the memory slot so Dreaming surfaces can read slot-owner settings without schema rejection. (#63874) Thanks @mbelinky.</li>
+<li>Control UI/dreaming: keep the Dreaming trace area contained and scrollable so overlays no longer cover tabs or blow out the page layout. (#63875) Thanks @mbelinky.</li>
+<li>Dreaming/narrative: harden request-scoped diary fallback so scheduled dreaming only falls back on the dedicated subagent-runtime error, stop trusting spoofable raw error-code objects, and avoid leaking workspace paths when local fallback writes fail. (#64156) Thanks @mbelinky.</li>
+<li>Dreaming/diary: add idempotent narrative subagent runs, preserve restrictive <code>DREAMS.md</code> permissions during atomic writes, and surface temp cleanup failures so repeated sweeps do not double-run the same narrative request or silently weaken diary safety. (#63876) Thanks @mbelinky.</li>
+<li>Heartbeats/sessions: remove stale accumulated isolated heartbeat session keys when the next tick converges them back to the canonical sibling, so repaired sessions stop showing orphaned <code>:heartbeat:heartbeat</code> variants in session listings. (#59606) Thanks @rogerdigital.</li>
+<li>Gateway/run cleanup: fix stale run-context TTL cleanup so the new maintenance sweep resets orphaned run sequence state and prevents unbounded run-context growth. (#52731) Thanks @artwalker.</li>
+<li>UI/compaction: keep the compaction indicator in a retry-pending state until the run finishes, so the UI does not show <code>Context compacted</code> before compaction is complete. (#55132) Thanks @mpz4life.</li>
+<li>Cron/tool schemas: keep cron tool schemas strict-model-friendly while still preserving <code>failureAlert=false</code>, nullable <code>agentId</code>/<code>sessionKey</code>, and flattened add/update recovery for the newly exposed cron job fields. (#55043) Thanks @brunolorente.</li>
+<li>Git metadata: read commit ids from packed refs as well as loose refs so version and status metadata stay accurate after repository maintenance. (#63943)</li>
+<li>Gateway: keep <code>commands.list</code> skill entries categorized under tools and include provider-aware plugin <code>nativeName</code> metadata even when <code>scope=text</code>, so remote clients can group skills correctly and map text-surface plugin commands back to native aliases. (#64147)</li>
+<li>TUI: reset footer activity to idle when switching sessions so a stale streaming indicator cannot persist after the selection changes. (#63988) Thanks @neeravmakwana.</li>
+<li>Claude CLI: stop marking spawned Claude Code runs as host-managed so they keep using normal CLI subscription behavior. (#64023) Thanks @Alex-Alaniz.</li>
+<li>Codex auth: brand Codex OAuth flows as OpenClaw in user-visible auth prompts and diagnostics.</li>
+<li>Gateway/pairing: fail closed for paired device records that have no device tokens, and reject pairing approvals whose requested scopes do not match the requested device roles.</li>
+<li>ACP/gateway chat: classify lifecycle errors before forwarding them to ACP clients so refusals use ACP's refusal stop reason while transient backend errors continue to finish as normal turns.</li>
+<li>Claude CLI/skills: pass eligible OpenClaw skills into CLI runs, including native Claude Code skill resolution via a temporary plugin plus per-run skill env/API key injection. (#62686, #62723) Thanks @zomars.</li>
+<li>Discord: keep generated auto-thread names working with reasoning models by giving title generation enough output budget for thinking plus visible title text. (#64172) Thanks @hanamizuki.</li>
+<li>Heartbeat: ignore doc-only Markdown fence markers in the default <code>HEARTBEAT.md</code> template so comment-only heartbeat scaffolds skip API calls again. (#61690, #63434) Thanks @ravyg.</li>
+<li>Reply/skills: keep resolved skill and memory secret config stable through embedded reply runs so raw SecretRefs in secondary skill settings no longer crash replies when the gateway already has the live env. (#64249) Thanks @mbelinky.</li>
+<li>Dreaming/startup: keep plugin-registered startup hooks alive across workspace hook reloads and include dreaming startup owners in the gateway startup plugin scope, so managed Dreaming cron registration comes back reliably after gateway boot. (#62327, #64258) Thanks @mbelinky.</li>
+<li>Plugins: treat duplicate <code>registerService</code> calls from the same plugin id as idempotent so snapshot and activation loads no longer emit spurious <code>service already registered</code> diagnostics. (#62033, #64128) Thanks @ly85206559.</li>
+<li>Discord/TTS: route auto voice replies through the native voice-note path so Discord receives Opus voice messages instead of regular audio attachments. (#64096) Thanks @LiuHuaize.</li>
+<li>Config/plugins: use plugin-owned command alias metadata when <code>plugins.allow</code> contains runtime command names like <code>dreaming</code>, and point users at the owning plugin instead of stale plugin-not-found guidance. (#64191, #64242) Thanks @feiskyer.</li>
+<li>Agents/Gemini: strip orphaned <code>required</code> entries from Gemini tool schemas so provider validation no longer rejects tools after schema cleanup or union flattening. (#64284) Thanks @xxxxxmax.</li>
+<li>Assistant text: strip Qwen-style XML tool call payloads from visible replies so web and channel messages no longer show raw <code>&lt;tool_call&gt;&lt;function=...&gt;</code> output. (#63999, #64214) Thanks @MoerAI.</li>
+<li>Daemon/gateway: prevent systemd restart storms on configuration errors by exiting with <code>EX_CONFIG</code> and adding generated unit restart-prevention guards. (#63913) Thanks @neo1027144-creator.</li>
+<li>Agents/exec: prevent gateway crash ("Agent listener invoked outside active run") when a subagent exec tool produces stdout/stderr after the agent run has ended or been aborted. (#62821) Thanks @openperf.</li>
+<li>Gateway/OpenAI compat: return real <code>usage</code> for non-stream <code>/v1/chat/completions</code> responses, emit the final usage chunk when <code>stream_options.include_usage=true</code>, and bound usage-gated stream finalization after lifecycle end. (#62986) Thanks @Lellansin.</li>
+<li>Matrix/migration: keep packaged warning-only crypto migrations from being misclassified as actionable when only helper chunks are present, so startup and doctor stay on the warning-only path instead of creating unnecessary migration snapshots. (#64373) Thanks @gumadeiras.</li>
+<li>Matrix/ACP thread bindings: preserve canonical room casing and parent conversation routing during ACP session spawn so mixed-case room ids bind correctly from top-level rooms and existing Matrix threads. (#64343) Thanks @gumadeiras.</li>
+<li>Agents/subagents: deduplicate delivered completion announces so retry or re-entry cleanup does not inject duplicate internal-context completion turns into the parent session. (#61525) Thanks @100yenadmin.</li>
+<li>Agents/exec: keep sandboxed <code>tools.exec.host=auto</code> sessions from honoring per-call <code>host=node</code> or <code>host=gateway</code> overrides while a sandbox runtime is active, and stop advertising node routing in that state so exec stays on the sandbox host. (#63880)</li>
+<li>Agents/subagents: preserve archived delete-mode runs until <code>sessions.delete</code> succeeds and prevent overlapping archive sweeps from duplicating in-flight cleanup attempts. (#61801) Thanks @100yenadmin.</li>
+<li>Cron/isolated agent: run scheduled agent turns as non-owner senders so owner-only tools stay unavailable during cron execution. (#63878)</li>
+<li>Discord/sandbox: include <code>image</code> in sandbox media param normalization so Discord event cover images cannot bypass sandbox path rewriting. (#64377) Thanks @mmaps.</li>
+<li>Agents/exec: extend exec completion detection to cover local background exec formats so the owner-downgrade fires correctly for all exec paths. (#64376) Thanks @mmaps.</li>
+<li>Security/dependencies: pin axios to 1.15.0 and add a plugin install dependency denylist that blocks known malicious packages before install. (#63891) Thanks @mmaps.</li>
+<li>Browser/security: apply the three-phase interaction navigation guard to <code>pressKey</code> and <code>type(submit)</code> so delayed JS redirects triggered by a keypress cannot bypass SSRF policy. (#63889) Thanks @mmaps.</li>
+</ul>
+<ul>
+<li>Browser/security: guard existing-session Chrome MCP interaction routes with SSRF post-checks so delayed navigation from click, type, press, and evaluate cannot bypass the configured policy. (#64370) Thanks @eleqtrizit.</li>
+<li>Browser/security: default browser SSRF policy to strict mode so unconfigured installs block private-network navigation, and align external-content marker span mapping so ZWS-injected boundary spoofs are fully sanitized. (#63885) Thanks @eleqtrizit.</li>
+<li>Browser/security: apply SSRF navigation policy to subframe document navigations so iframe-targeted private-network hops are blocked without quarantining the parent page. (#64371) Thanks @eleqtrizit.</li>
+<li>Hooks/security: mark agent hook system events as untrusted and sanitize hook display names before cron metadata reuse. (#64372) Thanks @eleqtrizit.</li>
+<li>Daemon/launchd: keep <code>openclaw gateway stop</code> persistent without uninstalling the macOS LaunchAgent, re-enable it on explicit restart or repair, and harden launchd label handling. (#64447) Thanks @ngutman.</li>
+<li>Plugins/context engines: preserve <code>plugins.slots.contextEngine</code> through normalization and keep explicitly selected workspace context-engine plugins enabled, so loader diagnostics and plugin activation stop dropping that slot selection. (#64192) Thanks @hclsys.</li>
+<li>Heartbeat: stop top-level <code>interval:</code> and <code>prompt:</code> fields outside the <code>tasks:</code> block from bleeding into the last parsed heartbeat task. (#64488) Thanks @Rahulkumar070.</li>
+<li>Agents/OpenAI replay: preserve malformed function-call arguments in stored assistant history, avoid double-encoding preserved raw strings on replay, and coerce replayed string args back to objects at Anthropic and Google provider boundaries. (#61956) Thanks @100yenadmin.</li>
+<li>Heartbeat/config: accept and honor <code>agents.defaults.heartbeat.timeoutSeconds</code> and per-agent heartbeat timeout overrides for heartbeat agent turns. (#64491) Thanks @cedillarack.</li>
+<li>CLI/devices: make implicit <code>openclaw devices approve</code> selection preview-only and require approving the exact request ID, preventing latest-request races during device pairing. (#64160) Thanks @coygeek.</li>
+<li>Media/security: honor sender-scoped <code>toolsBySender</code> policy for outbound host-media reads so denied senders cannot trigger host file disclosure via attachment hydration. (#64459) Thanks @eleqtrizit.</li>
+<li>Browser/security: reject strict-policy hostname navigation unless the hostname is an explicit allowlist exception or IP literal, and route CDP HTTP discovery through the pinned SSRF fetch path. (#64367) Thanks @eleqtrizit.</li>
+<li>Models/vLLM: ignore empty <code>tool_calls</code> arrays from reasoning-model OpenAI-compatible replies, reset false <code>toolUse</code> stop reasons when no actual tool calls were parsed, and stop sending <code>tool_choice</code> unless tools are present so vLLM reasoning responses no longer hang indefinitely. (#61197, #61534) Thanks @balajisiva.</li>
+<li>Heartbeat/scheduling: spread interval heartbeats across stable per-agent phases derived from gateway identity, so provider traffic is distributed more uniformly across the configured interval instead of clustering around startup-relative times. (#64560) Thanks @odysseus0.</li>
+</ul>
+<p><a href="https://github.com/openclaw/openclaw/blob/main/CHANGELOG.md">View full changelog</a></p>
+]]></description>
+            <enclosure url="https://github.com/openclaw/openclaw/releases/download/v2026.4.10/OpenClaw-2026.4.10.zip" length="47259509" type="application/octet-stream" sparkle:edSignature="XY9FHxx09r2O9rlFs3t5UV9Zk2rGXSpWw5InazJhb661kgp6OKiOrrNTV631b2StWze5tnSEPXakkOCXq7O6DQ=="/>
+        </item>
        <item>
            <title>2026.4.9</title>
            <pubDate>Thu, 09 Apr 2026 02:38:08 +0000</pubDate>
@@ -59,135 +246,5 @@
 ]]></description>
            <enclosure url="https://github.com/openclaw/openclaw/releases/download/v2026.4.9/OpenClaw-2026.4.9.zip" length="25336730" type="application/octet-stream" sparkle:edSignature="zFKTcKpejPyGEHj6Bdop3EBDfRrHyQMtJzrpVKsIkBq3I/jbTNvsxQveKEy9r7dqkZVsldFYv7eSunP3SUmaAw=="/>
        </item>
-        <item>
-            <title>2026.4.8</title>
-            <pubDate>Wed, 08 Apr 2026 06:12:50 +0000</pubDate>
-            <link>https://raw.githubusercontent.com/openclaw/openclaw/main/appcast.xml</link>
-            <sparkle:version>2026040890</sparkle:version>
-            <sparkle:shortVersionString>2026.4.8</sparkle:shortVersionString>
-            <sparkle:minimumSystemVersion>15.0</sparkle:minimumSystemVersion>
-            <description><![CDATA[<h2>OpenClaw 2026.4.8</h2>
-<h3>Fixes</h3>
-<ul>
-<li>Telegram/setup: load setup and secret contracts through packaged top-level sidecars so installed npm builds no longer try to import missing <code>dist/extensions/telegram/src/*</code> files during gateway startup.</li>
-<li>Bundled channels/setup: load shared secret contracts through packaged top-level sidecars across BlueBubbles, Feishu, Google Chat, IRC, Matrix, Mattermost, Microsoft Teams, Nextcloud Talk, Slack, and Zalo so installed npm builds no longer rely on missing <code>dist/extensions/*/src/*</code> files during gateway startup.</li>
-<li>Bundled plugins: align packaged plugin compatibility metadata with the release version so bundled channels and providers load on OpenClaw 2026.4.8.</li>
-<li>Agents/progress: keep <code>update_plan</code> available for OpenAI-family runs while returning compact success payloads and allowing <code>tools.experimental.planTool=false</code> to opt out.</li>
-<li>Agents/exec: keep <code>/exec</code> current-default reporting aligned with real runtime behavior so <code>host=auto</code> sessions surface the correct host-aware fallback policy (<code>full/off</code> on gateway or node, <code>deny/off</code> on sandbox) instead of stale stricter defaults.</li>
-<li>Slack: honor ambient HTTP(S) proxy settings for Socket Mode WebSocket connections, including NO_PROXY exclusions, so proxy-only deployments can connect without a monkey patch. (#62878) Thanks @mjamiv.</li>
-<li>Slack/actions: pass the already resolved read token into <code>downloadFile</code> so SecretRef-backed bot tokens no longer fail after a raw config re-read. (#62097) Thanks @martingarramon.</li>
-<li>Network/fetch guard: skip target DNS pinning when trusted env-proxy mode is active so proxy-only sandboxes can let the trusted proxy resolve outbound hosts. (#59007) Thanks @cluster2600.</li>
-</ul>
-<p><a href="https://github.com/openclaw/openclaw/blob/main/CHANGELOG.md">View full changelog</a></p>
-]]></description>
-            <enclosure url="https://github.com/openclaw/openclaw/releases/download/v2026.4.8/OpenClaw-2026.4.8.zip" length="25324810" type="application/octet-stream" sparkle:edSignature="aogl3hJf+FeRvQj0W4WDGMQnIRPpxXPQam50U7SBT3ljA1CeSbIGsnaj20aLF0Qc9DikPEXt5AEg7LMOen4+BQ=="/>
-        </item>
-        <item>
-            <title>2026.4.7</title>
-            <pubDate>Wed, 08 Apr 2026 02:54:26 +0000</pubDate>
-            <link>https://raw.githubusercontent.com/openclaw/openclaw/main/appcast.xml</link>
-            <sparkle:version>2026040790</sparkle:version>
-            <sparkle:shortVersionString>2026.4.7</sparkle:shortVersionString>
-            <sparkle:minimumSystemVersion>15.0</sparkle:minimumSystemVersion>
-            <description><![CDATA[<h2>OpenClaw 2026.4.7</h2>
-<h3>Changes</h3>
-<ul>
-<li>CLI/infer: add a first-class <code>openclaw infer ...</code> hub for provider-backed inference workflows across model, media, web, and embedding tasks. Thanks @Takhoffman.</li>
-<li>Tools/media generation: auto-fallback across auth-backed image, music, and video providers by default, preserve intent during provider switches, remap size/aspect/resolution/duration hints to the closest supported option, and surface provider capabilities plus mode-aware video-to-video support.</li>
-<li>Memory/wiki: restore the bundled <code>memory-wiki</code> stack with plugin, CLI, sync/query/apply tooling, memory-host integration, structured claim/evidence fields, compiled digest retrieval, claim-health linting, contradiction clustering, staleness dashboards, and freshness-weighted search. Thanks @vincentkoc.</li>
-<li>Plugins/webhooks: add a bundled webhook ingress plugin so external automation can create and drive bound TaskFlows through per-route shared-secret endpoints. (#61892) Thanks @mbelinky.</li>
-<li>Gateway/sessions: add persisted compaction checkpoints plus Sessions UI branch/restore actions so operators can inspect and recover pre-compaction session state. (#62146) Thanks @scoootscooob.</li>
-<li>Compaction: add pluggable compaction provider registry so plugins can replace the built-in summarization pipeline. Configure via <code>agents.defaults.compaction.provider</code>; falls back to LLM summarization on provider failure. (#56224) Thanks @DhruvBhatia0.</li>
-<li>Agents/system prompt: add <code>agents.defaults.systemPromptOverride</code> for controlled prompt experiments plus heartbeat prompt-section controls so heartbeat runtime behavior can stay enabled without injecting heartbeat instructions every turn.</li>
-<li>Providers/Google: add Gemma 4 model support and keep Google fallback resolution on the requested provider path so native Google Gemma routes work again. (#61507) Thanks @eyjohn.</li>
-<li>Providers/Google: preserve explicit thinking-off semantics for Gemma 4 while still enabling Gemma reasoning support in compatibility wrappers. (#62127) Thanks @romgenie.</li>
-<li>Providers/Arcee AI: add a bundled Arcee AI provider plugin with Trinity catalog entries, OpenRouter support, and updated onboarding/auth guidance. (#62068) Thanks @arthurbr11.</li>
-<li>Providers/Anthropic: restore Claude CLI as the preferred local Anthropic path in onboarding, model-auth guidance, doctor flows, and Docker Claude CLI live lanes again.</li>
-<li>Providers/Ollama: detect vision capability from the <code>/api/show</code> response and set image input on models that support it so Ollama vision models accept image attachments. (#62193) Thanks @BruceMacD.</li>
-<li>Memory/dreaming: ingest redacted session transcripts into the dreaming corpus with per-day session-corpus notes, cursor checkpointing, and promotion/doctor support. (#62227) Thanks @vignesh07.</li>
-<li>Providers/inferrs: add string-content compatibility for stricter OpenAI-compatible chat backends, document <code>inferrs</code> setup with a full config example, and add troubleshooting guidance for local backends that pass direct probes but fail on full agent-runtime prompts.</li>
-<li>Agents/context engine: expose prompt-cache runtime context to context engines and keep current-turn prompt-cache usage aligned with the active attempt instead of stale prior-turn assistant state. (#62179) Thanks @jalehman.</li>
-<li>Plugin SDK/context engines: pass <code>availableTools</code> and <code>citationsMode</code> into <code>assemble()</code>, and expose memory-artifact and memory-prompt seams so companion plugins and non-legacy context engines can consume active memory state without reaching into internals. Thanks @vincentkoc.</li>
-<li>ACP/ACPX plugin: bump the bundled <code>acpx</code> pin to <code>0.5.1</code> so plugin-local installs and strict version checks pick up the latest published runtime release. (#62148) Thanks @onutc.</li>
-<li>Discord/events: allow <code>event-create</code> to accept a cover image URL or local file path, load and validate PNG/JPG/GIF event cover media, and pass the encoded image payload through Discord admin action/runtime paths. (#60883) Thanks @bittoby.</li>
-</ul>
-<h3>Fixes</h3>
-<ul>
-<li>CLI/infer: keep provider-backed infer behavior aligned with actual runtime execution by fixing explicit TTS override handling, profile-aware gateway TTS prefs resolution, per-request transcription <code>prompt</code>/<code>language</code> overrides, image output MIME/extension mismatches, configured web-search fallback behavior, and agent-vs-CLI web-search execution drift.</li>
-<li>Plugins/media: when <code>plugins.allow</code> is set, capability fallback now merges bundled capability plugin ids into the allowlist (not only <code>plugins.entries</code>), so media understanding providers such as OpenAI-compatible STT load for voice transcription without requiring <code>openai</code> in <code>plugins.allow</code>. (#62205) Thanks @neeravmakwana.</li>
-<li>Agents/history and replies: buffer phaseless OpenAI WS text until a real assistant phase arrives, keep replay and SSE history sequence tracking aligned, hide commentary and leaked tool XML from user-visible history, and keep history-based follow-up replies on <code>final_answer</code> text only. (#61729, #61747, #61829, #61855, #61954) Thanks @100yenadmin and contributors.</li>
-<li>Control UI: show <code>/tts</code> audio replies in webchat, detect mistaken <code>?token=</code> auth links with the correct <code>#token=</code> hint, and keep Copy, Canvas, and mobile exec-approval UI from covering chat content on narrow screens. (#54842, #61514, #61598) Thanks @neeravmakwana.</li>
-<li>iOS/gateway: replace string-matched connection error UI with structured gateway connection problems, preserve actionable pairing/auth failures over later generic disconnect noise, and surface reusable problem banners and details across onboarding, settings, and root status surfaces. (#62650) Thanks @ngutman.</li>
-<li>TUI: route <code>/status</code> through the shared session-status command, keep commentary hidden in history, strip raw envelope metadata from async command notices, preserve fallback streaming before per-attempt failures finalize, and restore Kitty keyboard state on exit or fatal crashes. (#49130, #59985, #60043, #61463) Thanks @biefan and contributors.</li>
-<li>iOS/Watch exec approvals: keep Apple Watch review and approval recovery working while the iPhone is locked or backgrounded, including reconnect recovery, pending approval persistence, notification cleanup, and APNs-backed watch refresh recovery. (#61757) Thanks @ngutman.</li>
-<li>Agents/context overflow: combine oversized and aggregate tool-result recovery in one pass and restore a total-context overflow backstop so recoverable sessions retry instead of failing early. (#61651) Thanks @Takhoffman.</li>
-<li>Auth/OpenAI Codex OAuth: reload fresh on-disk credentials inside the locked refresh path and retry once after <code>refresh_token_reused</code> rotates only the stored refresh token, so relogin/restart recovery stops getting stuck on stale cached auth state. Thanks @owen-ever.</li>
-<li>Auth/OpenAI Codex OAuth: keep native <code>/model ...@profile</code> selections on the target session and honor explicit user-locked auth profiles even when per-agent auth order excludes them. (#62744) Thanks @jalehman.</li>
-<li>Providers/Anthropic: preserve thinking blocks for Claude Opus 4.5+, Sonnet 4.5+, and newer Claude 4-family models so prompt-cache prefixes keep matching, and skip <code>service_tier</code> injection on OAuth-authenticated stream wrapper requests so Claude OAuth streaming stops failing with HTTP 401. (#60356, #61793)</li>
-<li>Agents/Claude CLI: surface nested API error messages from structured CLI output so billing/auth/provider failures show the real provider error instead of an opaque CLI failure.</li>
-<li>Agents/exec: preserve explicit <code>host=node</code> routing under elevated defaults when <code>tools.exec.host=auto</code>, fail loud on invalid elevated cross-host overrides, and keep <code>strictInlineEval</code> commands blocked after approval timeouts instead of falling through to automatic execution. (#61739) Thanks @obviyus.</li>
-<li>Nodes/exec approvals: keep <code>host=node</code> POSIX transport shell wrappers (<code>/bin/sh -lc ...</code>) aligned with inner-command allowlist analysis so allowlisted scripts stop prompting unnecessarily, while Windows <code>cmd.exe</code> wrapper runs stay approval-gated. (#62401) Thanks @ngutman.</li>
-<li>Nodes/exec approvals: keep Windows <code>cmd.exe /c</code> wrapper runs approval-gated even when <code>env</code> carriers, including env-assignment carriers, wrap the shell invocation. (#62439) Thanks @ngutman.</li>
-<li>Gateway tool/exec config: block model-facing <code>gateway config.apply</code> and <code>config.patch</code> writes from changing exec approval paths such as <code>safeBins</code>, <code>safeBinProfiles</code>, <code>safeBinTrustedDirs</code>, and <code>strictInlineEval</code>, while still allowing unchanged structured values through. (#62001) Thanks @eleqtrizit.</li>
-<li>Host exec/env sanitization: block dangerous Java, Rust, Cargo, Git, Kubernetes, cloud credential, config-path, and Helm env overrides so host-run tools cannot be redirected to attacker-chosen code, config, credentials, or repository state. (#59119, #62002, #62291) Thanks @eleqtrizit and contributors.</li>
-<li>Commands/allowlist: require owner authorization for <code>/allowlist add</code> and <code>/allowlist remove</code> before channel resolution, so non-owner but command-authorized senders can no longer persistently rewrite allowlist policy state. (#62383) Thanks @pgondhi987.</li>
-<li>Feishu/docx uploads: honor <code>tools.fs.workspaceOnly</code> for local <code>upload_file</code> and <code>upload_image</code> paths by forwarding workspace-constrained <code>localRoots</code> into the media loader, so docx uploads can no longer read host-local files outside the workspace when workspace-only mode is active. (#62369) Thanks @pgondhi987.</li>
-<li>Network/fetch guard: drop request bodies and body-describing headers on cross-origin <code>307</code> and <code>308</code> redirects by default, so attacker-controlled redirect hops cannot receive secret-bearing POST payloads from SSRF-guarded fetch flows unless a caller explicitly opts in. (#62357) Thanks @pgondhi987.</li>
-<li>Browser/SSRF: treat main-frame <code>document</code> redirect hops as navigations even when Playwright does not flag them as <code>isNavigationRequest()</code>, so strict private-network blocking still stops forbidden redirect pivots before the browser reaches the internal target. (#62355) Thanks @pgondhi987.</li>
-<li>Browser/node invoke: block persistent browser profile create, reset, and delete mutations through <code>browser.proxy</code> on both gateway-forwarded <code>node.invoke</code> and the node-host proxy path, even when no profile allowlist is configured. (#60489)</li>
-<li>Gateway/node pairing: require a fresh pairing request when a previously paired node reconnects with additional declared commands, and keep the live session pinned to the earlier approved command set until the upgrade is approved. (#62658) Thanks @eleqtrizit.</li>
-<li>Gateway/auth: invalidate existing shared-token and password WebSocket sessions when the configured secret rotates, so stale authenticated sockets cannot stay attached after token or password changes. (#62350) Thanks @pgondhi987.</li>
-<li>MS Teams/security: validate file-consent upload URLs against HTTPS, Microsoft/SharePoint host allowlists, and private-IP DNS checks before uploading attachments, blocking SSRF-style consent-upload abuse. (#23596)</li>
-<li>Media/base64 decode guards: enforce byte limits before decoding missed base64-backed Teams, Signal, QQ Bot, and image-tool payloads so oversized inbound media and data URLs no longer bypass pre-decode size checks. (#62007) Thanks @eleqtrizit.</li>
-<li>Runtime event trust: mark background <code>notifyOnExit</code> summaries, ACP parent-stream relays, and wake-hook payloads as untrusted system events so lower-trust runtime output no longer re-enters later turns as trusted <code>System:</code> text. (#62003)</li>
-<li>Auto-reply/media: allow managed generated-media <code>MEDIA:</code> paths from normal reply text again while still blocking arbitrary host-local media and document paths, so generated media keep delivering without reopening host-path injection holes.</li>
-<li>Gateway/status and containers: auto-bind to <code>0.0.0.0</code> inside Docker and Podman environments, and probe local TLS gateways over <code>wss://</code> with self-signed fingerprint forwarding so container startup and loopback TLS status checks work again. (#61818, #61935) Thanks @openperf and contributors.</li>
-<li>Gateway/OpenAI-compatible HTTP: abort in-flight <code>/v1/chat/completions</code> and <code>/v1/responses</code> turns when clients disconnect so abandoned HTTP requests stop wasting agent runtime. (#54388) Thanks @Lellansin.</li>
-<li>macOS/gateway version: strip trailing commit metadata from CLI version output before semver parsing so the Mac app recognizes installed gateway versions like <code>OpenClaw 2026.4.2 (d74a122)</code> again. (#61111) Thanks @oliviareid-svg.</li>
-<li>Sessions/model selection: resolve the explicitly selected session model separately from runtime fallback resolution so session status and live model switching stay aligned with the chosen model.</li>
-<li>Discord/ACP bindings: canonicalize DM conversation identity across inbound messages, component interactions, native commands, and current-conversation binding resolution so <code>--bind here</code> in Discord DMs keeps routing follow-up replies to the bound agent instead of falling back to the default agent.</li>
-<li>Discord: recover forwarded referenced message text and attachments when snapshots are missing, use <code>ws://</code> again for gateway monitor sockets, stop forcing a hardcoded temperature for Codex-backed auto-thread titles, and harden voice receive recovery so rapid speaker restarts keep their next utterance. (#41536, #61670) Thanks @artwalker and contributors.</li>
-<li>Slack/thread mentions: add <code>channels.slack.thread.requireExplicitMention</code> so Slack channels that already require mentions can also require explicit <code>@bot</code> mentions inside bot-participated threads. (#58276) Thanks @praktika-engineer.</li>
-<li>Slack/threading: keep legacy thread stickiness for real replies when older callers omit <code>isThreadReply</code>, while still honoring <code>replyToMode</code> for Slack's auto-created top-level <code>thread_ts</code>. (#61835) Thanks @kaonash.</li>
-<li>Slack/media: keep attachment downloads on the SSRF-guarded dispatcher path so Slack media fetching works on Node 22 without dropping pinned transport enforcement. (#62239) Thanks @openperf.</li>
-<li>Matrix/onboarding: add an invite auto-join setup step with explicit off warnings and strict stable-target validation so new Matrix accounts stop silently ignoring invited rooms and fresh DM-style invites unless operators opt in. (#62168) Thanks @gumadeiras.</li>
-<li>Matrix/formatting: preserve multi-paragraph and loose-list rendering in Element so numbered and bulleted Markdown keeps their content attached to the correct list item. (#60997) Thanks @gucasbrg.</li>
-<li>Telegram/doctor: keep top-level access-control fallback in place during multi-account normalization while still promoting legacy default auth into <code>accounts.default</code>, so existing named bots keep inherited allowlists without dropping the legacy default bot. (#62263) Thanks @obviyus.</li>
-<li>Plugins/loaders: centralize bundled <code>dist/**</code> Jiti native-load policy and keep channel, public-surface, facade, and config-metadata loader seams off native Jiti on Windows so onboarding and configure flows stop tripping <code>ERR_UNSUPPORTED_ESM_URL_SCHEME</code>. (#62286) Thanks @chen-zhang-cs-code.</li>
-<li>Plugins/channels: keep bundled channel artifact and secret-contract loading stable under lazy loading, preserve plugin-schema defaults during install, and fix Windows <code>file://</code> plus native-Jiti plugin loader paths so onboarding, doctor, <code>openclaw secret</code>, and bundled plugin installs work again. (#61832, #61836, #61853, #61856) Thanks @Zeesejo and contributors.</li>
-<li>Plugins/ClawHub: verify downloaded plugin archives against version metadata SHA-256, fail closed when archive integrity metadata is missing or malformed, and tighten fallback ZIP verification so plugin installs cannot proceed on mismatched or incomplete ClawHub package metadata. (#60517) Thanks @mappel-nv.</li>
-<li>Plugins/provider hooks: stop recursive provider snapshot loads from overflowing the stack during plugin initialization, while still preserving cached nested provider-hook results. (#61922, #61938, #61946, #61951)</li>
-<li>Docker/plugins: stop forcing bundled plugin discovery to <code>/app/extensions</code> in runtime images so packaged installs use compiled <code>dist/extensions</code> artifacts again and Node 24 containers do not boot through source-only plugin entry paths. Fixes #62044. (#62316) Thanks @gumadeiras.</li>
-<li>Providers/Ollama: honor the selected provider's <code>baseUrl</code> during streaming so multi-Ollama setups stop routing every stream to the first configured Ollama endpoint. (#61678)</li>
-<li>Providers/Ollama: stop warning that Ollama could not be reached when discovery only sees empty default local stubs, while still keeping real explicit Ollama overrides loud when the endpoint is unreachable.</li>
-<li>Providers/xAI: recognize <code>api.grok.x.ai</code> as an xAI-native endpoint again and keep legacy <code>x_search</code> auth resolution working so older xAI web-search configs continue to load. (#61377) Thanks @jjjojoj.</li>
-<li>Providers/Mistral: send <code>reasoning_effort</code> for <code>mistral/mistral-small-latest</code> (Mistral Small 4) with thinking-level mapping, and mark the catalog entry as reasoning-capable so adjustable reasoning matches Mistral’s Chat Completions API. (#62162) Thanks @neeravmakwana.</li>
-<li>OpenAI TTS/Groq: send <code>wav</code> to Groq-compatible speech endpoints, honor explicit <code>responseFormat</code> overrides on OpenAI-compatible paths, and only mark voice-note output as voice-compatible when the actual format is <code>opus</code>. (#62233) Thanks @neeravmakwana.</li>
-<li>Tools/web_fetch and web_search: fix <code>TypeError: fetch failed</code> caused by undici 8.0 enabling HTTP/2 by default; pinned SSRF-guard dispatchers now explicitly set <code>allowH2: false</code> to restore HTTP/1.1 behavior and keep the custom DNS-pinning lookup compatible. (#61738, #61777) Thanks @zozo123.</li>
-<li>Tools/web search/Exa: show Exa Search in onboarding and configure provider pickers again by marking the bundled Exa provider as setup-visible. Thanks @vincentkoc.</li>
-<li>Memory/vector recall: surface explicit warnings when <code>sqlite-vec</code> is unavailable or vector writes are degraded, and strip managed Light Sleep and REM blocks before daily-note ingestion so memory indexing and dreaming stop reporting false-success or re-ingesting staged output. (#61720) Thanks @MonkeyLeeT.</li>
-<li>Memory/dreaming: make Dreams config reads and writes respect the selected memory slot plugin instead of always targeting <code>memory-core</code>. (#62275) Thanks @SnowSky1.</li>
-<li>QQ Bot/media: route gateway-side attachment and fallback downloads through guarded QQ/Tencent HTTPS fetches so QQ media handling no longer follows arbitrary remote hosts.</li>
-<li>Browser/remote CDP: retry the DevTools websocket once after remote browser restarts so healthy remote browser profiles do not fail availability checks during CDP warm-up. (#57397) Thanks @ThanhNguyxn07.</li>
-<li>UI/light mode: target both root and nested WebKit scrollbar thumbs in the light theme so page-level and container scrollbars stay visible on light backgrounds. (#61753) Thanks @chziyue.</li>
-<li>Agents/subagents: honor <code>sessions_spawn(lightContext: true)</code> for spawned subagent runs by preserving lightweight bootstrap context through the gateway and embedded runner instead of silently falling back to full workspace bootstrap injection. (#62264) Thanks @theSamPadilla.</li>
-<li>Cron: load <code>jobId</code> into <code>id</code> when the on-disk store omits <code>id</code>, matching doctor migration and fixing <code>unknown cron job id</code> for hand-edited <code>jobs.json</code>. (#62246) Thanks @neeravmakwana.</li>
-<li>Agents/model fallback: classify minimal HTTP 404 API errors (for example <code>404 status code (no body)</code>) as <code>model_not_found</code> so assistant failures throw into the fallback chain instead of stopping at the first fallback candidate. (#62119) Thanks @neeravmakwana.</li>
-<li>BlueBubbles/network: respect explicit private-network opt-out for loopback and private <code>serverUrl</code> values across account resolution, status probes, monitor startup, and attachment downloads, while keeping public-host attachment hostname pinning intact. (#59373) Thanks @jpreagan.</li>
-<li>Agents/heartbeat: keep heartbeat runs pinned to the main session so active subagent transcripts are not overwritten by heartbeat status messages. (#61803) Thanks @100yenadmin.</li>
-<li>Agents/heartbeat: respect disabled heartbeat prompt guidance so operators can suppress heartbeat prompt instructions without disabling heartbeat runtime behavior.</li>
-<li>Agents/compaction: stop compaction-wait aborts from re-entering prompt failover and replaying completed tool turns. (#62600) Thanks @i-dentifier.</li>
-<li>Approvals/runtime: move native approval lifecycle assembly into shared core bootstrap/runtime seams driven by channel capabilities and runtime contexts, and remove the legacy bundled approval fallback wiring. (#62135) Thanks @gumadeiras.</li>
-<li>Security/fetch-guard: stop rejecting operator-configured proxy hostnames against the target-scoped hostname allowlist in SSRF-guarded fetches, restoring proxy-based media downloads for Telegram and other channels. (#62312) Thanks @ademczuk.</li>
-<li>Logging: make <code>logging.level</code> and <code>logging.consoleLevel</code> honor the documented severity threshold ordering again, and keep child loggers inheriting the parent <code>minLevel</code>. (#44646) Thanks @zhumengzhu.</li>
-<li>Agents/sessions_send: pass <code>threadId</code> through announce delivery so cross-session notifications land in the correct Telegram forum topic instead of the group's general thread. (#62758) Thanks @jalehman.</li>
-<li>Daemon/systemd: keep sudo systemctl calls scoped to the invoking user when machine-scoped systemctl fails, while still avoiding machine fallback for permission-denied user bus errors. (#62337) Thanks @Aftabbs.</li>
-<li>Docs/i18n: relocalize final localized-page links after translation and remove the zh-CN homepage redirect override so localized Mintlify pages resolve to the correct language roots again. (#61796) Thanks @hxy91819.</li>
-<li>Agents/exec: keep timed-out shell-backgrounded commands on the failed path and point long-running jobs to exec background/yield sessions so process polling is only suggested for registered sessions.</li>
-</ul>
-<p><a href="https://github.com/openclaw/openclaw/blob/main/CHANGELOG.md">View full changelog</a></p>
-]]></description>
-            <enclosure url="https://github.com/openclaw/openclaw/releases/download/v2026.4.7/OpenClaw-2026.4.7.zip" length="25324827" type="application/octet-stream" sparkle:edSignature="RyFWRz1trE/qvOiInD4vR6je9wx7fUTtHpZ94W8rMlZDByux9CyXOm/Anai96b9KyjTeQyC7YnJp5SRnYY3iCg=="/>
-        </item>
    </channel>
-</rss>
+</rss>
--- a/apps/android/app/build.gradle.kts
+++ b/apps/android/app/build.gradle.kts
@@ -65,8 +65,8 @@ android {
        applicationId = "ai.openclaw.app"
        minSdk = 31
        targetSdk = 36
-        versionCode = 2026041001
-        versionName = "2026.4.10"
+        versionCode = 2026041101
+        versionName = "2026.4.12"
        ndk {
            // Support all major ABIs — native libs are tiny (~47 KB per ABI)
            abiFilters += listOf("armeabi-v7a", "arm64-v8a", "x86", "x86_64")
--- a/apps/ios/CHANGELOG.md
+++ b/apps/ios/CHANGELOG.md
@@ -1,12 +1,8 @@
 # OpenClaw iOS Changelog

-## Unreleased
+## 2026.4.12 - 2026-04-12

-### Added
-
-### Changed
-
-### Fixed
+Maintenance update for the current OpenClaw release.

 ## 2026.4.10 - 2026-04-10

--- a/apps/ios/Config/Version.xcconfig
+++ b/apps/ios/Config/Version.xcconfig
@@ -2,8 +2,8 @@
 // Source of truth: apps/ios/version.json
 // Generated by scripts/ios-sync-versioning.ts.

-OPENCLAW_IOS_VERSION = 2026.4.10
-OPENCLAW_MARKETING_VERSION = 2026.4.10
+OPENCLAW_IOS_VERSION = 2026.4.12
+OPENCLAW_MARKETING_VERSION = 2026.4.12
 OPENCLAW_BUILD_VERSION = 1

 #include? "../build/Version.xcconfig"
--- a/apps/ios/version.json
+++ b/apps/ios/version.json
@@ -1,3 +1,3 @@
 {
-  "version": "2026.4.10"
+  "version": "2026.4.12"
 }
--- a/apps/macos/Sources/OpenClaw/Resources/Info.plist
+++ b/apps/macos/Sources/OpenClaw/Resources/Info.plist
@@ -15,9 +15,9 @@
    <key>CFBundlePackageType</key>
    <string>APPL</string>
    <key>CFBundleShortVersionString</key>
-    <string>2026.4.10</string>
+    <string>2026.4.12</string>
    <key>CFBundleVersion</key>
-    <string>2026041001</string>
+    <string>2026041101</string>
    <key>CFBundleIconFile</key>
    <string>OpenClaw</string>
    <key>CFBundleURLTypes</key>
--- a/apps/macos/Sources/OpenClaw/ShellExecutor.swift
+++ b/apps/macos/Sources/OpenClaw/ShellExecutor.swift
@@ -11,6 +11,40 @@ enum ShellExecutor {
        var errorMessage: String?
    }

+    private final class CompletionBox: @unchecked Sendable {
+        private let lock = NSLock()
+        private var finished = false
+        private let continuation: CheckedContinuation<ShellResult, Never>
+
+        init(continuation: CheckedContinuation<ShellResult, Never>) {
+            self.continuation = continuation
+        }
+
+        func finish(_ result: ShellResult) {
+            self.lock.lock()
+            defer { self.lock.unlock() }
+            guard !self.finished else { return }
+            self.finished = true
+            self.continuation.resume(returning: result)
+        }
+    }
+
+    private static func completedResult(
+        status: Int,
+        outTask: Task<Data, Never>,
+        errTask: Task<Data, Never>) async -> ShellResult
+    {
+        let out = await outTask.value
+        let err = await errTask.value
+        return ShellResult(
+            stdout: String(bytes: out, encoding: .utf8) ?? "",
+            stderr: String(bytes: err, encoding: .utf8) ?? "",
+            exitCode: status,
+            timedOut: false,
+            success: status == 0,
+            errorMessage: status == 0 ? nil : "exit \(status)")
+    }
+
    static func runDetailed(
        command: [String],
        cwd: String?,
@@ -38,6 +72,53 @@ enum ShellExecutor {
        process.standardOutput = stdoutPipe
        process.standardError = stderrPipe

+        let outTask = Task { stdoutPipe.fileHandleForReading.readToEndSafely() }
+        let errTask = Task { stderrPipe.fileHandleForReading.readToEndSafely() }
+
+        if let timeout, timeout > 0 {
+            return await withCheckedContinuation { continuation in
+                let completion = CompletionBox(continuation: continuation)
+
+                process.terminationHandler = { terminatedProcess in
+                    let status = Int(terminatedProcess.terminationStatus)
+                    Task {
+                        let result = await self.completedResult(
+                            status: status,
+                            outTask: outTask,
+                            errTask: errTask)
+                        completion.finish(result)
+                    }
+                }
+
+                do {
+                    try process.run()
+                } catch {
+                    completion.finish(
+                        ShellResult(
+                            stdout: "",
+                            stderr: "",
+                            exitCode: nil,
+                            timedOut: false,
+                            success: false,
+                            errorMessage: "failed to start: \(error.localizedDescription)"))
+                    return
+                }
+
+                DispatchQueue.global(qos: .userInitiated).asyncAfter(deadline: .now() + timeout) {
+                    guard process.isRunning else { return }
+                    process.terminate()
+                    completion.finish(
+                        ShellResult(
+                            stdout: "",
+                            stderr: "",
+                            exitCode: nil,
+                            timedOut: true,
+                            success: false,
+                            errorMessage: "timeout"))
+                }
+            }
+        }
+
        do {
            try process.run()
        } catch {
@@ -50,48 +131,11 @@ enum ShellExecutor {
                errorMessage: "failed to start: \(error.localizedDescription)")
        }

-        let outTask = Task { stdoutPipe.fileHandleForReading.readToEndSafely() }
-        let errTask = Task { stderrPipe.fileHandleForReading.readToEndSafely() }
-
-        let waitTask = Task { () -> ShellResult in
-            process.waitUntilExit()
-            let out = await outTask.value
-            let err = await errTask.value
-            let status = Int(process.terminationStatus)
-            return ShellResult(
-                stdout: String(bytes: out, encoding: .utf8) ?? "",
-                stderr: String(bytes: err, encoding: .utf8) ?? "",
-                exitCode: status,
-                timedOut: false,
-                success: status == 0,
-                errorMessage: status == 0 ? nil : "exit \(status)")
-        }
-
-        if let timeout, timeout > 0 {
-            let nanos = UInt64(timeout * 1_000_000_000)
-            return await withTaskGroup(of: ShellResult.self) { group in
-                group.addTask { await waitTask.value }
-                group.addTask {
-                    try? await Task.sleep(nanoseconds: nanos)
-                    guard process.isRunning else {
-                        return await waitTask.value
-                    }
-                    process.terminate()
-                    return ShellResult(
-                        stdout: "",
-                        stderr: "",
-                        exitCode: nil,
-                        timedOut: true,
-                        success: false,
-                        errorMessage: "timeout")
-                }
-                let first = await group.next()!
-                group.cancelAll()
-                return first
-            }
-        }
-
-        return await waitTask.value
+        process.waitUntilExit()
+        return await self.completedResult(
+            status: Int(process.terminationStatus),
+            outTask: outTask,
+            errTask: errTask)
    }

    static func run(command: [String], cwd: String?, env: [String: String]?, timeout: Double?) async -> Response {
--- a/apps/macos/Sources/OpenClaw/TalkModeRuntime.swift
+++ b/apps/macos/Sources/OpenClaw/TalkModeRuntime.swift
@@ -128,8 +128,9 @@ actor TalkModeRuntime {
    private func start() async {
        let gen = self.lifecycleGeneration
        guard voiceWakeSupported else { return }
-        guard PermissionManager.voiceWakePermissionsGranted() else {
-            self.logger.debug("talk runtime not starting: permissions missing")
+
+        guard await PermissionManager.ensureVoiceWakePermissions(interactive: true) else {
+            self.logger.error("talk runtime not starting: permissions missing")
            return
        }
        await self.reloadConfig()
--- a/apps/macos/Sources/OpenClawProtocol/GatewayModels.swift
+++ b/apps/macos/Sources/OpenClawProtocol/GatewayModels.swift
@@ -401,6 +401,60 @@ public struct AgentEvent: Codable, Sendable {
    }
 }

+public struct MessageActionParams: Codable, Sendable {
+    public let channel: String
+    public let action: String
+    public let params: [String: AnyCodable]
+    public let accountid: String?
+    public let requestersenderid: String?
+    public let senderisowner: Bool?
+    public let sessionkey: String?
+    public let sessionid: String?
+    public let agentid: String?
+    public let toolcontext: [String: AnyCodable]?
+    public let idempotencykey: String
+
+    public init(
+        channel: String,
+        action: String,
+        params: [String: AnyCodable],
+        accountid: String?,
+        requestersenderid: String?,
+        senderisowner: Bool?,
+        sessionkey: String?,
+        sessionid: String?,
+        agentid: String?,
+        toolcontext: [String: AnyCodable]?,
+        idempotencykey: String)
+    {
+        self.channel = channel
+        self.action = action
+        self.params = params
+        self.accountid = accountid
+        self.requestersenderid = requestersenderid
+        self.senderisowner = senderisowner
+        self.sessionkey = sessionkey
+        self.sessionid = sessionid
+        self.agentid = agentid
+        self.toolcontext = toolcontext
+        self.idempotencykey = idempotencykey
+    }
+
+    private enum CodingKeys: String, CodingKey {
+        case channel
+        case action
+        case params
+        case accountid = "accountId"
+        case requestersenderid = "requesterSenderId"
+        case senderisowner = "senderIsOwner"
+        case sessionkey = "sessionKey"
+        case sessionid = "sessionId"
+        case agentid = "agentId"
+        case toolcontext = "toolContext"
+        case idempotencykey = "idempotencyKey"
+    }
+}
+
 public struct SendParams: Codable, Sendable {
    public let to: String
    public let message: String?
@@ -1893,6 +1947,7 @@ public struct ConfigApplyParams: Codable, Sendable {
    public let raw: String
    public let basehash: String?
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?

@@ -1900,12 +1955,14 @@ public struct ConfigApplyParams: Codable, Sendable {
        raw: String,
        basehash: String?,
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?)
    {
        self.raw = raw
        self.basehash = basehash
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
    }
@@ -1914,6 +1971,7 @@ public struct ConfigApplyParams: Codable, Sendable {
        case raw
        case basehash = "baseHash"
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
    }
@@ -1923,6 +1981,7 @@ public struct ConfigPatchParams: Codable, Sendable {
    public let raw: String
    public let basehash: String?
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?

@@ -1930,12 +1989,14 @@ public struct ConfigPatchParams: Codable, Sendable {
        raw: String,
        basehash: String?,
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?)
    {
        self.raw = raw
        self.basehash = basehash
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
    }
@@ -1944,6 +2005,7 @@ public struct ConfigPatchParams: Codable, Sendable {
        case raw
        case basehash = "baseHash"
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
    }
@@ -4313,17 +4375,20 @@ public struct ChatEvent: Codable, Sendable {

 public struct UpdateRunParams: Codable, Sendable {
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?
    public let timeoutms: Int?

    public init(
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?,
        timeoutms: Int?)
    {
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
        self.timeoutms = timeoutms
@@ -4331,6 +4396,7 @@ public struct UpdateRunParams: Codable, Sendable {

    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
        case timeoutms = "timeoutMs"
--- a/apps/shared/OpenClawKit/Sources/OpenClawProtocol/GatewayModels.swift
+++ b/apps/shared/OpenClawKit/Sources/OpenClawProtocol/GatewayModels.swift
@@ -401,6 +401,60 @@ public struct AgentEvent: Codable, Sendable {
    }
 }

+public struct MessageActionParams: Codable, Sendable {
+    public let channel: String
+    public let action: String
+    public let params: [String: AnyCodable]
+    public let accountid: String?
+    public let requestersenderid: String?
+    public let senderisowner: Bool?
+    public let sessionkey: String?
+    public let sessionid: String?
+    public let agentid: String?
+    public let toolcontext: [String: AnyCodable]?
+    public let idempotencykey: String
+
+    public init(
+        channel: String,
+        action: String,
+        params: [String: AnyCodable],
+        accountid: String?,
+        requestersenderid: String?,
+        senderisowner: Bool?,
+        sessionkey: String?,
+        sessionid: String?,
+        agentid: String?,
+        toolcontext: [String: AnyCodable]?,
+        idempotencykey: String)
+    {
+        self.channel = channel
+        self.action = action
+        self.params = params
+        self.accountid = accountid
+        self.requestersenderid = requestersenderid
+        self.senderisowner = senderisowner
+        self.sessionkey = sessionkey
+        self.sessionid = sessionid
+        self.agentid = agentid
+        self.toolcontext = toolcontext
+        self.idempotencykey = idempotencykey
+    }
+
+    private enum CodingKeys: String, CodingKey {
+        case channel
+        case action
+        case params
+        case accountid = "accountId"
+        case requestersenderid = "requesterSenderId"
+        case senderisowner = "senderIsOwner"
+        case sessionkey = "sessionKey"
+        case sessionid = "sessionId"
+        case agentid = "agentId"
+        case toolcontext = "toolContext"
+        case idempotencykey = "idempotencyKey"
+    }
+}
+
 public struct SendParams: Codable, Sendable {
    public let to: String
    public let message: String?
@@ -1893,6 +1947,7 @@ public struct ConfigApplyParams: Codable, Sendable {
    public let raw: String
    public let basehash: String?
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?

@@ -1900,12 +1955,14 @@ public struct ConfigApplyParams: Codable, Sendable {
        raw: String,
        basehash: String?,
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?)
    {
        self.raw = raw
        self.basehash = basehash
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
    }
@@ -1914,6 +1971,7 @@ public struct ConfigApplyParams: Codable, Sendable {
        case raw
        case basehash = "baseHash"
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
    }
@@ -1923,6 +1981,7 @@ public struct ConfigPatchParams: Codable, Sendable {
    public let raw: String
    public let basehash: String?
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?

@@ -1930,12 +1989,14 @@ public struct ConfigPatchParams: Codable, Sendable {
        raw: String,
        basehash: String?,
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?)
    {
        self.raw = raw
        self.basehash = basehash
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
    }
@@ -1944,6 +2005,7 @@ public struct ConfigPatchParams: Codable, Sendable {
        case raw
        case basehash = "baseHash"
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
    }
@@ -4313,17 +4375,20 @@ public struct ChatEvent: Codable, Sendable {

 public struct UpdateRunParams: Codable, Sendable {
    public let sessionkey: String?
+    public let deliverycontext: [String: AnyCodable]?
    public let note: String?
    public let restartdelayms: Int?
    public let timeoutms: Int?

    public init(
        sessionkey: String?,
+        deliverycontext: [String: AnyCodable]?,
        note: String?,
        restartdelayms: Int?,
        timeoutms: Int?)
    {
        self.sessionkey = sessionkey
+        self.deliverycontext = deliverycontext
        self.note = note
        self.restartdelayms = restartdelayms
        self.timeoutms = timeoutms
@@ -4331,6 +4396,7 @@ public struct UpdateRunParams: Codable, Sendable {

    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
+        case deliverycontext = "deliveryContext"
        case note
        case restartdelayms = "restartDelayMs"
        case timeoutms = "timeoutMs"
--- a/apps/shared/OpenClawKit/Tools/CanvasA2UI/bootstrap.js
+++ b/apps/shared/OpenClawKit/Tools/CanvasA2UI/bootstrap.js
@@ -466,8 +466,10 @@ class OpenClawA2UIHost extends LitElement {
      try {
        // WebKit message handlers support structured objects; Android's JS interface expects strings.
        if (handler === globalThis.openclawCanvasA2UIAction) {
+          // oxlint-disable-next-line unicorn/require-post-message-target-origin -- Native app message handler, not Window.postMessage.
          handler.postMessage(JSON.stringify({ userAction }));
        } else {
+          // oxlint-disable-next-line unicorn/require-post-message-target-origin -- WebKit message handler, not Window.postMessage.
          handler.postMessage({ userAction });
        }
      } catch (e) {
--- a/docs/.generated/config-baseline.sha256
+++ b/docs/.generated/config-baseline.sha256
@@ -1,4 +1,4 @@
-1977d4698bb80b9aa99315f1114a61b5692bd5630f2ac4a225d81ddc5459d588  config-baseline.json
-d1ee5c4d01deac5cf8ea284cafcd8b6c952b2554d40947d2463d08e314acfcda  config-baseline.core.json
-e1f94346a8507ce3dec763b598e79f3bb89ff2e33189ce977cc87d3b05e71c1d  config-baseline.channel.json
-0fb10e5cb00e7da2cd07c959e0e3397ecb2fdcf15e13a7eae06a2c5b2346bb10  config-baseline.plugin.json
+fce3cbf24274016e01324082ad8ffe81fe2fb41a6e6314aa6efcdbe6689fd628  config-baseline.json
+fb6f0ef881fb591d2791d2adca43c7e88d48f8b562457683092ab6e767aece78  config-baseline.core.json
+3bb312dc9c39a374ca92613abf21606c25dc571287a3941dac71ff57b2b5c519  config-baseline.channel.json
+6c19997f1fb2aff4315f2cb9c7d9e299b403fbc0f9e78e3412cc7fe1c655f222  config-baseline.plugin.json
--- a/docs/.generated/plugin-sdk-api-baseline.sha256
+++ b/docs/.generated/plugin-sdk-api-baseline.sha256
@@ -1,2 +1,2 @@
-2256ba1237c3608ca981bce3a7c66b6880b12d05025f260d5c086b69038f408b  plugin-sdk-api-baseline.json
-6360529513280140c122020466f0821a9acc83aba64612cf90656c2af0261ab3  plugin-sdk-api-baseline.jsonl
+f0d71b70eb54d67fdc35dde8a5051e527c8a910b7b981f5075d78a5160dd08fa  plugin-sdk-api-baseline.json
+e305bb63072efa680951babd1eb1f419e9965d8a4bdabfc9bf3cafe24a8551df  plugin-sdk-api-baseline.jsonl
--- a/docs/.i18n/glossary.ar.json
+++ b/docs/.i18n/glossary.ar.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.de.json
+++ b/docs/.i18n/glossary.de.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.es.json
+++ b/docs/.i18n/glossary.es.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.fr.json
+++ b/docs/.i18n/glossary.fr.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.id.json
+++ b/docs/.i18n/glossary.id.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.it.json
+++ b/docs/.i18n/glossary.it.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.ja-JP.json
+++ b/docs/.i18n/glossary.ja-JP.json
@@ -1,14 +1,98 @@
 [
-  { "source": "OpenClaw", "target": "OpenClaw" },
-  { "source": "Gateway", "target": "Gateway" },
-  { "source": "Pi", "target": "Pi" },
-  { "source": "Skills", "target": "Skills" },
-  { "source": "local loopback", "target": "local loopback" },
-  { "source": "Tailscale", "target": "Tailscale" },
-  { "source": "Getting Started", "target": "はじめに" },
-  { "source": "Getting started", "target": "はじめに" },
-  { "source": "Quick start", "target": "クイックスタート" },
-  { "source": "Quick Start", "target": "クイックスタート" },
-  { "source": "Onboarding", "target": "オンボーディング" },
-  { "source": "wizard", "target": "ウィザード" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Getting Started",
+    "target": "はじめに"
+  },
+  {
+    "source": "Getting started",
+    "target": "はじめに"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "local loopback",
+    "target": "local loopback"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "Onboarding",
+    "target": "オンボーディング"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Quick start",
+    "target": "クイックスタート"
+  },
+  {
+    "source": "Quick Start",
+    "target": "クイックスタート"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  },
+  {
+    "source": "wizard",
+    "target": "ウィザード"
+  }
 ]
--- a/docs/.i18n/glossary.ko.json
+++ b/docs/.i18n/glossary.ko.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.pl.json
+++ b/docs/.i18n/glossary.pl.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.pt-BR.json
+++ b/docs/.i18n/glossary.pt-BR.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.tr.json
+++ b/docs/.i18n/glossary.tr.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/.i18n/glossary.uk.json
+++ b/docs/.i18n/glossary.uk.json
@@ -1,5 +1,78 @@
 [
-  { "source": "CLI", "target": "CLI" },
-  { "source": "Mintlify", "target": "Mintlify" },
-  { "source": "OpenClaw", "target": "OpenClaw" }
+  {
+    "source": "ACP",
+    "target": "ACP"
+  },
+  {
+    "source": "Active Memory",
+    "target": "Active Memory"
+  },
+  {
+    "source": "ClawHub",
+    "target": "ClawHub"
+  },
+  {
+    "source": "CLI",
+    "target": "CLI"
+  },
+  {
+    "source": "Compaction",
+    "target": "Compaction"
+  },
+  {
+    "source": "Cron",
+    "target": "Cron"
+  },
+  {
+    "source": "Dreaming",
+    "target": "Dreaming"
+  },
+  {
+    "source": "Gateway",
+    "target": "Gateway"
+  },
+  {
+    "source": "Heartbeat",
+    "target": "Heartbeat"
+  },
+  {
+    "source": "Mintlify",
+    "target": "Mintlify"
+  },
+  {
+    "source": "Node",
+    "target": "Node"
+  },
+  {
+    "source": "OpenClaw",
+    "target": "OpenClaw"
+  },
+  {
+    "source": "Pi",
+    "target": "Pi"
+  },
+  {
+    "source": "Plugin",
+    "target": "Plugin"
+  },
+  {
+    "source": "Skills",
+    "target": "Skills"
+  },
+  {
+    "source": "Tailscale",
+    "target": "Tailscale"
+  },
+  {
+    "source": "TaskFlow",
+    "target": "TaskFlow"
+  },
+  {
+    "source": "TUI",
+    "target": "TUI"
+  },
+  {
+    "source": "Webhook",
+    "target": "Webhook"
+  }
 ]
--- a/docs/AGENTS.md
+++ b/docs/AGENTS.md
@@ -0,0 +1,28 @@
+# Docs Guide
+
+This directory owns docs authoring, Mintlify link rules, and docs i18n policy.
+
+## Mintlify Rules
+
+- Docs are hosted on Mintlify (`https://docs.openclaw.ai`).
+- Internal doc links in `docs/**/*.md` must stay root-relative with no `.md` or `.mdx` suffix (example: `[Config](/configuration)`).
+- Section cross-references should use anchors on root-relative paths (example: `[Hooks](/configuration#hooks)`).
+- Doc headings should avoid em dashes and apostrophes because Mintlify anchor generation is brittle there.
+- README and other GitHub-rendered docs should keep absolute docs URLs so links work outside Mintlify.
+- Docs content must stay generic: no personal device names, hostnames, or local paths; use placeholders like `user@gateway-host`.
+
+## Docs Content Rules
+
+- For docs, UI copy, and picker lists, order services/providers alphabetically unless the section is explicitly describing runtime order or auto-detection order.
+- Keep bundled plugin naming consistent with the repo-wide plugin terminology rules in the root `AGENTS.md`.
+
+## Docs i18n
+
+- Foreign-language docs are not maintained in this repo. The generated publish output lives in the separate `openclaw/docs` repo (often cloned locally as `../openclaw-docs`).
+- Do not add or edit localized docs under `docs/<locale>/**` here.
+- Treat English docs in this repo plus glossary files as the source of truth.
+- Pipeline: update English docs here, update `docs/.i18n/glossary.<locale>.json` as needed, then let the publish-repo sync and `scripts/docs-i18n` run in `openclaw/docs`.
+- Before rerunning `scripts/docs-i18n`, add glossary entries for any new technical terms, page titles, or short nav labels that must stay in English or use a fixed translation.
+- `pnpm docs:check-i18n-glossary` is the guard for changed English doc titles and short internal doc labels.
+- Translation memory lives in generated `docs/.i18n/*.tm.jsonl` files in the publish repo.
+- See `docs/.i18n/README.md`.
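
The Mintlify link rules in the `docs/AGENTS.md` hunk above (root-relative targets, no `.md` or `.mdx` suffix) could be checked mechanically. The following is a minimal sketch under stated assumptions, not the repository's actual lint: the function name `find_bad_internal_links`, the regular expression, and the reported messages are all hypothetical.

```python
import re

# Matches inline Markdown links of the form [label](target).
LINK_RE = re.compile(r"\[([^\]]*)\]\(([^)\s]+)\)")
# Matches targets that carry a URL scheme such as https: or mailto:.
SCHEME_RE = re.compile(r"^[a-z][a-z0-9+.-]*:", re.IGNORECASE)


def find_bad_internal_links(markdown):
    """Return (target, reason) pairs for internal links that break the rules.

    Hypothetical helper: external URLs and same-page anchors are skipped.
    """
    problems = []
    for match in LINK_RE.finditer(markdown):
        target = match.group(2)
        if SCHEME_RE.match(target) or target.startswith("#"):
            continue  # external link or same-page anchor, out of scope
        path = target.split("#", 1)[0]  # drop any anchor before checking the suffix
        if not target.startswith("/"):
            problems.append((target, "not root-relative"))
        elif path.endswith((".md", ".mdx")):
            problems.append((target, "has .md/.mdx suffix"))
    return problems


sample = (
    "[Config](/configuration) [Hooks](/configuration#hooks) "
    "[Bad](./config.md) [Bad2](/config.mdx#x) "
    "[Ext](https://docs.openclaw.ai) [Anchor](#sending-files)"
)
for target, reason in find_bad_internal_links(sample):
    print(f"{target}: {reason}")
```

The sketch reports only the two offending targets and leaves the examples quoted in the guide (`/configuration`, `/configuration#hooks`) untouched.
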
--- a/docs/CLAUDE.md
+++ b/docs/CLAUDE.md
@@ -0,0 +1 @@
+AGENTS.md
--- a/docs/automation/cron-jobs.md
+++ b/docs/automation/cron-jobs.md
@@ -62,6 +62,18 @@ Timestamps without a timezone are treated as UTC. Add `--tz America/New_York` fo

 Recurring top-of-hour expressions are automatically staggered by up to 5 minutes to reduce load spikes. Use `--exact` to force precise timing or `--stagger 30s` for an explicit window.

+### Day-of-month and day-of-week use OR logic
+
+Cron expressions are parsed by [croner](https://github.com/Hexagon/croner). When both the day-of-month and day-of-week fields are non-wildcard, croner matches when **either** field matches rather than requiring both. This is standard Vixie cron behavior.
+
+```
+# Intended: "9 AM on the 15th, only if it's a Monday"
+# Actual:   "9 AM on every 15th, AND 9 AM on every Monday"
+0 9 15 * 1
+```
+
+This fires ~5–6 times per month instead of 0–1 times per month. OpenClaw uses Croner's default OR behavior here. To require both conditions, use Croner's `+` day-of-week modifier (`0 9 15 * +1`) or schedule on one field and guard the other in your job's prompt or command.
+
 ## Execution styles

 | Style           | `--session` value   | Runs in                  | Best for                        |
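
The OR rule described in the `cron-jobs.md` hunk above can be checked with a small calendar walk. This sketch does not call croner; it only models the day-of-month and day-of-week fields, and the helper name `matching_days` and its `require_both` flag are assumptions made for illustration.

```python
import calendar
from datetime import date


def matching_days(year, month, day_of_month, cron_dow, require_both=False):
    """Return the dates in a month selected by a day-of-month/day-of-week pair.

    cron_dow uses cron numbering (0 = Sunday ... 6 = Saturday).
    require_both=False models the default OR rule; True models AND.
    """
    days_in_month = calendar.monthrange(year, month)[1]
    matches = []
    for day in range(1, days_in_month + 1):
        current = date(year, month, day)
        # date.weekday() is Monday=0 ... Sunday=6; shift to cron numbering.
        dow = (current.weekday() + 1) % 7
        dom_ok = day == day_of_month
        dow_ok = dow == cron_dow
        selected = (dom_ok and dow_ok) if require_both else (dom_ok or dow_ok)
        if selected:
            matches.append(current)
    return matches


# Fields from "0 9 15 * 1": day-of-month 15, day-of-week 1 (Monday).
or_days = matching_days(2026, 7, 15, 1)
and_days = matching_days(2026, 7, 15, 1, require_both=True)
print(len(or_days), "matches with OR;", len(and_days), "matches with AND")
```

Comparing the two lists for a few months shows why the OR rule fires far more often than a reader expecting "the 15th, only if it is a Monday" would assume.
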
--- a/docs/channels/msteams.md
+++ b/docs/channels/msteams.md
@@ -9,7 +9,7 @@ title: "Microsoft Teams"

 > "Abandon all hope, ye who enter here."

-Updated: 2026-01-21
+Updated: 2026-03-25

 Status: text + DM attachments are supported; channel/group file sending requires `sharePointSiteId` + Graph permissions (see [Sending files in group chats](#sending-files-in-group-chats)). Polls are sent via Adaptive Cards. Message actions expose explicit `upload-file` for file-first sends.

@@ -43,7 +43,7 @@ Details: [Plugins](/tools/plugin)
 4. Expose `/api/messages` (port 3978 by default) via a public URL or tunnel.
 5. Install the Teams app package and start the gateway.

-Minimal config:
+Minimal config (client secret):

 ```json5
 {
@@ -59,6 +59,8 @@ Minimal config:
 }
 ```

+For production deployments, consider using [federated authentication](#federated-authentication-certificate--managed-identity) (certificate or managed identity) instead of client secrets.
+
 Note: group chats are blocked by default (`channels.msteams.groupPolicy: "allowlist"`). To allow group replies, set `channels.msteams.groupAllowFrom` (or use `groupPolicy: "open"` to allow any member, mention-gated).

 ## Goals
@@ -190,6 +192,148 @@ Before configuring OpenClaw, you need to create an Azure Bot resource.
 2. Click **Microsoft Teams** → Configure → Save
 3. Accept the Terms of Service

+## Federated Authentication (Certificate + Managed Identity)
+
+> Added in 2026.3.24
+
+For production deployments, OpenClaw supports **federated authentication** as a more secure alternative to client secrets. Two methods are available:
+
+### Option A: Certificate-based authentication
+
+Use a PEM certificate registered with your Entra ID app registration.
+
+**Setup:**
+
+1. Generate or obtain a certificate (PEM format with private key).
+2. In Entra ID → App Registration → **Certificates & secrets** → **Certificates** → Upload the public certificate.
+
+**Config:**
+
+```json5
+{
+  channels: {
+    msteams: {
+      enabled: true,
+      appId: "<APP_ID>",
+      tenantId: "<TENANT_ID>",
+      authType: "federated",
+      certificatePath: "/path/to/cert.pem",
+      webhook: { port: 3978, path: "/api/messages" },
+    },
+  },
+}
+```
+
+**Env vars:**
+
+- `MSTEAMS_AUTH_TYPE=federated`
+- `MSTEAMS_CERTIFICATE_PATH=/path/to/cert.pem`
+
+### Option B: Azure Managed Identity
+
+Use Azure Managed Identity for passwordless authentication. This is ideal for deployments on Azure infrastructure (AKS, App Service, Azure VMs) where a managed identity is available.
+
+**How it works:**
+
+1. The bot pod/VM has a managed identity (system-assigned or user-assigned).
+2. A **federated identity credential** links the managed identity to the Entra ID app registration.
+3. At runtime, OpenClaw uses `@azure/identity` to acquire tokens from the Azure IMDS endpoint (`169.254.169.254`).
+4. The token is passed to the Teams SDK for bot authentication.
+
+**Prerequisites:**
+
+- Azure infrastructure with managed identity enabled (AKS workload identity, App Service, VM)
+- Federated identity credential created on the Entra ID app registration
+- Network access to IMDS (`169.254.169.254:80`) from the pod/VM
+
+**Config (system-assigned managed identity):**
+
+```json5
+{
+  channels: {
+    msteams: {
+      enabled: true,
+      appId: "<APP_ID>",
+      tenantId: "<TENANT_ID>",
+      authType: "federated",
+      useManagedIdentity: true,
+      webhook: { port: 3978, path: "/api/messages" },
+    },
+  },
+}
+```
+
+**Config (user-assigned managed identity):**
+
+```json5
+{
+  channels: {
+    msteams: {
+      enabled: true,
+      appId: "<APP_ID>",
+      tenantId: "<TENANT_ID>",
+      authType: "federated",
+      useManagedIdentity: true,
+      managedIdentityClientId: "<MI_CLIENT_ID>",
+      webhook: { port: 3978, path: "/api/messages" },
+    },
+  },
+}
+```
+
+**Env vars:**
+
+- `MSTEAMS_AUTH_TYPE=federated`
+- `MSTEAMS_USE_MANAGED_IDENTITY=true`
+- `MSTEAMS_MANAGED_IDENTITY_CLIENT_ID=<client-id>` (only for user-assigned)
+
+### AKS Workload Identity Setup
+
+For AKS deployments using workload identity:
+
+1. **Enable workload identity** on your AKS cluster.
+2. **Create a federated identity credential** on the Entra ID app registration:
+
+   ```bash
+   az ad app federated-credential create --id <APP_OBJECT_ID> --parameters '{
+     "name": "my-bot-workload-identity",
+     "issuer": "<AKS_OIDC_ISSUER_URL>",
+     "subject": "system:serviceaccount:<NAMESPACE>:<SERVICE_ACCOUNT>",
+     "audiences": ["api://AzureADTokenExchange"]
+   }'
+   ```
+
+3. **Annotate the Kubernetes service account** with the app client ID:
+
+   ```yaml
+   apiVersion: v1
+   kind: ServiceAccount
+   metadata:
+     name: my-bot-sa
+     annotations:
+       azure.workload.identity/client-id: "<APP_CLIENT_ID>"
+   ```
+
+4. **Label the pod** for workload identity injection:
+
+   ```yaml
+   metadata:
+     labels:
+       azure.workload.identity/use: "true"
+   ```
+
+5. **Ensure network access** to IMDS (`169.254.169.254`). If you use a NetworkPolicy, add an egress rule allowing traffic to `169.254.169.254/32` on port 80.
+
+### Auth type comparison
+
+| Method               | Config                                         | Pros                               | Cons                                  |
+| -------------------- | ---------------------------------------------- | ---------------------------------- | ------------------------------------- |
+| **Client secret**    | `appPassword`                                  | Simple setup                       | Secret rotation required, less secure |
+| **Certificate**      | `authType: "federated"` + `certificatePath`    | No shared secret over network      | Certificate management overhead       |
+| **Managed Identity** | `authType: "federated"` + `useManagedIdentity` | Passwordless, no secrets to manage | Azure infrastructure required         |
+
+**Default behavior:** When `authType` is not set, OpenClaw defaults to client secret authentication. Existing configurations continue to work without changes.
+
 ## Local Development (Tunneling)

 Teams can't reach `localhost`. Use a tunnel for local development:
@@ -279,6 +423,11 @@ This is often easier than hand-editing JSON manifests.
   - `MSTEAMS_APP_ID`
   - `MSTEAMS_APP_PASSWORD`
   - `MSTEAMS_TENANT_ID`
+   - `MSTEAMS_AUTH_TYPE` (optional: `"secret"` or `"federated"`)
+   - `MSTEAMS_CERTIFICATE_PATH` (federated + certificate)
+   - `MSTEAMS_CERTIFICATE_THUMBPRINT` (optional, not required for auth)
+   - `MSTEAMS_USE_MANAGED_IDENTITY` (federated + managed identity)
+   - `MSTEAMS_MANAGED_IDENTITY_CLIENT_ID` (user-assigned MI only)

 5. **Bot endpoint**
   - Set the Azure Bot Messaging Endpoint to:
@@ -492,6 +641,11 @@ Key settings (see `/gateway/configuration` for shared channel patterns):
 - `toolsBySender` keys should use explicit prefixes:
  `id:`, `e164:`, `username:`, `name:` (legacy unprefixed keys still map to `id:` only).
 - `channels.msteams.actions.memberInfo`: enable or disable the Graph-backed member info action (default: enabled when Graph credentials are available).
+- `channels.msteams.authType`: authentication type — `"secret"` (default) or `"federated"`.
+- `channels.msteams.certificatePath`: path to PEM certificate file (federated + certificate auth).
+- `channels.msteams.certificateThumbprint`: certificate thumbprint (optional, not required for auth).
+- `channels.msteams.useManagedIdentity`: enable managed identity auth (federated mode).
+- `channels.msteams.managedIdentityClientId`: client ID for user-assigned managed identity.
 - `channels.msteams.sharePointSiteId`: SharePoint site ID for file uploads in group chats/channels (see [Sending files in group chats](#sending-files-in-group-chats)).

 ## Routing & Sessions
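
The auth settings added in the `msteams.md` hunks above imply a small decision tree: client secret by default, and either a certificate or a managed identity in federated mode. The sketch below illustrates that tree using the documented environment variable names. It is not OpenClaw's implementation: the function `resolve_msteams_auth`, the returned dictionary shape, the error messages, and the choice to check managed identity before the certificate path are all assumptions.

```python
def resolve_msteams_auth(env):
    """Pick an auth method from a mapping of environment variables.

    Hypothetical helper; precedence and error handling are assumptions.
    """
    auth_type = env.get("MSTEAMS_AUTH_TYPE", "secret")  # documented default: secret

    if auth_type == "secret":
        if not env.get("MSTEAMS_APP_PASSWORD"):
            raise ValueError("client secret auth needs MSTEAMS_APP_PASSWORD")
        return {"method": "client-secret"}

    if auth_type != "federated":
        raise ValueError(f"unknown MSTEAMS_AUTH_TYPE: {auth_type!r}")

    # Assumption: managed identity is checked before the certificate path.
    if env.get("MSTEAMS_USE_MANAGED_IDENTITY", "").lower() == "true":
        client_id = env.get("MSTEAMS_MANAGED_IDENTITY_CLIENT_ID")
        kind = "user-assigned" if client_id else "system-assigned"
        return {"method": "managed-identity", "kind": kind}

    certificate_path = env.get("MSTEAMS_CERTIFICATE_PATH")
    if certificate_path:
        return {"method": "certificate", "certificatePath": certificate_path}

    raise ValueError("federated auth needs a certificate path or managed identity")


print(resolve_msteams_auth({"MSTEAMS_APP_PASSWORD": "<APP_PASSWORD>"}))
print(resolve_msteams_auth({
    "MSTEAMS_AUTH_TYPE": "federated",
    "MSTEAMS_USE_MANAGED_IDENTITY": "true",
}))
```

An existing configuration with only `MSTEAMS_APP_PASSWORD` set resolves to client secret auth, which matches the documented default behavior.
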
--- a/docs/channels/slack.md
+++ b/docs/channels/slack.md
@@ -282,7 +282,279 @@ openclaw gateway
  </Tab>
 </Tabs>

+### Additional manifest settings
+
+These optional settings enable features beyond the defaults above.
+
 <AccordionGroup>
+  <Accordion title="Optional native slash commands">
+
+    You can register multiple [native slash commands](#commands-and-slash-behavior) instead of a single configured command, with two caveats:
+
+    - Use `/agentstatus` instead of `/status` because the `/status` command is reserved.
+    - No more than 25 slash commands can be made available at once.
+
+    Replace your existing `features.slash_commands` section with a subset of [available commands](/tools/slash-commands#command-list):
+
+    <Tabs>
+      <Tab title="Socket Mode (default)">
+
+```json
+    "slash_commands": [
+      {
+        "command": "/new",
+        "description": "Start a new session",
+        "usage_hint": "[model]"
+      },
+      {
+        "command": "/reset",
+        "description": "Reset the current session"
+      },
+      {
+        "command": "/compact",
+        "description": "Compact the session context",
+        "usage_hint": "[instructions]"
+      },
+      {
+        "command": "/stop",
+        "description": "Stop the current run"
+      },
+      {
+        "command": "/session",
+        "description": "Manage thread-binding expiry",
+        "usage_hint": "idle <duration|off> or max-age <duration|off>"
+      },
+      {
+        "command": "/think",
+        "description": "Set the thinking level",
+        "usage_hint": "<off|minimal|low|medium|high|xhigh>"
+      },
+      {
+        "command": "/verbose",
+        "description": "Toggle verbose output",
+        "usage_hint": "on|off|full"
+      },
+      {
+        "command": "/fast",
+        "description": "Show or set fast mode",
+        "usage_hint": "[status|on|off]"
+      },
+      {
+        "command": "/reasoning",
+        "description": "Toggle reasoning visibility",
+        "usage_hint": "[on|off|stream]"
+      },
+      {
+        "command": "/elevated",
+        "description": "Toggle elevated mode",
+        "usage_hint": "[on|off|ask|full]"
+      },
+      {
+        "command": "/exec",
+        "description": "Show or set exec defaults",
+        "usage_hint": "host=<auto|sandbox|gateway|node> security=<deny|allowlist|full> ask=<off|on-miss|always> node=<id>"
+      },
+      {
+        "command": "/model",
+        "description": "Show or set the model",
+        "usage_hint": "[name|#|status]"
+      },
+      {
+        "command": "/models",
+        "description": "List providers or models for a provider",
+        "usage_hint": "[provider] [page] [limit=<n>|size=<n>|all]"
+      },
+      {
+        "command": "/help",
+        "description": "Show the short help summary"
+      },
+      {
+        "command": "/commands",
+        "description": "Show the generated command catalog"
+      },
+      {
+        "command": "/tools",
+        "description": "Show what the current agent can use right now",
+        "usage_hint": "[compact|verbose]"
+      },
+      {
+        "command": "/agentstatus",
+        "description": "Show runtime status, including provider usage/quota when available"
+      },
+      {
+        "command": "/tasks",
+        "description": "List active/recent background tasks for the current session"
+      },
+      {
+        "command": "/context",
+        "description": "Explain how context is assembled",
+        "usage_hint": "[list|detail|json]"
+      },
+      {
+        "command": "/whoami",
+        "description": "Show your sender identity"
+      },
+      {
+        "command": "/skill",
+        "description": "Run a skill by name",
+        "usage_hint": "<name> [input]"
+      },
+      {
+        "command": "/btw",
+        "description": "Ask a side question without changing session context",
+        "usage_hint": "<question>"
+      },
+      {
+        "command": "/usage",
+        "description": "Control the usage footer or show cost summary",
+        "usage_hint": "off|tokens|full|cost"
+      }
+    ]
+```
+
+      </Tab>
+      <Tab title="HTTP Request URLs">
+
+```json
+    "slash_commands": [
+      {
+        "command": "/new",
+        "description": "Start a new session",
+        "usage_hint": "[model]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/reset",
+        "description": "Reset the current session",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/compact",
+        "description": "Compact the session context",
+        "usage_hint": "[instructions]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/stop",
+        "description": "Stop the current run",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/session",
+        "description": "Manage thread-binding expiry",
+        "usage_hint": "idle <duration|off> or max-age <duration|off>",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/think",
+        "description": "Set the thinking level",
+        "usage_hint": "<off|minimal|low|medium|high|xhigh>",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/verbose",
+        "description": "Toggle verbose output",
+        "usage_hint": "on|off|full",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/fast",
+        "description": "Show or set fast mode",
+        "usage_hint": "[status|on|off]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/reasoning",
+        "description": "Toggle reasoning visibility",
+        "usage_hint": "[on|off|stream]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/elevated",
+        "description": "Toggle elevated mode",
+        "usage_hint": "[on|off|ask|full]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/exec",
+        "description": "Show or set exec defaults",
+        "usage_hint": "host=<auto|sandbox|gateway|node> security=<deny|allowlist|full> ask=<off|on-miss|always> node=<id>",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/model",
+        "description": "Show or set the model",
+        "usage_hint": "[name|#|status]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/models",
+        "description": "List providers or models for a provider",
+        "usage_hint": "[provider] [page] [limit=<n>|size=<n>|all]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/help",
+        "description": "Show the short help summary",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/commands",
+        "description": "Show the generated command catalog",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/tools",
+        "description": "Show what the current agent can use right now",
+        "usage_hint": "[compact|verbose]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/agentstatus",
+        "description": "Show runtime status, including provider usage/quota when available",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/tasks",
+        "description": "List active/recent background tasks for the current session",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/context",
+        "description": "Explain how context is assembled",
+        "usage_hint": "[list|detail|json]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/whoami",
+        "description": "Show your sender identity",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/skill",
+        "description": "Run a skill by name",
+        "usage_hint": "<name> [input]",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/btw",
+        "description": "Ask a side question without changing session context",
+        "usage_hint": "<question>",
+        "url": "https://gateway-host.example.com/slack/events"
+      },
+      {
+        "command": "/usage",
+        "description": "Control the usage footer or show cost summary",
+        "usage_hint": "off|tokens|full|cost",
+        "url": "https://gateway-host.example.com/slack/events"
+      }
+    ]
+```
+
+      </Tab>
+    </Tabs>
+
+  </Accordion>
  <Accordion title="Optional authorship scopes (write operations)">
    Add the `chat:write.customize` bot scope if you want outgoing messages to use the active agent identity (custom username and icon) instead of the default Slack app identity.

@@ -536,30 +808,37 @@ Notes:

 ## Commands and slash behavior

- Native command auto-mode is **off** for Slack (`commands.native: "auto"` does not enable Slack native commands).
- Enable native Slack command handlers with `channels.slack.commands.native: true` (or global `commands.native: true`).
- When native commands are enabled, register matching slash commands in Slack (`/<command>` names), with one exception:
-  - register `/agentstatus` for the status command (Slack reserves `/status`)
- If native commands are not enabled, you can run a single configured slash command via `channels.slack.slashCommand`.
- Native arg menus now adapt their rendering strategy:
-  - up to 5 options: button blocks
-  - 6-100 options: static select menu
-  - more than 100 options: external select with async option filtering when interactivity options handlers are available
-  - if encoded option values exceed Slack limits, the flow falls back to buttons
- For long option payloads, Slash command argument menus use a confirm dialog before dispatching a selected value.
-
-Default slash command settings:
+Slash commands appear in Slack as either a single configured command or multiple native commands. Configure `channels.slack.slashCommand` to change command defaults:

 - `enabled: false`
 - `name: "openclaw"`
 - `sessionPrefix: "slack:slash"`
 - `ephemeral: true`

-Slash sessions use isolated keys:
+```txt
+/openclaw /help
+```

- `agent:<agentId>:slack:slash:<userId>`
+Native commands require [additional manifest settings](#additional-manifest-settings) in your Slack app. Enable them with `channels.slack.commands.native: true`, or with `commands.native: true` in the global configuration.

-and still route command execution against the target conversation session (`CommandTargetSessionKey`).
+- Native command auto-mode is **off** for Slack, so `commands.native: "auto"` does not enable Slack native commands.
+
+```txt
+/help
+```
+
+Native argument menus show a confirmation modal before dispatching a selected option value and adapt their rendering to the number of options:
+
+- up to 5 options: button blocks
+- 6-100 options: static select menu
+- more than 100 options: external select with async option filtering when interactivity options handlers are available
+- encoded option values exceeding Slack limits: fall back to buttons
+
+```txt
+/think
+```
+
+Slash sessions use isolated keys like `agent:<agentId>:slack:slash:<userId>` and still route command executions to the target conversation session using `CommandTargetSessionKey`.

 ## Interactive replies

--- a/docs/cli/devices.md
+++ b/docs/cli/devices.md
@@ -49,8 +49,10 @@ openclaw devices clear --yes --pending --json

 ### `openclaw devices approve [requestId] [--latest]`

-Approve a pending device pairing request. If `requestId` is omitted, OpenClaw
-automatically approves the most recent pending request.
+Approve a pending device pairing request by exact `requestId`. If `requestId`
+is omitted or `--latest` is passed, OpenClaw only prints the selected pending
+request and exits; rerun approval with the exact request ID after verifying
+the details.

 Note: if a device retries pairing with changed auth details (role/scopes/public
 key), OpenClaw supersedes the previous pending entry and issues a new
@@ -126,7 +128,7 @@ Pass `--token` or `--password` explicitly. Missing explicit credentials is an er
  `operator.admin`.
 - `devices clear` is intentionally gated by `--yes`.
 - If pairing scope is unavailable on local loopback (and no explicit `--url` is passed), list/approve can use a local pairing fallback.
-- `devices approve` picks the newest pending request automatically when you omit `requestId` or pass `--latest`.
+- `devices approve` requires an explicit request ID before minting tokens; omitting `requestId` or passing `--latest` only previews the newest pending request.

 ## Token drift recovery checklist

--- a/docs/cli/index.md
+++ b/docs/cli/index.md
@@ -852,7 +852,7 @@ Subcommands:
 Notes:

 - `devices list` and `devices approve` can fall back to local pairing files on local loopback when direct pairing scope is unavailable.
-- `devices approve` auto-selects the newest pending request when no `requestId` is passed or `--latest` is set.
+- `devices approve` requires an explicit request ID before minting tokens; omitting `requestId` or passing `--latest` only previews the newest pending request.
 - Stored-token reconnects reuse the token's cached approved scopes; explicit
  `devices rotate --scope ...` updates that stored scope set for future
  cached-token reconnects.
--- a/docs/concepts/active-memory.md
+++ b/docs/concepts/active-memory.md
@@ -35,7 +35,7 @@ self-contained, safe-default setup:
          enabled: true,
          agents: ["main"],
          allowedChatTypes: ["direct"],
-          modelFallbackPolicy: "default-remote",
+          modelFallback: "google/gemini-3-flash",
          queryMode: "recent",
          promptStyle: "balanced",
          timeoutMs: 15000,
@@ -51,13 +51,13 @@ self-contained, safe-default setup:

 This turns the plugin on for the `main` agent, keeps it limited to direct-message
 style sessions by default, lets it inherit the current session model first, and
-still allows the built-in remote fallback if no explicit or inherited model is
+uses the configured fallback model only if no explicit or inherited model is
 available.

 After that, restart the gateway:

 ```bash
-node scripts/run-node.mjs gateway --profile dev
+openclaw gateway
 ```

 To inspect it live in a conversation:
@@ -85,7 +85,7 @@ Start with this in `openclaw.json`:
        config: {
          agents: ["main"],
          allowedChatTypes: ["direct"],
-          modelFallbackPolicy: "default-remote",
+          modelFallback: "google/gemini-3-flash",
          queryMode: "recent",
          promptStyle: "balanced",
          timeoutMs: 15000,
@@ -102,7 +102,7 @@ Start with this in `openclaw.json`:
 Then restart the gateway:

 ```bash
-node scripts/run-node.mjs gateway --profile dev
+openclaw gateway
 ```

 What this means:
@@ -111,7 +111,7 @@ What this means:
 - `config.agents: ["main"]` opts only the `main` agent into active memory
 - `config.allowedChatTypes: ["direct"]` keeps active memory on for direct-message style sessions only by default
 - if `config.model` is unset, active memory inherits the current session model first
-- `config.modelFallbackPolicy: "default-remote"` keeps the built-in remote fallback as the default when no explicit or inherited model is available
+- `config.modelFallback` optionally provides your own fallback provider/model for recall
 - `config.promptStyle: "balanced"` uses the default general-purpose prompt style for `recent` mode
 - active memory still runs only on eligible interactive persistent chat sessions

@@ -335,26 +335,22 @@ If `config.model` is unset, Active Memory tries to resolve a model in this order
 explicit plugin model
 -> current session model
 -> agent primary model
--> optional built-in remote fallback
+-> optional configured fallback model
 ```

-`config.modelFallbackPolicy` controls the last step.
+`config.modelFallback` controls the configured fallback step.

-Default:
+Optional custom fallback:

 ```json5
-modelFallbackPolicy: "default-remote"
+modelFallback: "google/gemini-3-flash"
 ```

-Other option:
+If no explicit, inherited, or configured fallback model resolves, Active Memory
+skips recall for that turn.

-```json5
-modelFallbackPolicy: "resolved-only"
-```
-
-Use `resolved-only` if you want Active Memory to skip recall instead of falling
-back to the built-in remote default when no explicit or inherited model is
-available.
+`config.modelFallbackPolicy` is retained only as a deprecated compatibility
+field for older configs. It no longer changes runtime behavior.

 ## Advanced escape hatches

--- a/docs/concepts/memory-qmd.md
+++ b/docs/concepts/memory-qmd.md
@@ -51,6 +51,9 @@ legacy `--mask` collection flags and older MCP tool names when needed.
 - OpenClaw creates collections from your workspace memory files and any
  configured `memory.qmd.paths`, then runs `qmd update` + `qmd embed` on boot
  and periodically (default every 5 minutes).
+- The default workspace collection tracks `MEMORY.md` plus the `memory/`
+  tree. Lowercase `memory.md` remains a bootstrap fallback, not a separate QMD
+  collection.
 - Boot refresh runs in the background so chat startup is not blocked.
 - Searches use the configured `searchMode` (default: `search`; also supports
  `vsearch` and `query`). If a mode fails, OpenClaw retries with `qmd query`.
@@ -114,8 +117,8 @@ collection under `~/.openclaw/agents/<id>/qmd/sessions/`.

 ## Search scope

-By default, QMD search results are only surfaced in DM sessions (not groups or
-channels). Configure `memory.qmd.scope` to change this:
+By default, QMD search results are surfaced in direct and channel sessions
+(not groups). Configure `memory.qmd.scope` to change this:

 ```json5
 {
@@ -164,7 +167,7 @@ with `qmd query "test"` using the same XDG dirs OpenClaw uses.
 Set to `120000` for slower hardware.

 **Empty results in group chats?** Check `memory.qmd.scope` -- the default only
-allows DM sessions.
+allows direct and channel sessions.

 **Workspace-visible temp repos causing `ENAMETOOLONG` or broken indexing?**
 QMD traversal currently follows the underlying QMD scanner behavior rather than
--- a/docs/concepts/memory-search.md
+++ b/docs/concepts/memory-search.md
@@ -67,6 +67,8 @@ flowchart LR

 If only one path is available (no embeddings or no FTS), the other runs alone.

+When embeddings are unavailable, OpenClaw still applies lexical ranking to FTS results rather than falling back to raw exact-match ordering. This degraded mode boosts chunks with stronger query-term coverage and relevant file paths, which keeps recall useful even without `sqlite-vec` or an embedding provider.
+
 ## Improving search quality

 Two optional features help when you have a large note history:
--- a/docs/concepts/qa-e2e-automation.md
+++ b/docs/concepts/qa-e2e-automation.md
@@ -52,6 +52,47 @@ pnpm qa:lab:watch
 rebuilds that bundle on change, and the browser auto-reloads when the QA Lab
 asset hash changes.

+For a transport-real Matrix smoke lane, run:
+
+```bash
+pnpm openclaw qa matrix
+```
+
+That lane provisions a disposable Tuwunel homeserver in Docker, registers
+temporary driver, SUT, and observer users, creates one private room, then runs
+the real Matrix plugin inside a QA gateway child. The live transport lane keeps
+the child config scoped to the transport under test, so Matrix runs without
+`qa-channel` in the child config.
+
+For a transport-real Telegram smoke lane, run:
+
+```bash
+pnpm openclaw qa telegram
+```
+
+That lane targets one real private Telegram group instead of provisioning a
+disposable server. It requires `OPENCLAW_QA_TELEGRAM_GROUP_ID`,
+`OPENCLAW_QA_TELEGRAM_DRIVER_BOT_TOKEN`, and
+`OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN`, plus two distinct bots in the same
+private group. The SUT bot must have a Telegram username, and bot-to-bot
+observation works best when both bots have Bot-to-Bot Communication Mode
+enabled in `@BotFather`.
+
+Live transport lanes now share one smaller contract instead of each inventing
+their own scenario list shape.
+
+`qa-channel` remains the broad synthetic product-behavior suite and is not part
+of the live transport coverage matrix:
+
+| Lane     | Canary | Mention gating | Allowlist block | Top-level reply | Restart resume | Thread follow-up | Thread isolation | Reaction observation | Help command |
+| -------- | ------ | -------------- | --------------- | --------------- | -------------- | ---------------- | ---------------- | -------------------- | ------------ |
+| Matrix   | x      | x              | x               | x               | x              | x                | x                | x                    |              |
+| Telegram | x      |                |                 |                 |                |                  |                  |                      | x            |
+
+This keeps `qa-channel` as the broad product-behavior suite while Matrix,
+Telegram, and future live transports share one explicit transport-contract
+checklist.
+
 For a disposable Linux VM lane without bringing Docker into the QA path, run:

 ```bash
--- a/docs/concepts/system-prompt.md
+++ b/docs/concepts/system-prompt.md
@@ -110,9 +110,12 @@ heartbeats are disabled for the default agent or
 files concise — especially `MEMORY.md`, which can grow over time and lead to
 unexpectedly high context usage and more frequent compaction.

-> **Note:** `memory/*.md` daily files are **not** injected automatically. They
-> are accessed on demand via the `memory_search` and `memory_get` tools, so they
-> do not count against the context window unless the model explicitly reads them.
+> **Note:** `memory/*.md` daily files are **not** part of the normal bootstrap
+> Project Context. On ordinary turns they are accessed on demand via the
+> `memory_search` and `memory_get` tools, so they do not count against the
+> context window unless the model explicitly reads them. Bare `/new` and
+> `/reset` turns are the exception: the runtime can prepend recent daily memory
+> as a one-shot startup-context block for that first turn.

 Large files are truncated with a marker. The max per-file size is controlled by
 `agents.defaults.bootstrapMaxChars` (default: 20000). Total injected bootstrap
--- a/docs/docs.json
+++ b/docs/docs.json
@@ -52,6 +52,10 @@
    ]
  },
  "redirects": [
+    {
+      "source": "/mcp",
+      "destination": "/cli/mcp"
+    },
    {
      "source": "/providers/modelstudio",
      "destination": "/providers/qwen"
@@ -1116,6 +1120,8 @@
                  "plugins/codex-harness",
                  "plugins/webhooks",
                  "plugins/voice-call",
+                  "plugins/memory-wiki",
+                  "plugins/zalouser",
                  {
                    "group": "Building Plugins",
                    "pages": [
@@ -1188,6 +1194,7 @@
                      "tools/gemini-search",
                      "tools/grok-search",
                      "tools/kimi-search",
+                      "tools/minimax-search",
                      "tools/ollama-search",
                      "tools/perplexity-search",
                      "tools/searxng-search",
@@ -1237,22 +1244,24 @@
                "group": "Providers",
                "pages": [
                  "providers/alibaba",
-                  "providers/anthropic",
-                  "providers/arcee",
                  "providers/bedrock",
                  "providers/bedrock-mantle",
+                  "providers/anthropic",
+                  "providers/arcee",
                  "providers/chutes",
-                  "providers/comfy",
                  "providers/claude-max-api-proxy",
                  "providers/cloudflare-ai-gateway",
+                  "providers/comfy",
                  "providers/deepgram",
                  "providers/deepseek",
                  "providers/fal",
+                  "providers/fireworks",
                  "providers/github-copilot",
                  "providers/glm",
                  "providers/google",
                  "providers/groq",
                  "providers/huggingface",
+                  "providers/inferrs",
                  "providers/kilocode",
                  "providers/litellm",
                  "providers/minimax",
@@ -1274,9 +1283,9 @@
                  "providers/together",
                  "providers/venice",
                  "providers/vercel-ai-gateway",
-                  "providers/vydra",
                  "providers/vllm",
                  "providers/volcengine",
+                  "providers/vydra",
                  "providers/xai",
                  "providers/xiaomi",
                  "providers/zai"
@@ -1455,6 +1464,7 @@
                      "cli/agent",
                      "cli/agents",
                      "cli/hooks",
+                      "cli/infer",
                      "cli/memory",
                      "cli/message",
                      "cli/models",
@@ -1505,7 +1515,8 @@
                      "cli/completion",
                      "cli/dns",
                      "cli/docs",
-                      "cli/mcp"
+                      "cli/mcp",
+                      "cli/wiki"
                    ]
                  }
                ]
@@ -1539,6 +1550,7 @@
                  "reference/api-usage-costs",
                  "reference/transcript-hygiene",
                  "reference/memory-config",
+                  "reference/rich-output-protocol",
                  "date-time"
                ]
              },
--- a/docs/gateway/cli-backends.md
+++ b/docs/gateway/cli-backends.md
@@ -263,6 +263,31 @@ CLI backend defaults are now part of the plugin surface:
 - Backend-specific config cleanup stays plugin-owned through the optional
  `normalizeConfig` hook.

+Plugins that need tiny prompt/message compatibility shims can declare
+bidirectional text transforms without replacing a provider or CLI backend:
+
+```typescript
+api.registerTextTransforms({
+  input: [
+    { from: /red basket/g, to: "blue basket" },
+    { from: /paper ticket/g, to: "digital ticket" },
+    { from: /left shelf/g, to: "right shelf" },
+  ],
+  output: [
+    { from: /blue basket/g, to: "red basket" },
+    { from: /digital ticket/g, to: "paper ticket" },
+    { from: /right shelf/g, to: "left shelf" },
+  ],
+});
+```
+
+`input` rewrites the system prompt and user prompt passed to the CLI. `output`
+rewrites streamed assistant deltas and parsed final text before OpenClaw handles
+its own control markers and channel delivery.
+
+For CLIs that emit Claude Code stream-json compatible JSONL, set
+`jsonlDialect: "claude-stream-json"` on that backend's config.
+
 ## Bundle MCP overlays

 CLI backends do **not** receive OpenClaw tool calls directly, but a backend can
--- a/docs/gateway/configuration-reference.md
+++ b/docs/gateway/configuration-reference.md
@@ -1224,6 +1224,7 @@ Periodic heartbeat runs.
        prompt: "Read HEARTBEAT.md if it exists...",
        ackMaxChars: 300,
        suppressToolErrorWarnings: false,
+        timeoutSeconds: 45,
      },
    },
  },
@@ -1233,6 +1234,7 @@ Periodic heartbeat runs.
 - `every`: duration string (ms/s/m/h). Default: `30m` (API-key auth) or `1h` (OAuth auth). Set to `0m` to disable.
 - `includeSystemPromptSection`: when false, omits the Heartbeat section from the system prompt and skips `HEARTBEAT.md` injection into bootstrap context. Default: `true`.
 - `suppressToolErrorWarnings`: when true, suppresses tool error warning payloads during heartbeat runs.
+- `timeoutSeconds`: maximum time in seconds allowed for a heartbeat agent turn before it is aborted. Leave unset to use `agents.defaults.timeoutSeconds`.
 - `directPolicy`: direct/DM delivery policy. `allow` (default) permits direct-target delivery. `block` suppresses direct-target delivery and emits `reason=dm-blocked`.
 - `lightContext`: when true, heartbeat runs use lightweight bootstrap context and keep only `HEARTBEAT.md` from workspace bootstrap files.
 - `isolatedSession`: when true, each heartbeat runs in a fresh session with no prior conversation history. Same isolation pattern as cron `sessionTarget: "isolated"`. Reduces per-heartbeat token cost from ~100K to ~2-5K tokens.
@@ -2893,6 +2895,8 @@ See [Plugins](/tools/plugin).
      enabled: true,
      basePath: "/openclaw",
      // root: "dist/control-ui",
+      // embedSandbox: "scripts", // strict | scripts | trusted
+      // allowExternalEmbedUrls: false, // dangerous: allow absolute external http(s) embed URLs
      // allowedOrigins: ["https://control.example.com"], // required for non-loopback Control UI
      // dangerouslyAllowHostHeaderOriginFallback: false, // dangerous Host-header origin fallback mode
      // allowInsecureAuth: false,
--- a/docs/gateway/heartbeat.md
+++ b/docs/gateway/heartbeat.md
@@ -146,6 +146,7 @@ Example: two agents, only the second agent runs heartbeats.
          every: "1h",
          target: "whatsapp",
          to: "+15551234567",
+          timeoutSeconds: 45,
          prompt: "Read HEARTBEAT.md if it exists (workspace context). Follow it strictly. Do not infer or repeat old tasks from prior chats. If nothing needs attention, reply HEARTBEAT_OK.",
        },
      },
--- a/docs/help/gpt54-codex-agentic-parity-maintainers.md
+++ b/docs/help/gpt54-codex-agentic-parity-maintainers.md
@@ -0,0 +1,164 @@
+# GPT-5.4 / Codex Parity Maintainer Notes
+
+This note explains how to review the GPT-5.4 / Codex parity program as four merge units without losing the original six-contract architecture.
+
+## Merge units
+
+### PR A: strict-agentic execution
+
+Owns:
+
+- `executionContract`
+- GPT-5-first same-turn follow-through
+- `update_plan` as non-terminal progress tracking
+- explicit blocked states instead of plan-only silent stops
+
+Does not own:
+
+- auth/runtime failure classification
+- permission truthfulness
+- replay/continuation redesign
+- parity benchmarking
+
+### PR B: runtime truthfulness
+
+Owns:
+
+- Codex OAuth scope correctness
+- typed provider/runtime failure classification
+- truthful `/elevated full` availability and blocked reasons
+
+Does not own:
+
+- tool schema normalization
+- replay/liveness state
+- benchmark gating
+
+### PR C: execution correctness
+
+Owns:
+
+- provider-owned OpenAI/Codex tool compatibility
+- parameter-free strict schema handling
+- replay-invalid surfacing
+- paused, blocked, and abandoned long-task state visibility
+
+Does not own:
+
+- self-elected continuation
+- generic Codex dialect behavior outside provider hooks
+- benchmark gating
+
+### PR D: parity harness
+
+Owns:
+
+- first-wave GPT-5.4 vs Opus 4.6 scenario pack
+- parity documentation
+- parity report and release-gate mechanics
+
+Does not own:
+
+- runtime behavior changes outside QA-lab
+- auth/proxy/DNS simulation inside the harness
+
+## Mapping back to the original six contracts
+
+| Original contract                        | Merge unit |
+| ---------------------------------------- | ---------- |
+| Provider transport/auth correctness      | PR B       |
+| Tool contract/schema compatibility       | PR C       |
+| Same-turn execution                      | PR A       |
+| Permission truthfulness                  | PR B       |
+| Replay/continuation/liveness correctness | PR C       |
+| Benchmark/release gate                   | PR D       |
+
+## Review order
+
+1. PR A
+2. PR B
+3. PR C
+4. PR D
+
+PR D is the proof layer. It should not be the reason runtime-correctness PRs are delayed.
+
+## What to look for
+
+### PR A
+
+- GPT-5 runs act or fail closed instead of stopping at commentary
+- `update_plan` no longer looks like progress by itself
+- behavior stays GPT-5-first and embedded-Pi scoped
+
+### PR B
+
+- auth/proxy/runtime failures stop collapsing into generic “model failed” handling
+- `/elevated full` is only described as available when it is actually available
+- blocked reasons are visible to both the model and the user-facing runtime
+
+### PR C
+
+- strict OpenAI/Codex tool registration behaves predictably
+- parameter-free tools do not fail strict schema checks
+- replay and compaction outcomes preserve truthful liveness state
+
+### PR D
+
+- the scenario pack is understandable and reproducible
+- the pack includes a mutating replay-safety lane, not only read-only flows
+- reports are readable by humans and automation
+- parity claims are evidence-backed, not anecdotal
+
+Expected artifacts from PR D:
+
+- `qa-suite-report.md` / `qa-suite-summary.json` for each model run
+- `qa-agentic-parity-report.md` with aggregate and scenario-level comparison
+- `qa-agentic-parity-summary.json` with a machine-readable verdict
+
+## Release gate
+
+Do not claim GPT-5.4 parity or superiority over Opus 4.6 until:
+
+- PR A, PR B, and PR C are merged
+- PR D runs the first-wave parity pack cleanly
+- runtime-truthfulness regression suites remain green
+- the parity report shows no fake-success cases and no regression in stop behavior
+
+```mermaid
+flowchart LR
+    A["PR A-C merged"] --> B["Run GPT-5.4 parity pack"]
+    A --> C["Run Opus 4.6 parity pack"]
+    B --> D["qa-suite-summary.json"]
+    C --> E["qa-suite-summary.json"]
+    D --> F["qa parity-report"]
+    E --> F
+    F --> G["Markdown report + JSON verdict"]
+    G --> H{"Pass?"}
+    H -- "yes" --> I["Parity claim allowed"]
+    H -- "no" --> J["Keep runtime fixes / review loop open"]
+```
+
+The parity harness is not the only evidence source. Keep this split explicit in review:
+
+- PR D owns the scenario-based GPT-5.4 vs Opus 4.6 comparison
+- PR B deterministic suites still own auth/proxy/DNS and full-access truthfulness evidence
+
+## Goal-to-evidence map
+
+| Completion gate item                     | Primary owner | Review artifact                                                     |
+| ---------------------------------------- | ------------- | ------------------------------------------------------------------- |
+| No plan-only stalls                      | PR A          | strict-agentic runtime tests and `approval-turn-tool-followthrough` |
+| No fake progress or fake tool completion | PR A + PR D   | parity fake-success count plus scenario-level report details        |
+| No false `/elevated full` guidance       | PR B          | deterministic runtime-truthfulness suites                           |
+| Replay/liveness failures remain explicit | PR C + PR D   | lifecycle/replay suites plus `compaction-retry-mutating-tool`       |
+| GPT-5.4 matches or beats Opus 4.6        | PR D          | `qa-agentic-parity-report.md` and `qa-agentic-parity-summary.json`  |
+
+## Reviewer shorthand: before vs after
+
+| User-visible problem before                                 | Review signal after                                                                     |
+| ----------------------------------------------------------- | --------------------------------------------------------------------------------------- |
+| GPT-5.4 stopped after planning                              | PR A shows act-or-block behavior instead of commentary-only completion                  |
+| Tool use felt brittle with strict OpenAI/Codex schemas      | PR C keeps tool registration and parameter-free invocation predictable                  |
+| `/elevated full` hints were sometimes misleading            | PR B ties guidance to actual runtime capability and blocked reasons                     |
+| Long tasks could disappear into replay/compaction ambiguity | PR C emits explicit paused, blocked, abandoned, and replay-invalid state                |
+| Parity claims were anecdotal                                | PR D produces a report plus JSON verdict with the same scenario coverage on both models |
--- a/docs/help/gpt54-codex-agentic-parity.md
+++ b/docs/help/gpt54-codex-agentic-parity.md
@@ -0,0 +1,219 @@
+# GPT-5.4 / Codex Agentic Parity in OpenClaw
+
+OpenClaw already worked well with tool-using frontier models, but GPT-5.4 and Codex-style models were still underperforming in a few practical ways:
+
+- they could stop after planning instead of doing the work
+- they could use strict OpenAI/Codex tool schemas incorrectly
+- they could ask for `/elevated full` even when full access was impossible
+- they could lose long-running task state during replay or compaction
+- parity claims against Claude Opus 4.6 were based on anecdotes instead of repeatable scenarios
+
+This parity program fixes those gaps in four reviewable slices.
+
+## What changed
+
+### PR A: strict-agentic execution
+
+This slice adds an opt-in `strict-agentic` execution contract for embedded Pi GPT-5 runs.
+
+When enabled, OpenClaw stops accepting plan-only turns as “good enough” completion. If the model only says what it intends to do and does not actually use tools or make progress, OpenClaw retries with an act-now steer and then fails closed with an explicit blocked state instead of silently ending the task.
+
+This improves the GPT-5.4 experience most on:
+
+- short “ok do it” follow-ups
+- code tasks where the first step is obvious
+- flows where `update_plan` should be progress tracking rather than filler text
+
+### PR B: runtime truthfulness
+
+This slice makes OpenClaw tell the truth about two things:
+
+- why the provider/runtime call failed
+- whether `/elevated full` is actually available
+
+That means GPT-5.4 gets better runtime signals for missing scope, auth refresh failures, HTML 403 auth failures, proxy issues, DNS or timeout failures, and blocked full-access modes. The model is less likely to hallucinate the wrong remediation or keep asking for a permission mode the runtime cannot provide.
+
+### PR C: execution correctness
+
+This slice improves two kinds of correctness:
+
+- provider-owned OpenAI/Codex tool-schema compatibility
+- replay and long-task liveness surfacing
+
+The tool-compat work reduces schema friction for strict OpenAI/Codex tool registration, especially around parameter-free tools and strict object-root expectations. The replay/liveness work makes long-running tasks more observable, so paused, blocked, and abandoned states are visible instead of disappearing into generic failure text.
+
+### PR D: parity harness
+
+This slice adds the first-wave QA-lab parity pack so GPT-5.4 and Opus 4.6 can be exercised through the same scenarios and compared using shared evidence.
+
+The parity pack is the proof layer. It does not change runtime behavior by itself.
+
+After you have two `qa-suite-summary.json` artifacts, generate the release-gate comparison with:
+
+```bash
+pnpm openclaw qa parity-report \
+  --repo-root . \
+  --candidate-summary .artifacts/qa-e2e/gpt54/qa-suite-summary.json \
+  --baseline-summary .artifacts/qa-e2e/opus46/qa-suite-summary.json \
+  --output-dir .artifacts/qa-e2e/parity
+```
+
+That command writes:
+
+- a human-readable Markdown report
+- a machine-readable JSON verdict
+- an explicit `pass` / `fail` gate result
+
+## Why this improves GPT-5.4 in practice
+
+Before this work, GPT-5.4 on OpenClaw could feel less agentic than Opus in real coding sessions because the runtime tolerated behaviors that are especially harmful for GPT-5-style models:
+
+- commentary-only turns
+- schema friction around tools
+- vague permission feedback
+- silent replay or compaction breakage
+
+The goal is not to make GPT-5.4 imitate Opus. The goal is to give GPT-5.4 a runtime contract that rewards real progress, supplies cleaner tool and permission semantics, and turns failure modes into explicit machine- and human-readable states.
+
+That changes the user experience from:
+
+- “the model had a good plan but stopped”
+
+to:
+
+- “the model either acted, or OpenClaw surfaced the exact reason it could not”
+
+## Before vs after for GPT-5.4 users
+
+| Before this program                                                                            | After PR A-D                                                                             |
+| ---------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------- |
+| GPT-5.4 could stop after a reasonable plan without taking the next tool step                   | PR A turns “plan only” into “act now or surface a blocked state”                         |
+| Strict tool schemas could reject parameter-free or OpenAI/Codex-shaped tools in confusing ways | PR C makes provider-owned tool registration and invocation more predictable              |
+| `/elevated full` guidance could be vague or wrong in blocked runtimes                          | PR B gives GPT-5.4 and the user truthful runtime and permission hints                    |
+| Replay or compaction failures could feel like the task silently disappeared                    | PR C surfaces paused, blocked, abandoned, and replay-invalid outcomes explicitly         |
+| “GPT-5.4 feels worse than Opus” was mostly anecdotal                                           | PR D turns that into the same scenario pack, the same metrics, and a hard pass/fail gate |
+
+## Architecture
+
+```mermaid
+flowchart TD
+    A["User request"] --> B["Embedded Pi runtime"]
+    B --> C["Strict-agentic execution contract"]
+    B --> D["Provider-owned tool compatibility"]
+    B --> E["Runtime truthfulness"]
+    B --> F["Replay and liveness state"]
+    C --> G["Tool call or explicit blocked state"]
+    D --> G
+    E --> G
+    F --> G
+    G --> H["QA-lab parity pack"]
+    H --> I["Scenario report and parity gate"]
+```
+
+## Release flow
+
+```mermaid
+flowchart LR
+    A["Merged runtime slices (PR A-C)"] --> B["Run GPT-5.4 parity pack"]
+    A --> C["Run Opus 4.6 parity pack"]
+    B --> D["qa-suite-summary.json"]
+    C --> E["qa-suite-summary.json"]
+    D --> F["openclaw qa parity-report"]
+    E --> F
+    F --> G["qa-agentic-parity-report.md"]
+    F --> H["qa-agentic-parity-summary.json"]
+    H --> I{"Gate pass?"}
+    I -- "yes" --> J["Evidence-backed parity claim"]
+    I -- "no" --> K["Keep runtime/review loop open"]
+```
+
+## Scenario pack
+
+The first-wave parity pack currently covers five scenarios:
+
+### `approval-turn-tool-followthrough`
+
+Checks that the model does not stop at “I’ll do that” after a short approval. It should take the first concrete action in the same turn.
+
+### `model-switch-tool-continuity`
+
+Checks that tool-using work remains coherent across model/runtime switching boundaries instead of resetting into commentary or losing execution context.
+
+### `source-docs-discovery-report`
+
+Checks that the model can read source and docs, synthesize findings, and continue the task agentically rather than producing a thin summary and stopping early.
+
+### `image-understanding-attachment`
+
+Checks that mixed-mode tasks involving attachments remain actionable and do not collapse into vague narration.
+
+### `compaction-retry-mutating-tool`
+
+Checks that a task with a real mutating write keeps replay-unsafety explicit instead of quietly looking replay-safe if the run compacts, retries, or loses reply state under pressure.
+
+## Scenario matrix
+
+| Scenario                           | What it tests                           | Good GPT-5.4 behavior                                                          | Failure signal                                                                 |
+| ---------------------------------- | --------------------------------------- | ------------------------------------------------------------------------------ | ------------------------------------------------------------------------------ |
+| `approval-turn-tool-followthrough` | Short approval turns after a plan       | Starts the first concrete tool action immediately instead of restating intent  | plan-only follow-up, no tool activity, or blocked turn without a real blocker  |
+| `model-switch-tool-continuity`     | Runtime/model switching under tool use  | Preserves task context and continues acting coherently                         | resets into commentary, loses tool context, or stops after switch              |
+| `source-docs-discovery-report`     | Source reading + synthesis + action     | Finds sources, uses tools, and produces a useful report without stalling       | thin summary, missing tool work, or incomplete-turn stop                       |
+| `image-understanding-attachment`   | Attachment-driven agentic work          | Interprets the attachment, connects it to tools, and continues the task        | vague narration, attachment ignored, or no concrete next action                |
+| `compaction-retry-mutating-tool`   | Mutating work under compaction pressure | Performs a real write and keeps replay-unsafety explicit after the side effect | mutating write happens but replay safety is implied, missing, or contradictory |
+
+## Release gate
+
+GPT-5.4 can only be considered at parity or better when the merged runtime passes the parity pack and the runtime-truthfulness regressions at the same time.
+
+Required outcomes:
+
+- no plan-only stall when the next tool action is clear
+- no fake completion without real execution
+- no incorrect `/elevated full` guidance
+- no silent replay or compaction abandonment
+- parity-pack metrics that are at least as strong as the agreed Opus 4.6 baseline
+
+For the first-wave harness, the gate compares:
+
+- completion rate
+- unintended-stop rate
+- valid-tool-call rate
+- fake-success count
+
+Parity evidence is intentionally split across two layers:
+
+- PR D proves same-scenario GPT-5.4 vs Opus 4.6 behavior with QA-lab
+- PR B deterministic suites prove auth, proxy, DNS, and `/elevated full` truthfulness outside the harness
+
+## Goal-to-evidence matrix
+
+| Completion gate item                                     | Owning PR   | Evidence source                                                    | Pass signal                                                                              |
+| -------------------------------------------------------- | ----------- | ------------------------------------------------------------------ | ---------------------------------------------------------------------------------------- |
+| GPT-5.4 no longer stalls after planning                  | PR A        | `approval-turn-tool-followthrough` plus PR A runtime suites        | approval turns trigger real work or an explicit blocked state                            |
+| GPT-5.4 no longer fakes progress or tool completion      | PR A + PR D | parity report scenario outcomes and fake-success count             | no suspicious pass results and no commentary-only completion                             |
+| GPT-5.4 no longer gives false `/elevated full` guidance  | PR B        | deterministic truthfulness suites                                  | blocked reasons and full-access hints stay runtime-accurate                              |
+| Replay/liveness failures stay explicit                   | PR C + PR D | PR C lifecycle/replay suites plus `compaction-retry-mutating-tool` | mutating work keeps replay-unsafety explicit instead of silently disappearing            |
+| GPT-5.4 matches or beats Opus 4.6 on the agreed metrics  | PR D        | `qa-agentic-parity-report.md` and `qa-agentic-parity-summary.json` | same scenario coverage and no regression on completion, stop behavior, or valid tool use |
+
+## How to read the parity verdict
+
+Use the verdict in `qa-agentic-parity-summary.json` as the final machine-readable decision for the first-wave parity pack.
+
+- `pass` means GPT-5.4 covered the same scenarios as Opus 4.6 and did not regress on the agreed aggregate metrics.
+- `fail` means at least one hard gate tripped: weaker completion, worse unintended stops, weaker valid tool use, any fake-success case, or mismatched scenario coverage.
+- “shared/base CI issue” is not itself a parity result. If CI noise outside PR D blocks a run, the verdict should wait for a clean merged-runtime execution instead of being inferred from branch-era logs.
+- Auth, proxy, DNS, and `/elevated full` truthfulness still come from PR B’s deterministic suites, so the final release claim needs both: a passing PR D parity verdict and green PR B truthfulness coverage.
+
+## Who should enable `strict-agentic`
+
+Use `strict-agentic` when:
+
+- the agent is expected to act immediately when a next step is obvious
+- GPT-5.4 or Codex-family models are the primary runtime
+- you prefer explicit blocked states over “helpful” recap-only replies
+
+Keep the default contract when:
+
+- you want the existing looser behavior
+- you are not using GPT-5-family models
+- you are testing prompts rather than runtime enforcement
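Illustrative note, not part of the patch: the hard gates described under "How to read the parity verdict" can be pictured as one comparison function. This is a minimal sketch under stated assumptions. The `SuiteSummary` field names and the `parityVerdict` helper are hypothetical, because the real `qa-suite-summary.json` schema is not shown in this diff, and the sketch assumes that any fake-success case in the candidate run trips the gate.

```typescript
// Hypothetical summary shape; the real qa-suite-summary.json fields may differ.
interface SuiteSummary {
  scenarios: string[]; // scenario ids covered by the run
  completionRate: number; // 0..1, higher is better
  unintendedStopRate: number; // 0..1, lower is better
  validToolCallRate: number; // 0..1, higher is better
  fakeSuccessCount: number; // assumed: must be 0 for the candidate
}

interface ParityVerdict {
  verdict: "pass" | "fail";
  reasons: string[];
}

// Coverage matches when both runs list exactly the same scenario ids.
function sameScenarios(a: string[], b: string[]): boolean {
  const left = a.slice().sort();
  const right = b.slice().sort();
  return left.length === right.length && left.every((id, i) => id === right[i]);
}

// Compare a candidate run (for example GPT-5.4) against a baseline run.
function parityVerdict(candidate: SuiteSummary, baseline: SuiteSummary): ParityVerdict {
  const reasons: string[] = [];
  if (!sameScenarios(candidate.scenarios, baseline.scenarios)) {
    reasons.push("mismatched scenario coverage");
  }
  if (candidate.completionRate < baseline.completionRate) {
    reasons.push("weaker completion");
  }
  if (candidate.unintendedStopRate > baseline.unintendedStopRate) {
    reasons.push("worse unintended stops");
  }
  if (candidate.validToolCallRate < baseline.validToolCallRate) {
    reasons.push("weaker valid tool use");
  }
  if (candidate.fakeSuccessCount > 0) {
    reasons.push("fake-success case");
  }
  return { verdict: reasons.length === 0 ? "pass" : "fail", reasons };
}
```

Collecting every tripped gate, rather than returning on the first one, keeps the result useful for a report that has to explain why a run failed.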
--- a/docs/help/testing.md
+++ b/docs/help/testing.md
@@ -65,6 +65,27 @@ These commands sit beside the main test suites when you need QA-lab realism:
    `.artifacts/qa-e2e/...`.
 - `pnpm qa:lab:up`
  - Starts the Docker-backed QA site for operator-style QA work.
+- `pnpm openclaw qa matrix`
+  - Runs the Matrix live QA lane against a disposable Docker-backed Tuwunel homeserver.
+  - Provisions three temporary Matrix users (`driver`, `sut`, `observer`) plus one private room, then starts a QA gateway child with the real Matrix plugin as the SUT transport.
+  - Uses the pinned stable Tuwunel image `ghcr.io/matrix-construct/tuwunel:v1.5.1` by default. Override with `OPENCLAW_QA_MATRIX_TUWUNEL_IMAGE` when you need to test a different image.
+  - Writes a Matrix QA report, summary, and observed-events artifact under `.artifacts/qa-e2e/...`.
+- `pnpm openclaw qa telegram`
+  - Runs the Telegram live QA lane against a real private group using the driver and SUT bot tokens from env.
+  - Requires `OPENCLAW_QA_TELEGRAM_GROUP_ID`, `OPENCLAW_QA_TELEGRAM_DRIVER_BOT_TOKEN`, and `OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN`. The group id must be the numeric Telegram chat id.
+  - Requires two distinct bots in the same private group, with the SUT bot exposing a Telegram username.
+  - For stable bot-to-bot observation, enable Bot-to-Bot Communication Mode in `@BotFather` for both bots and ensure the driver bot can observe group bot traffic.
+  - Writes a Telegram QA report, summary, and observed-messages artifact under `.artifacts/qa-e2e/...`.
+
+Live transport lanes share one standard contract so new transports do not drift.
+
+`qa-channel` remains the broad synthetic QA suite and is not part of the live
+transport coverage matrix below:
+
+| Lane     | Canary | Mention gating | Allowlist block | Top-level reply | Restart resume | Thread follow-up | Thread isolation | Reaction observation | Help command |
+| -------- | ------ | -------------- | --------------- | --------------- | -------------- | ---------------- | ---------------- | -------------------- | ------------ |
+| Matrix   | x      | x              | x               | x               | x              | x                | x                | x                    |              |
+| Telegram | x      |                |                 |                 |                |                  |                  |                      | x            |

 ## Test suites (what runs where)

@@ -438,6 +459,9 @@ Docker notes:
 - Docker enables the image and MCP/tool probes by default. Set
  `OPENCLAW_LIVE_CODEX_HARNESS_IMAGE_PROBE=0` or
  `OPENCLAW_LIVE_CODEX_HARNESS_MCP_PROBE=0` when you need a narrower debug run.
+- Docker also exports `OPENCLAW_AGENT_HARNESS_FALLBACK=none`, matching the live
+  test config so `openai-codex/*` or PI fallback cannot hide a Codex harness
+  regression.

 ### Recommended live recipes

--- a/docs/plugins/architecture.md
+++ b/docs/plugins/architecture.md
@@ -519,10 +519,30 @@ The manifest is the control-plane source of truth. OpenClaw uses it to:
 - validate `plugins.entries.<id>.config`
 - augment Control UI labels/placeholders
 - show install/catalog metadata
+- preserve cheap activation and setup descriptors without loading plugin runtime

 For native plugins, the runtime module is the data-plane part. It registers
 actual behavior such as hooks, tools, commands, or provider flows.

+Optional manifest `activation` and `setup` blocks stay on the control plane.
+They are metadata-only descriptors for activation planning and setup discovery;
+they do not replace runtime registration, `register(...)`, or `setupEntry`.
+The first live activation consumers now use manifest command, channel, and provider hints
+to narrow plugin loading before broader registry materialization:
+
+- CLI loading narrows to plugins that own the requested primary command
+- channel setup/plugin resolution narrows to plugins that own the requested
+  channel id
+- explicit provider setup/runtime resolution narrows to plugins that own the
+  requested provider id
+
+Setup discovery now prefers descriptor-owned ids such as `setup.providers` and
+`setup.cliBackends` to narrow candidate plugins before it falls back to
+`setup-api` for plugins that still need setup-time runtime hooks. If more than
+one discovered plugin claims the same normalized setup provider or CLI backend
+id, setup lookup refuses the ambiguous owner instead of relying on discovery
+order.
+
 ### What the loader caches

 OpenClaw keeps short in-process caches for:
--- a/docs/plugins/codex-harness.md
+++ b/docs/plugins/codex-harness.md
@@ -452,7 +452,9 @@ continue through the normal OpenClaw delivery path.

 When the selected model uses the Codex harness, native thread compaction is
 delegated to Codex app-server. OpenClaw keeps a transcript mirror for channel
-history, search, `/new`, `/reset`, and future model or harness switching.
+history, search, `/new`, `/reset`, and future model or harness switching. The
+mirror includes the user prompt, final assistant text, and lightweight Codex
+reasoning or plan records when the app-server emits them.

 Media generation does not require PI. Image, video, music, PDF, TTS, and media
 understanding continue to use the matching provider/model settings such as
--- a/docs/plugins/manifest.md
+++ b/docs/plugins/manifest.md
@@ -47,6 +47,10 @@ Use it for:
 - config validation
 - auth and onboarding metadata that should be available without booting plugin
  runtime
+- cheap activation hints that control-plane surfaces can inspect before runtime
+  loads
+- cheap setup descriptors that setup/onboarding surfaces can inspect before
+  runtime loads
 - alias and auto-enable metadata that should resolve before plugin runtime loads
 - shorthand model-family ownership metadata that should auto-activate the
  plugin before runtime loads
@@ -152,6 +156,8 @@ Those belong in your plugin code and `package.json`.
 | `providerAuthAliases`               | No       | `Record<string, string>`         | Provider ids that should reuse another provider id for auth lookup, for example a coding provider that shares the base provider API key and auth profiles.                                                   |
 | `channelEnvVars`                    | No       | `Record<string, string[]>`       | Cheap channel env metadata that OpenClaw can inspect without loading plugin code. Use this for env-driven channel setup or auth surfaces that generic startup/config helpers should see.                     |
 | `providerAuthChoices`               | No       | `object[]`                       | Cheap auth-choice metadata for onboarding pickers, preferred-provider resolution, and simple CLI flag wiring.                                                                                                |
+| `activation`                        | No       | `object`                         | Cheap activation hints for provider, command, channel, route, and capability-triggered loading. Metadata only; plugin runtime still owns actual behavior.                                                    |
+| `setup`                             | No       | `object`                         | Cheap setup/onboarding descriptors that discovery and setup surfaces can inspect without loading plugin runtime.                                                                                             |
 | `contracts`                         | No       | `object`                         | Static bundled capability snapshot for speech, realtime transcription, realtime voice, media-understanding, image-generation, music-generation, video-generation, web-fetch, web search, and tool ownership. |
 | `channelConfigs`                    | No       | `Record<string, object>`         | Manifest-owned channel config metadata merged into discovery and validation surfaces before runtime loads.                                                                                                   |
 | `skills`                            | No       | `string[]`                       | Skill directories to load, relative to the plugin root.                                                                                                                                                      |
@@ -208,6 +214,101 @@ uses this metadata for diagnostics without importing plugin runtime code.
 | `kind`       | No       | `"runtime-slash"` | Marks the alias as a chat slash command rather than a root CLI command. |
 | `cliCommand` | No       | `string`          | Related root CLI command to suggest for CLI operations, if one exists.  |

+## activation reference
+
+Use `activation` when the plugin can cheaply declare which control-plane events
+should activate it later.
+
+This block is metadata only. It does not register runtime behavior, and it does
+not replace `register(...)`, `setupEntry`, or other runtime/plugin entrypoints.
+Current consumers use it as a narrowing hint before broader plugin loading, so
+missing activation metadata usually only costs performance; it should not
+change correctness while legacy manifest ownership fallbacks still exist.
+
+```json
+{
+  "activation": {
+    "onProviders": ["openai"],
+    "onCommands": ["models"],
+    "onChannels": ["web"],
+    "onRoutes": ["gateway-webhook"],
+    "onCapabilities": ["provider", "tool"]
+  }
+}
+```
+
+| Field            | Required | Type                                                 | What it means                                                     |
+| ---------------- | -------- | ---------------------------------------------------- | ----------------------------------------------------------------- |
+| `onProviders`    | No       | `string[]`                                           | Provider ids that should activate this plugin when requested.     |
+| `onCommands`     | No       | `string[]`                                           | Command ids that should activate this plugin.                     |
+| `onChannels`     | No       | `string[]`                                           | Channel ids that should activate this plugin.                     |
+| `onRoutes`       | No       | `string[]`                                           | Route kinds that should activate this plugin.                     |
+| `onCapabilities` | No       | `Array<"provider" \| "channel" \| "tool" \| "hook">` | Broad capability hints used by control-plane activation planning. |
+
+Current live consumers:
+
+- command-triggered CLI planning falls back to legacy `commandAliases[].cliCommand`
+  or `commandAliases[].name` when explicit command activation metadata is missing
+- channel-triggered setup/channel planning falls back to legacy `channels[]`
+  ownership when explicit channel activation metadata is missing
+- provider-triggered setup/runtime planning falls back to legacy
+  `providers[]` and top-level `cliBackends[]` ownership when explicit provider
+  activation metadata is missing
+
+## setup reference
+
+Use `setup` when setup and onboarding surfaces need cheap plugin-owned metadata
+before runtime loads.
+
+```json
+{
+  "setup": {
+    "providers": [
+      {
+        "id": "openai",
+        "authMethods": ["api-key"],
+        "envVars": ["OPENAI_API_KEY"]
+      }
+    ],
+    "cliBackends": ["openai-cli"],
+    "configMigrations": ["legacy-openai-auth"],
+    "requiresRuntime": false
+  }
+}
+```
+
+Top-level `cliBackends` stays valid and continues to describe CLI inference
+backends. `setup.cliBackends` is the setup-specific descriptor surface for
+control-plane/setup flows that should stay metadata-only.
+
+When present, `setup.providers` and `setup.cliBackends` are the preferred
+descriptor-first lookup surface for setup discovery. If the descriptor only
+narrows the candidate plugin and setup still needs richer setup-time runtime
+hooks, set `requiresRuntime: true` and keep `setup-api` in place as the
+fallback execution path.
+
+Because setup lookup can execute plugin-owned `setup-api` code, normalized
+`setup.providers[].id` and `setup.cliBackends[]` values must stay unique across
+discovered plugins. Ambiguous ownership fails closed instead of picking a
+winner from discovery order.
+
+### setup.providers reference
+
+| Field         | Required | Type       | What it means                                                                        |
+| ------------- | -------- | ---------- | ------------------------------------------------------------------------------------ |
+| `id`          | Yes      | `string`   | Provider id exposed during setup or onboarding. Keep normalized ids globally unique. |
+| `authMethods` | No       | `string[]` | Setup/auth method ids this provider supports without loading full runtime.           |
+| `envVars`     | No       | `string[]` | Env vars that generic setup/status surfaces can check before plugin runtime loads.   |
+
+### setup fields
+
+| Field              | Required | Type       | What it means                                                                                       |
+| ------------------ | -------- | ---------- | --------------------------------------------------------------------------------------------------- |
+| `providers`        | No       | `object[]` | Provider setup descriptors exposed during setup and onboarding.                                     |
+| `cliBackends`      | No       | `string[]` | Setup-time backend ids used for descriptor-first setup lookup. Keep normalized ids globally unique. |
+| `configMigrations` | No       | `string[]` | Config migration ids owned by this plugin's setup surface.                                          |
+| `requiresRuntime`  | No       | `boolean`  | Whether setup still needs `setup-api` execution after descriptor lookup.                            |
+
 ## uiHints reference

 `uiHints` is a map from config field names to small rendering hints.
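Illustrative note, not part of the patch: the "setup reference" hunk above says that ambiguous ownership of a normalized setup provider id fails closed instead of picking a winner from discovery order. A minimal sketch of that rule follows. The `PluginManifest` shape is trimmed to the fields named in the manifest tables, `resolveSetupProviderOwner` is a hypothetical helper, and the trim-plus-lowercase normalization is an assumption because the real normalization rule is not shown in this diff.

```typescript
// Trimmed, hypothetical manifest shape: only fields named in the tables above.
interface PluginManifest {
  id: string;
  setup?: {
    providers?: { id: string }[];
    cliBackends?: string[];
    requiresRuntime?: boolean;
  };
}

type OwnerLookup =
  | { kind: "none" }
  | { kind: "owner"; pluginId: string; requiresRuntime: boolean }
  | { kind: "ambiguous"; pluginIds: string[] };

// Assumption: ids are normalized by trimming and lowercasing.
const normalizeId = (id: string): string => id.trim().toLowerCase();

// Find the single plugin whose setup descriptors claim the provider id.
function resolveSetupProviderOwner(
  manifests: PluginManifest[],
  providerId: string,
): OwnerLookup {
  const wanted = normalizeId(providerId);
  const owners = manifests.filter((manifest) =>
    (manifest.setup?.providers ?? []).some((p) => normalizeId(p.id) === wanted),
  );
  // Fail closed: more than one claimant means no owner is chosen at all.
  if (owners.length > 1) {
    return { kind: "ambiguous", pluginIds: owners.map((manifest) => manifest.id) };
  }
  const owner = owners[0];
  if (owner === undefined) {
    return { kind: "none" };
  }
  return {
    kind: "owner",
    pluginId: owner.id,
    requiresRuntime: owner.setup?.requiresRuntime ?? false,
  };
}
```

Returning an explicit `ambiguous` result, rather than the first match, lets the caller report the conflicting plugin ids before any plugin-owned `setup-api` code runs.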
--- a/docs/plugins/memory-wiki.md
+++ b/docs/plugins/memory-wiki.md
@@ -45,6 +45,28 @@ both layers in one pass with `memory_search corpus=all`.
 When you need wiki-specific ranking, provenance, or direct page access, use the
 wiki-native tools instead.

+## Recommended hybrid pattern
+
+A strong default for local-first setups is:
+
+- QMD as the active memory backend for recall and broad semantic search
+- `memory-wiki` in `bridge` mode for durable synthesized knowledge pages
+
+That split works well because each layer stays focused:
+
+- QMD keeps raw notes, session exports, and extra collections searchable
+- `memory-wiki` compiles stable entities, claims, dashboards, and source pages
+
+Practical rule:
+
+- use `memory_search` when you want one broad recall pass across memory
+- use `wiki_search` and `wiki_get` when you want provenance-aware wiki results
+- use `memory_search corpus=all` when you want shared search to span both layers
+
+If bridge mode reports zero exported artifacts, the active memory plugin is not
+yet exposing public bridge inputs. Run `openclaw wiki doctor` first,
+then confirm the active memory plugin supports public artifacts.
+
 ## Vault modes

 `memory-wiki` supports three vault modes:
@@ -304,6 +326,47 @@ Key toggles:
 - `render.createBacklinks`: generate deterministic related blocks
 - `render.createDashboards`: generate dashboard pages

+### Example: QMD + bridge mode
+
+Use this when you want QMD for recall and `memory-wiki` for a maintained
+knowledge layer:
+
+```json5
+{
+  memory: {
+    backend: "qmd",
+  },
+  plugins: {
+    entries: {
+      "memory-wiki": {
+        enabled: true,
+        config: {
+          vaultMode: "bridge",
+          bridge: {
+            enabled: true,
+            readMemoryArtifacts: true,
+            indexDreamReports: true,
+            indexDailyNotes: true,
+            indexMemoryRoot: true,
+            followMemoryEvents: true,
+          },
+          search: {
+            backend: "shared",
+            corpus: "all",
+          },
+          context: {
+            includeCompiledDigestPrompt: false,
+          },
+        },
+      },
+    },
+  },
+}
+```
+
+This keeps:
+
+- QMD in charge of active memory recall
+- `memory-wiki` focused on compiled pages and dashboards
+- prompt shape unchanged until you intentionally enable compiled digest prompts
+
 ## CLI

 `memory-wiki` also exposes a top-level CLI surface:
--- a/docs/plugins/sdk-agent-harness.md
+++ b/docs/plugins/sdk-agent-harness.md
@@ -99,9 +99,9 @@ OpenClaw may fall back to PI when the selected plugin harness fails before a
 turn has produced side effects. Set `OPENCLAW_AGENT_HARNESS_FALLBACK=none` or
 `embeddedHarness.fallback: "none"` to make that fallback a hard failure instead.

-The bundled Codex plugin registers `codex` as its harness id. For compatibility,
-`codex-app-server` and `app-server` also resolve to that same harness when you
-set `OPENCLAW_AGENT_RUNTIME` manually.
+The bundled Codex plugin registers `codex` as its harness id. Core treats that
+as an ordinary plugin harness id; Codex-specific aliases belong in the plugin
+or operator config, not in the shared runtime selector.

 ## Provider plus harness pairing

@@ -133,6 +133,25 @@ OpenClaw requires Codex app-server `0.118.0` or newer. The Codex plugin checks
 the app-server initialize handshake and blocks older or unversioned servers so
 OpenClaw only runs against the protocol surface it has been tested with.

+### Native Codex harness mode
+
+The bundled `codex` harness is the native Codex mode for embedded OpenClaw
+agent turns. Enable the bundled `codex` plugin first, and include `codex` in
+`plugins.allow` if your config uses a restrictive allowlist. This mode differs
+from `openai-codex/*`:
+
+- `openai-codex/*` uses ChatGPT/Codex OAuth through the normal OpenClaw provider
+  path.
+- `codex/*` uses the bundled Codex provider and routes the turn through Codex
+  app-server.
+
+When this mode runs, Codex owns the native thread id, resume behavior,
+compaction, and app-server execution. OpenClaw still owns the chat channel,
+visible transcript mirror, tool policy, approvals, media delivery, and session
+selection. Use `embeddedHarness.runtime: "codex"` with
+`embeddedHarness.fallback: "none"` when you need to prove that the Codex
+app-server path is used and PI fallback is not hiding a broken native harness.
+
 ## Disable PI fallback

 By default, OpenClaw runs embedded agents with `agents.defaults.embeddedHarness`
--- a/docs/plugins/sdk-provider-plugins.md
+++ b/docs/plugins/sdk-provider-plugins.md
@@ -175,6 +175,28 @@ API key auth, and dynamic model resolution.
    `openclaw onboard --acme-ai-api-key <key>` and select
    `acme-ai/acme-large` as their model.

+    If the upstream provider uses different control tokens than OpenClaw, add a
+    small bidirectional text transform instead of replacing the stream path:
+
+    ```typescript
+    api.registerTextTransforms({
+      input: [
+        { from: /red basket/g, to: "blue basket" },
+        { from: /paper ticket/g, to: "digital ticket" },
+        { from: /left shelf/g, to: "right shelf" },
+      ],
+      output: [
+        { from: /blue basket/g, to: "red basket" },
+        { from: /digital ticket/g, to: "paper ticket" },
+        { from: /right shelf/g, to: "left shelf" },
+      ],
+    });
+    ```
+
+    `input` rewrites the final system prompt and text message content before
+    transport. `output` rewrites assistant text deltas and final text before
+    OpenClaw parses its own control markers or delivers to the channel.
+
    For bundled providers that only register one text provider with API-key
    auth plus a single catalog-backed runtime, prefer the narrower
    `defineSingleProviderPluginEntry(...)` helper:
--- a/docs/providers/alibaba.md
+++ b/docs/providers/alibaba.md
@@ -16,57 +16,101 @@ Alibaba Model Studio / DashScope.
 - Also accepted: `DASHSCOPE_API_KEY`, `QWEN_API_KEY`
 - API: DashScope / Model Studio async video generation

-## Quick start
+## Getting started

-1. Set an API key:
-
-```bash
-openclaw onboard --auth-choice qwen-standard-api-key
-```
-
-2. Set a default video model:
-
-```json5
-{
-  agents: {
-    defaults: {
-      videoGenerationModel: {
-        primary: "alibaba/wan2.6-t2v",
+<Steps>
+  <Step title="Set an API key">
+    ```bash
+    openclaw onboard --auth-choice qwen-standard-api-key
+    ```
+  </Step>
+  <Step title="Set a default video model">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          videoGenerationModel: {
+            primary: "alibaba/wan2.6-t2v",
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Step>
+  <Step title="Verify the provider is available">
+    ```bash
+    openclaw models list --provider alibaba
+    ```
+  </Step>
+</Steps>
+
+<Note>
+Any of the accepted auth keys (`MODELSTUDIO_API_KEY`, `DASHSCOPE_API_KEY`, `QWEN_API_KEY`) will work. The `qwen-standard-api-key` onboarding choice configures the shared DashScope credential.
+</Note>

 ## Built-in Wan models

 The bundled `alibaba` provider currently registers:

- `alibaba/wan2.6-t2v`
- `alibaba/wan2.6-i2v`
- `alibaba/wan2.6-r2v`
- `alibaba/wan2.6-r2v-flash`
- `alibaba/wan2.7-r2v`
+| Model ref                  | Mode                      |
+| -------------------------- | ------------------------- |
+| `alibaba/wan2.6-t2v`       | Text-to-video             |
+| `alibaba/wan2.6-i2v`       | Image-to-video            |
+| `alibaba/wan2.6-r2v`       | Reference-to-video        |
+| `alibaba/wan2.6-r2v-flash` | Reference-to-video (fast) |
+| `alibaba/wan2.7-r2v`       | Reference-to-video        |

 ## Current limits

- Up to **1** output video per request
- Up to **1** input image
- Up to **4** input videos
- Up to **10 seconds** duration
- Supports `size`, `aspectRatio`, `resolution`, `audio`, and `watermark`
- Reference image/video mode currently requires **remote http(s) URLs**
+| Parameter             | Limit                                                     |
+| --------------------- | --------------------------------------------------------- |
+| Output videos         | Up to **1** per request                                   |
+| Input images          | Up to **1**                                               |
+| Input videos          | Up to **4**                                               |
+| Duration              | Up to **10 seconds**                                      |
+| Supported controls    | `size`, `aspectRatio`, `resolution`, `audio`, `watermark` |
+| Reference image/video | Remote `http(s)` URLs only                                |

-## Relationship to Qwen
+<Warning>
+Reference image/video mode currently requires **remote http(s) URLs**. Local file paths are not supported for reference inputs.
+</Warning>

-The bundled `qwen` provider also uses Alibaba-hosted DashScope endpoints for
-Wan video generation. Use:
+## Advanced configuration

- `qwen/...` when you want the canonical Qwen provider surface
- `alibaba/...` when you want the direct vendor-owned Wan video surface
+<AccordionGroup>
+  <Accordion title="Relationship to Qwen">
+    The bundled `qwen` provider also uses Alibaba-hosted DashScope endpoints for
+    Wan video generation. Use:
+
+    - `qwen/...` when you want the canonical Qwen provider surface
+    - `alibaba/...` when you want the direct vendor-owned Wan video surface
+
+    See the [Qwen provider docs](/providers/qwen) for more detail.
+
+  </Accordion>
+
+  <Accordion title="Auth key priority">
+    OpenClaw checks for auth keys in this order:
+
+    1. `MODELSTUDIO_API_KEY` (preferred)
+    2. `DASHSCOPE_API_KEY`
+    3. `QWEN_API_KEY`
+
+    Any of these will authenticate the `alibaba` provider.
+
+  </Accordion>
+</AccordionGroup>

 ## Related

- [Video Generation](/tools/video-generation)
- [Qwen](/providers/qwen)
- [Configuration Reference](/gateway/configuration-reference#agent-defaults)
+<CardGroup cols={2}>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="Qwen" href="/providers/qwen" icon="microchip">
+    Qwen provider setup and DashScope integration.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference#agent-defaults" icon="gear">
+    Agent defaults and model configuration.
+  </Card>
+</CardGroup>
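Illustrative note, not part of the patch: the "Auth key priority" accordion in the hunk above lists a fixed lookup order for the `alibaba` provider. A minimal sketch of that order follows. `resolveAlibabaKey` is a hypothetical helper, not OpenClaw's implementation, and treating blank values as unset is an assumption.

```typescript
// Priority order as listed in the "Auth key priority" accordion.
const ALIBABA_KEY_ORDER = [
  "MODELSTUDIO_API_KEY",
  "DASHSCOPE_API_KEY",
  "QWEN_API_KEY",
] as const;

type Env = Record<string, string | undefined>;

interface ResolvedKey {
  source: string; // name of the env var that supplied the key
  value: string;
}

// Hypothetical helper: return the first non-blank key in priority order.
function resolveAlibabaKey(env: Env): ResolvedKey | undefined {
  for (const name of ALIBABA_KEY_ORDER) {
    const value = env[name]?.trim();
    // Assumption: an empty or whitespace-only value counts as unset.
    if (value) {
      return { source: name, value };
    }
  }
  return undefined;
}
```

In a Node.js process the helper could be called with `process.env`. Returning the source name alongside the value makes it easy to report which variable was used.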
--- a/docs/providers/anthropic.md
+++ b/docs/providers/anthropic.md
@@ -7,83 +7,117 @@ title: "Anthropic"

 # Anthropic (Claude)

-Anthropic builds the **Claude** model family and provides access via an API and
-Claude CLI. In OpenClaw, Anthropic API keys and Claude CLI reuse are both
-supported. Existing legacy Anthropic token profiles are still honored at
-runtime if they are already configured.
+Anthropic builds the **Claude** model family. OpenClaw supports two auth routes:
+
+- **API key** — direct Anthropic API access with usage-based billing (`anthropic/*` models)
+- **Claude CLI** — reuse an existing Claude CLI login on the same host

 <Warning>
 Anthropic staff told us OpenClaw-style Claude CLI usage is allowed again, so
-OpenClaw treats Claude CLI reuse and `claude -p` usage as sanctioned for this
-integration unless Anthropic publishes a new policy.
+OpenClaw treats Claude CLI reuse and `claude -p` usage as sanctioned unless
+Anthropic publishes a new policy.

 For long-lived gateway hosts, Anthropic API keys are still the clearest and
-most predictable production path. If you already use Claude CLI on the host,
-OpenClaw can reuse that login directly.
+most predictable production path.

 Anthropic's current public docs:

 - [Claude Code CLI reference](https://code.claude.com/docs/en/cli-reference)
 - [Claude Agent SDK overview](https://platform.claude.com/docs/en/agent-sdk/overview)
-
 - [Using Claude Code with your Pro or Max plan](https://support.claude.com/en/articles/11145838-using-claude-code-with-your-pro-or-max-plan)
 - [Using Claude Code with your Team or Enterprise plan](https://support.anthropic.com/en/articles/11845131-using-claude-code-with-your-team-or-enterprise-plan/)
+  </Warning>

-If you want the clearest billing path, use an Anthropic API key instead.
-OpenClaw also supports other subscription-style options, including [OpenAI
-Codex](/providers/openai), [Qwen Cloud Coding Plan](/providers/qwen),
-[MiniMax Coding Plan](/providers/minimax), and [Z.AI / GLM Coding
-Plan](/providers/glm).
-</Warning>
+## Getting started

-## Option A: Anthropic API key
+<Tabs>
+  <Tab title="API key">
+    **Best for:** standard API access and usage-based billing.

-**Best for:** standard API access and usage-based billing.
-Create your API key in the Anthropic Console.
+    <Steps>
+      <Step title="Get your API key">
+        Create an API key in the [Anthropic Console](https://console.anthropic.com/).
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard
+        # choose: Anthropic API key
+        ```

-### CLI setup
+        Or pass the key directly:

-```bash
-openclaw onboard
-# choose: Anthropic API key
+        ```bash
+        openclaw onboard --anthropic-api-key "$ANTHROPIC_API_KEY"
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider anthropic
+        ```
+      </Step>
+    </Steps>

-# or non-interactive
-openclaw onboard --anthropic-api-key "$ANTHROPIC_API_KEY"
-```
+    ### Config example

-### Anthropic config snippet
+    ```json5
+    {
+      env: { ANTHROPIC_API_KEY: "sk-ant-..." },
+      agents: { defaults: { model: { primary: "anthropic/claude-opus-4-6" } } },
+    }
+    ```

-```json5
-{
-  env: { ANTHROPIC_API_KEY: "sk-ant-..." },
-  agents: { defaults: { model: { primary: "anthropic/claude-opus-4-6" } } },
-}
-```
+  </Tab>
+
+  <Tab title="Claude CLI">
+    **Best for:** reusing an existing Claude CLI login without a separate API key.
+
+    <Steps>
+      <Step title="Ensure Claude CLI is installed and logged in">
+        Confirm the CLI is installed (this checks the install only, not the login state):
+
+        ```bash
+        claude --version
+        ```
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard
+        # choose: Claude CLI
+        ```
+
+        OpenClaw detects and reuses the existing Claude CLI credentials.
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider anthropic
+        ```
+      </Step>
+    </Steps>
+
+    <Note>
+    Setup and runtime details for the Claude CLI backend are in [CLI Backends](/gateway/cli-backends).
+    </Note>
+
+    <Tip>
+    If you want the clearest billing path, use an Anthropic API key instead. OpenClaw also supports subscription-style options from [OpenAI Codex](/providers/openai), [Qwen Cloud](/providers/qwen), [MiniMax](/providers/minimax), and [Z.AI / GLM](/providers/glm).
+    </Tip>
+
+  </Tab>
+</Tabs>

 ## Thinking defaults (Claude 4.6)

- Anthropic Claude 4.6 models default to `adaptive` thinking in OpenClaw when no explicit thinking level is set.
- You can override per-message (`/think:<level>`) or in model params:
-  `agents.defaults.models["anthropic/<model>"].params.thinking`.
- Related Anthropic docs:
-  - [Adaptive thinking](https://platform.claude.com/docs/en/build-with-claude/adaptive-thinking)
-  - [Extended thinking](https://platform.claude.com/docs/en/build-with-claude/extended-thinking)
+Claude 4.6 models default to `adaptive` thinking in OpenClaw when no explicit thinking level is set.

-## Fast mode (Anthropic API)
-
-OpenClaw's shared `/fast` toggle also supports direct public Anthropic traffic, including API-key and OAuth-authenticated requests sent to `api.anthropic.com`.
-
- `/fast on` maps to `service_tier: "auto"`
- `/fast off` maps to `service_tier: "standard_only"`
- Config default:
+Override per-message with `/think:<level>` or in model params:

 ```json5
 {
  agents: {
    defaults: {
      models: {
-        "anthropic/claude-sonnet-4-6": {
-          params: { fastMode: true },
+        "anthropic/claude-opus-4-6": {
+          params: { thinking: "adaptive" },
        },
      },
    },
@@ -91,25 +125,21 @@ OpenClaw's shared `/fast` toggle also supports direct public Anthropic traffic,
 }
 ```

-Important limits:
+<Note>
+Related Anthropic docs:
+- [Adaptive thinking](https://platform.claude.com/docs/en/build-with-claude/adaptive-thinking)
+- [Extended thinking](https://platform.claude.com/docs/en/build-with-claude/extended-thinking)
+</Note>

- OpenClaw only injects Anthropic service tiers for direct `api.anthropic.com` requests. If you route `anthropic/*` through a proxy or gateway, `/fast` leaves `service_tier` untouched.
- Explicit Anthropic `serviceTier` or `service_tier` model params override the `/fast` default when both are set.
- Anthropic reports the effective tier on the response under `usage.service_tier`. On accounts without Priority Tier capacity, `service_tier: "auto"` may still resolve to `standard`.
+## Prompt caching

-## Prompt caching (Anthropic API)
+OpenClaw supports Anthropic's prompt caching feature for API-key auth. Set the `cacheRetention` model param to one of these values:

-OpenClaw supports Anthropic's prompt caching feature. This is **API-only**; legacy Anthropic token auth does not honor cache settings.
-
-### Configuration
-
-Use the `cacheRetention` parameter in your model config:
-
-| Value   | Cache Duration | Description              |
-| ------- | -------------- | ------------------------ |
-| `none`  | No caching     | Disable prompt caching   |
-| `short` | 5 minutes      | Default for API Key auth |
-| `long`  | 1 hour         | Extended cache           |
+| Value               | Cache duration | Description                            |
+| ------------------- | -------------- | -------------------------------------- |
+| `"short"` (default) | 5 minutes      | Applied automatically for API-key auth |
+| `"long"`            | 1 hour         | Extended cache                         |
+| `"none"`            | No caching     | Disable prompt caching                 |

 ```json5
 {
@@ -125,122 +155,156 @@ Use the `cacheRetention` parameter in your model config:
 }
 ```

-### Defaults
+<AccordionGroup>
+  <Accordion title="Per-agent cache overrides">
+    Use model-level params as your baseline, then override specific agents via `agents.list[].params`:

-When using Anthropic API Key authentication, OpenClaw automatically applies `cacheRetention: "short"` (5-minute cache) for all Anthropic models. You can override this by explicitly setting `cacheRetention` in your config.
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "anthropic/claude-opus-4-6" },
+          models: {
+            "anthropic/claude-opus-4-6": {
+              params: { cacheRetention: "long" },
+            },
+          },
+        },
+        list: [
+          { id: "research", default: true },
+          { id: "alerts", params: { cacheRetention: "none" } },
+        ],
+      },
+    }
+    ```

-### Per-agent cacheRetention overrides
+    Config merge order:

-Use model-level params as your baseline, then override specific agents via `agents.list[].params`.
+    1. `agents.defaults.models["provider/model"].params`
+    2. `agents.list[].params` (matching `id`, overrides by key)

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "anthropic/claude-opus-4-6" },
-      models: {
-        "anthropic/claude-opus-4-6": {
-          params: { cacheRetention: "long" }, // baseline for most agents
+    This lets one agent keep a long-lived cache while another agent on the same model disables caching to avoid write costs on bursty/low-reuse traffic.
+
+  </Accordion>
+
+  <Accordion title="Bedrock Claude notes">
+    - Anthropic Claude models on Bedrock (`amazon-bedrock/*anthropic.claude*`) accept `cacheRetention` pass-through when configured.
+    - Non-Anthropic Bedrock models are forced to `cacheRetention: "none"` at runtime.
+    - API-key smart defaults also seed `cacheRetention: "short"` for Claude-on-Bedrock refs when no explicit value is set.
+  </Accordion>
+</AccordionGroup>
+
+## Advanced configuration
+
+<AccordionGroup>
+  <Accordion title="Fast mode">
+    OpenClaw's shared `/fast` toggle supports direct Anthropic traffic, including API-key and OAuth-authenticated requests sent to `api.anthropic.com`.
+
+    | Command     | Maps to                         |
+    | ----------- | ------------------------------- |
+    | `/fast on`  | `service_tier: "auto"`          |
+    | `/fast off` | `service_tier: "standard_only"` |
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "anthropic/claude-sonnet-4-6": {
+              params: { fastMode: true },
+            },
+          },
        },
      },
-    },
-    list: [
-      { id: "research", default: true },
-      { id: "alerts", params: { cacheRetention: "none" } }, // override for this agent only
-    ],
-  },
-}
-```
+    }
+    ```

-Config merge order for cache-related params:
+    <Note>
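+    - OpenClaw only injects Anthropic service tiers for direct `api.anthropic.com` requests. If you route `anthropic/*` through a proxy or gateway, `/fast` leaves `service_tier` untouched.
+    - Explicit `serviceTier` or `service_tier` model params override the `/fast` default when both are set.
+    - Anthropic reports the effective tier on the response under `usage.service_tier`. On accounts without Priority Tier capacity, `service_tier: "auto"` may still resolve to `standard`.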
+    </Note>

-1. `agents.defaults.models["provider/model"].params`
-2. `agents.list[].params` (matching `id`, overrides by key)
+  </Accordion>

-This lets one agent keep a long-lived cache while another agent on the same model disables caching to avoid write costs on bursty/low-reuse traffic.
+  <Accordion title="Media understanding (image and PDF)">
+    The bundled Anthropic plugin registers image and PDF understanding. OpenClaw
+    auto-resolves media capabilities from the configured Anthropic auth — no
+    additional config is needed.

-### Bedrock Claude notes
+    | Property        | Value                 |
+    | --------------- | --------------------- |
+    | Default model   | `claude-opus-4-6`     |
+    | Supported input | Images, PDF documents |

- Anthropic Claude models on Bedrock (`amazon-bedrock/*anthropic.claude*`) accept `cacheRetention` pass-through when configured.
- Non-Anthropic Bedrock models are forced to `cacheRetention: "none"` at runtime.
- Anthropic API-key smart defaults also seed `cacheRetention: "short"` for Claude-on-Bedrock model refs when no explicit value is set.
+    When an image or PDF is attached to a conversation, OpenClaw automatically
+    routes it through the Anthropic media understanding provider.

-## 1M context window (Anthropic beta)
+  </Accordion>

-Anthropic's 1M context window is beta-gated. In OpenClaw, enable it per model
-with `params.context1m: true` for supported Opus/Sonnet models.
+  <Accordion title="1M context window (beta)">
+    Anthropic's 1M context window is beta-gated. Enable it per model with `params.context1m: true` on supported Opus/Sonnet models:

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "anthropic/claude-opus-4-6": {
-          params: { context1m: true },
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "anthropic/claude-opus-4-6": {
+              params: { context1m: true },
+            },
+          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-OpenClaw maps this to `anthropic-beta: context-1m-2025-08-07` on Anthropic
-requests.
+    OpenClaw maps this to `anthropic-beta: context-1m-2025-08-07` on requests.

-This only activates when `params.context1m` is explicitly set to `true` for
-that model.
+    <Warning>
+    Anthropic must allow long-context usage on your credential. Anthropic currently rejects 1M context requests made with legacy token auth (`sk-ant-oat-*`); in that case OpenClaw logs a warning and falls back to the standard context window.
+    </Warning>

-Requirement: Anthropic must allow long-context usage on that credential.
-
-Note: Anthropic currently rejects `context-1m-*` beta requests when using
-legacy Anthropic token auth (`sk-ant-oat-*`). If you configure
-`context1m: true` with that legacy auth mode, OpenClaw logs a warning and
-falls back to the standard context window by skipping the context1m beta
-header while keeping the required OAuth betas.
-
-## Claude CLI backend
-
-The bundled Anthropic `claude-cli` backend is supported in OpenClaw.
-
- Anthropic staff told us this usage is allowed again.
- OpenClaw therefore treats Claude CLI reuse and `claude -p` usage as
-  sanctioned for this integration unless Anthropic publishes a new policy.
- Anthropic API keys remain the clearest production path for always-on gateway
-  hosts and explicit server-side billing control.
- Setup and runtime details are in [/gateway/cli-backends](/gateway/cli-backends).
-
-## Notes
-
- Anthropic's public Claude Code docs still document direct CLI usage such as
-  `claude -p`, and Anthropic staff told us OpenClaw-style Claude CLI usage is
-  allowed again. We are treating that guidance as settled unless Anthropic
-  publishes a new policy change.
- Anthropic setup-token remains available in OpenClaw as a supported token-auth path, but OpenClaw now prefers Claude CLI reuse and `claude -p` when available.
- Auth details + reuse rules are in [/concepts/oauth](/concepts/oauth).
+  </Accordion>
+</AccordionGroup>

 ## Troubleshooting

-**401 errors / token suddenly invalid**
+<AccordionGroup>
+  <Accordion title="401 errors / token suddenly invalid">
+    Anthropic token auth can expire or be revoked. For new setups, migrate to an Anthropic API key.
+  </Accordion>

- Anthropic token auth can expire or be revoked.
- For new setup, migrate to an Anthropic API key.
+  <Accordion title='No API key found for provider "anthropic"'>
+    Auth is **per agent**. New agents don't inherit the main agent's keys. Re-run onboarding for that agent, or configure an API key on the gateway host, then verify with `openclaw models status`.
+  </Accordion>

-**No API key found for provider "anthropic"**
+  <Accordion title='No credentials found for profile "anthropic:default"'>
+    Run `openclaw models status` to see which auth profile is active. Re-run onboarding, or configure an API key for that profile path.
+  </Accordion>

- Auth is **per agent**. New agents don’t inherit the main agent’s keys.
- Re-run onboarding for that agent, or configure an API key on the gateway
-  host, then verify with `openclaw models status`.
+  <Accordion title="No available auth profile (all in cooldown)">
+    Check `openclaw models status --json` for `auth.unusableProfiles`. Anthropic rate-limit cooldowns can be model-scoped, so a sibling Anthropic model may still be usable even when the current one is cooling down. Add another Anthropic profile or wait for the cooldown to expire.
+  </Accordion>
+</AccordionGroup>

-**No credentials found for profile `anthropic:default`**
+<Note>
+More help: [Troubleshooting](/help/troubleshooting) and [FAQ](/help/faq).
+</Note>

- Run `openclaw models status` to see which auth profile is active.
- Re-run onboarding, or configure an API key for that profile path.
+## Related

-**No available auth profile (all in cooldown/unavailable)**
-
- Check `openclaw models status --json` for `auth.unusableProfiles`.
- Anthropic rate-limit cooldowns can be model-scoped, so a sibling Anthropic
-  model may still be usable even when the current one is cooling down.
- Add another Anthropic profile or wait for cooldown.
-
-More: [/gateway/troubleshooting](/gateway/troubleshooting) and [/help/faq](/help/faq).
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="CLI backends" href="/gateway/cli-backends" icon="terminal">
+    Claude CLI backend setup and runtime details.
+  </Card>
+  <Card title="Prompt caching" href="/reference/prompt-caching" icon="database">
+    How prompt caching works across providers.
+  </Card>
+  <Card title="OAuth and auth" href="/gateway/authentication" icon="key">
+    Auth details and credential reuse rules.
+  </Card>
+</CardGroup>
--- a/docs/providers/arcee.md
+++ b/docs/providers/arcee.md
@@ -12,58 +12,89 @@ read_when:

 Arcee AI models can be accessed directly via the Arcee platform or through [OpenRouter](/providers/openrouter).

- Provider: `arcee`
- Auth: `ARCEEAI_API_KEY` (direct) or `OPENROUTER_API_KEY` (via OpenRouter)
- API: OpenAI-compatible
- Base URL: `https://api.arcee.ai/api/v1` (direct) or `https://openrouter.ai/api/v1` (OpenRouter)
+| Property | Value                                                                                 |
+| -------- | ------------------------------------------------------------------------------------- |
+| Provider | `arcee`                                                                               |
+| Auth     | `ARCEEAI_API_KEY` (direct) or `OPENROUTER_API_KEY` (via OpenRouter)                   |
+| API      | OpenAI-compatible                                                                     |
+| Base URL | `https://api.arcee.ai/api/v1` (direct) or `https://openrouter.ai/api/v1` (OpenRouter) |

-## Quick start
+## Getting started

-1. Get an API key from [Arcee AI](https://chat.arcee.ai/) or [OpenRouter](https://openrouter.ai/keys).
+<Tabs>
+  <Tab title="Direct (Arcee platform)">
+    <Steps>
+      <Step title="Get an API key">
+        Create an API key at [Arcee AI](https://chat.arcee.ai/).
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice arceeai-api-key
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "arcee/trinity-large-thinking" },
+            },
+          },
+        }
+        ```
+      </Step>
+    </Steps>
+  </Tab>

-2. Set the API key (recommended: store it for the Gateway):
+  <Tab title="Via OpenRouter">
+    <Steps>
+      <Step title="Get an API key">
+        Create an API key at [OpenRouter](https://openrouter.ai/keys).
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice arceeai-openrouter
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "arcee/trinity-large-thinking" },
+            },
+          },
+        }
+        ```

-```bash
-# Direct (Arcee platform)
-openclaw onboard --auth-choice arceeai-api-key
+        The same model refs work for both direct and OpenRouter setups (for example `arcee/trinity-large-thinking`).
+      </Step>
+    </Steps>

-# Via OpenRouter
-openclaw onboard --auth-choice arceeai-openrouter
-```
+  </Tab>
+</Tabs>

-3. Set a default model:
+## Non-interactive setup

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "arcee/trinity-large-thinking" },
-    },
-  },
-}
-```
+<Tabs>
+  <Tab title="Direct (Arcee platform)">
+    ```bash
+    openclaw onboard --non-interactive \
+      --mode local \
+      --auth-choice arceeai-api-key \
+      --arceeai-api-key "$ARCEEAI_API_KEY"
+    ```
+  </Tab>

-## Non-interactive example
-
-```bash
-# Direct (Arcee platform)
-openclaw onboard --non-interactive \
-  --mode local \
-  --auth-choice arceeai-api-key \
-  --arceeai-api-key "$ARCEEAI_API_KEY"
-
-# Via OpenRouter
-openclaw onboard --non-interactive \
-  --mode local \
-  --auth-choice arceeai-openrouter \
-  --openrouter-api-key "$OPENROUTER_API_KEY"
-```
-
-## Environment note
-
-If the Gateway runs as a daemon (launchd/systemd), make sure `ARCEEAI_API_KEY`
-(or `OPENROUTER_API_KEY`) is available to that process (for example, in
-`~/.openclaw/.env` or via `env.shellEnv`).
+  <Tab title="Via OpenRouter">
+    ```bash
+    openclaw onboard --non-interactive \
+      --mode local \
+      --auth-choice arceeai-openrouter \
+      --openrouter-api-key "$OPENROUTER_API_KEY"
+    ```
+  </Tab>
+</Tabs>

 ## Built-in catalog

@@ -75,13 +106,41 @@ OpenClaw currently ships this bundled Arcee catalog:
 | `arcee/trinity-large-preview`  | Trinity Large Preview  | text  | 128K    | $0.25 / $1.00        | General-purpose; 400B params, 13B active  |
 | `arcee/trinity-mini`           | Trinity Mini 26B       | text  | 128K    | $0.045 / $0.15       | Fast and cost-efficient; function calling |

-The same model refs work for both direct and OpenRouter setups (for example `arcee/trinity-large-thinking`).
-
+<Tip>
 The onboarding preset sets `arcee/trinity-large-thinking` as the default model.
+</Tip>

 ## Supported features

- Streaming
- Tool use / function calling
- Structured output (JSON mode and JSON schema)
- Extended thinking (Trinity Large Thinking)
+| Feature                                       | Supported                    |
+| --------------------------------------------- | ---------------------------- |
+| Streaming                                     | Yes                          |
+| Tool use / function calling                   | Yes                          |
+| Structured output (JSON mode and JSON schema) | Yes                          |
+| Extended thinking                             | Yes (Trinity Large Thinking) |
+
+<AccordionGroup>
+  <Accordion title="Environment note">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `ARCEEAI_API_KEY`
+    (or `OPENROUTER_API_KEY`) is available to that process (for example, in
+    `~/.openclaw/.env` or via `env.shellEnv`).
+  </Accordion>
+
+  <Accordion title="OpenRouter routing">
+    When using Arcee models via OpenRouter, the same `arcee/*` model refs apply.
+    OpenClaw handles routing transparently based on your auth choice. See the
+    [OpenRouter provider docs](/providers/openrouter) for OpenRouter-specific
+    configuration details.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="OpenRouter" href="/providers/openrouter" icon="shuffle">
+    Access Arcee models and many others through a single API key.
+  </Card>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+</CardGroup>
--- a/docs/providers/bedrock-mantle.md
+++ b/docs/providers/bedrock-mantle.md
@@ -13,55 +13,95 @@ the Mantle OpenAI-compatible endpoint. Mantle hosts open-source and
 third-party models (GPT-OSS, Qwen, Kimi, GLM, and similar) through a standard
 `/v1/chat/completions` surface backed by Bedrock infrastructure.

-## What OpenClaw supports
+| Property       | Value                                                                               |
+| -------------- | ----------------------------------------------------------------------------------- |
+| Provider ID    | `amazon-bedrock-mantle`                                                             |
+| API            | `openai-completions` (OpenAI-compatible)                                            |
+| Auth           | Explicit `AWS_BEARER_TOKEN_BEDROCK` or IAM credential-chain bearer-token generation |
+| Default region | `us-east-1` (override with `AWS_REGION` or `AWS_DEFAULT_REGION`)                    |

- Provider: `amazon-bedrock-mantle`
- API: `openai-completions` (OpenAI-compatible)
- Auth: explicit `AWS_BEARER_TOKEN_BEDROCK` or IAM credential-chain bearer-token generation
- Region: `AWS_REGION` or `AWS_DEFAULT_REGION` (default: `us-east-1`)
+## Getting started
+
+Choose your preferred auth method and follow the setup steps.
+
+<Tabs>
+  <Tab title="Explicit bearer token">
+    **Best for:** environments where you already have a Mantle bearer token.
+
+    <Steps>
+      <Step title="Set the bearer token on the gateway host">
+        ```bash
+        export AWS_BEARER_TOKEN_BEDROCK="..."
+        ```
+
+        Optionally set a region (defaults to `us-east-1`):
+
+        ```bash
+        export AWS_REGION="us-west-2"
+        ```
+      </Step>
+      <Step title="Verify models are discovered">
+        ```bash
+        openclaw models list
+        ```
+
+        Discovered models appear under the `amazon-bedrock-mantle` provider. No
+        additional config is required unless you want to override defaults.
+      </Step>
+    </Steps>
+
+  </Tab>
+
+  <Tab title="IAM credentials">
+    **Best for:** using AWS SDK-compatible credentials (shared config, SSO, web identity, instance or task roles).
+
+    <Steps>
+      <Step title="Configure AWS credentials on the gateway host">
+        Any AWS SDK-compatible auth source works:
+
+        ```bash
+        export AWS_PROFILE="default"
+        export AWS_REGION="us-west-2"
+        ```
+      </Step>
+      <Step title="Verify models are discovered">
+        ```bash
+        openclaw models list
+        ```
+
+        OpenClaw generates a Mantle bearer token from the credential chain automatically.
+      </Step>
+    </Steps>
+
+    <Tip>
+    When `AWS_BEARER_TOKEN_BEDROCK` is not set, OpenClaw mints the bearer token for you from the AWS default credential chain, including shared credentials/config profiles, SSO, web identity, and instance or task roles.
+    </Tip>
+
+  </Tab>
+</Tabs>

 ## Automatic model discovery

 When `AWS_BEARER_TOKEN_BEDROCK` is set, OpenClaw uses it directly. Otherwise,
 OpenClaw attempts to generate a Mantle bearer token from the AWS default
-credential chain, including shared credentials/config profiles, SSO, web
-identity, and instance or task roles. It then discovers available Mantle
-models by querying the region's `/v1/models` endpoint. Discovery results are
-cached for 1 hour, and IAM-derived bearer tokens are refreshed hourly.
+credential chain. It then discovers available Mantle models by querying the
+region's `/v1/models` endpoint.

-Supported regions: `us-east-1`, `us-east-2`, `us-west-2`, `ap-northeast-1`,
+| Behavior          | Detail                    |
+| ----------------- | ------------------------- |
+| Discovery cache   | Results cached for 1 hour |
+| IAM token refresh | Hourly                    |
+
+<Note>
+The bearer token is the same `AWS_BEARER_TOKEN_BEDROCK` used by the standard [Amazon Bedrock](/providers/bedrock) provider.
+</Note>
+
+### Supported regions
+
+`us-east-1`, `us-east-2`, `us-west-2`, `ap-northeast-1`,
 `ap-south-1`, `ap-southeast-3`, `eu-central-1`, `eu-west-1`, `eu-west-2`,
 `eu-south-1`, `eu-north-1`, `sa-east-1`.

-## Onboarding
-
-1. Choose one auth path on the **gateway host**:
-
-Explicit bearer token:
-
-```bash
-export AWS_BEARER_TOKEN_BEDROCK="..."
-# Optional (defaults to us-east-1):
-export AWS_REGION="us-west-2"
-```
-
-IAM credentials:
-
-```bash
-# Any AWS SDK-compatible auth source works here, for example:
-export AWS_PROFILE="default"
-export AWS_REGION="us-west-2"
-```
-
-2. Verify models are discovered:
-
-```bash
-openclaw models list
-```
-
-Discovered models appear under the `amazon-bedrock-mantle` provider. No
-additional config is required unless you want to override defaults.
-
 ## Manual configuration

 If you prefer explicit config instead of auto-discovery:
@@ -92,13 +132,46 @@ If you prefer explicit config instead of auto-discovery:
 }
 ```

-## Notes
+## Advanced notes

- OpenClaw can mint the Mantle bearer token for you from AWS SDK-compatible
-  IAM credentials when `AWS_BEARER_TOKEN_BEDROCK` is not set.
- The bearer token is the same `AWS_BEARER_TOKEN_BEDROCK` used by the standard
-  [Amazon Bedrock](/providers/bedrock) provider.
- Reasoning support is inferred from model IDs containing patterns like
-  `thinking`, `reasoner`, or `gpt-oss-120b`.
- If the Mantle endpoint is unavailable or returns no models, the provider is
-  silently skipped.
+<AccordionGroup>
+  <Accordion title="Reasoning support">
+    Reasoning support is inferred from model IDs containing patterns like
+    `thinking`, `reasoner`, or `gpt-oss-120b`. OpenClaw sets `reasoning: true`
+    automatically for matching models during discovery.
+  </Accordion>
+
+  <Accordion title="Endpoint unavailability">
+    If the Mantle endpoint is unavailable or returns no models, the provider is
+    silently skipped. OpenClaw does not error; other configured providers
+    continue to work normally.
+  </Accordion>
+
+  <Accordion title="Relationship to Amazon Bedrock provider">
+    Bedrock Mantle is a separate provider from the standard
+    [Amazon Bedrock](/providers/bedrock) provider. Mantle uses an
+    OpenAI-compatible `/v1` surface, while the standard Bedrock provider uses
+    the native Bedrock API.
+
+    Both providers share the same `AWS_BEARER_TOKEN_BEDROCK` credential when
+    present.
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Amazon Bedrock" href="/providers/bedrock" icon="cloud">
+    Native Bedrock provider for Anthropic Claude, Titan, and other models.
+  </Card>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="OAuth and auth" href="/gateway/authentication" icon="key">
+    Auth details and credential reuse rules.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    Common issues and how to resolve them.
+  </Card>
+</CardGroup>
--- a/docs/providers/bedrock.md
+++ b/docs/providers/bedrock.md
@@ -8,16 +8,130 @@ title: "Amazon Bedrock"

 # Amazon Bedrock

-OpenClaw can use **Amazon Bedrock** models via pi‑ai’s **Bedrock Converse**
+OpenClaw can use **Amazon Bedrock** models via pi-ai's **Bedrock Converse**
 streaming provider. Bedrock auth uses the **AWS SDK default credential chain**,
 not an API key.

-## What pi-ai supports
+| Property | Value                                                       |
+| -------- | ----------------------------------------------------------- |
+| Provider | `amazon-bedrock`                                            |
+| API      | `bedrock-converse-stream`                                   |
+| Auth     | AWS credentials (env vars, shared config, or instance role) |
+| Region   | `AWS_REGION` or `AWS_DEFAULT_REGION` (default: `us-east-1`) |

- Provider: `amazon-bedrock`
- API: `bedrock-converse-stream`
- Auth: AWS credentials (env vars, shared config, or instance role)
- Region: `AWS_REGION` or `AWS_DEFAULT_REGION` (default: `us-east-1`)
+## Getting started
+
+Choose your preferred auth method and follow the setup steps.
+
+<Tabs>
+  <Tab title="Access keys / env vars">
+    **Best for:** developer machines, CI, or hosts where you manage AWS credentials directly.
+
+    <Steps>
+      <Step title="Set AWS credentials on the gateway host">
+        ```bash
+        export AWS_ACCESS_KEY_ID="AKIA..."
+        export AWS_SECRET_ACCESS_KEY="..."
+        export AWS_REGION="us-east-1"
+        # Optional:
+        export AWS_SESSION_TOKEN="..."
+        export AWS_PROFILE="your-profile"
+        # Optional (Bedrock API key/bearer token):
+        export AWS_BEARER_TOKEN_BEDROCK="..."
+        ```
+      </Step>
+      <Step title="Add a Bedrock provider and model to your config">
+        No `apiKey` is required. Configure the provider with `auth: "aws-sdk"`:
+
+        ```json5
+        {
+          models: {
+            providers: {
+              "amazon-bedrock": {
+                baseUrl: "https://bedrock-runtime.us-east-1.amazonaws.com",
+                api: "bedrock-converse-stream",
+                auth: "aws-sdk",
+                models: [
+                  {
+                    id: "us.anthropic.claude-opus-4-6-v1:0",
+                    name: "Claude Opus 4.6 (Bedrock)",
+                    reasoning: true,
+                    input: ["text", "image"],
+                    cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                    contextWindow: 200000,
+                    maxTokens: 8192,
+                  },
+                ],
+              },
+            },
+          },
+          agents: {
+            defaults: {
+              model: { primary: "amazon-bedrock/us.anthropic.claude-opus-4-6-v1:0" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list
+        ```
+      </Step>
+    </Steps>
+
+    <Tip>
+    With env-marker auth (`AWS_ACCESS_KEY_ID`, `AWS_PROFILE`, or `AWS_BEARER_TOKEN_BEDROCK`), OpenClaw auto-enables the implicit Bedrock provider for model discovery without extra config.
+    </Tip>
+
+  </Tab>
+
+  <Tab title="EC2 instance roles (IMDS)">
+    **Best for:** EC2 instances with an IAM role attached, using the instance metadata service for authentication.
+
+    <Steps>
+      <Step title="Enable discovery explicitly">
+        When using IMDS, OpenClaw cannot detect AWS auth from env markers alone, so you must opt in:
+
+        ```bash
+        openclaw config set plugins.entries.amazon-bedrock.config.discovery.enabled true
+        openclaw config set plugins.entries.amazon-bedrock.config.discovery.region us-east-1
+        ```
+      </Step>
+      <Step title="Optionally add an env marker for auto mode">
+        If you also want the env-marker auto-detection path to work (for example, for `openclaw status` surfaces):
+
+        ```bash
+        export AWS_PROFILE=default
+        export AWS_REGION=us-east-1
+        ```
+
+        You do **not** need a fake API key.
+      </Step>
+      <Step title="Verify models are discovered">
+        ```bash
+        openclaw models list
+        ```
+      </Step>
+    </Steps>
+
+    <Warning>
+    The IAM role attached to your EC2 instance must have the following permissions:
+
+    - `bedrock:InvokeModel`
+    - `bedrock:InvokeModelWithResponseStream`
+    - `bedrock:ListFoundationModels` (for automatic discovery)
+    - `bedrock:ListInferenceProfiles` (for inference profile discovery)
+
+    Or attach the managed policy `AmazonBedrockFullAccess`.
+    </Warning>
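+
+    As a minimal sketch, an inline policy granting only the four actions listed above could look like the following. The `Sid` labels are illustrative, and `"Resource": "*"` is an assumption made for brevity; scope the invoke actions to specific model or inference-profile ARNs where your setup allows it.
+
+    ```json
+    {
+      "Version": "2012-10-17",
+      "Statement": [
+        {
+          "Sid": "OpenClawBedrockInvoke",
+          "Effect": "Allow",
+          "Action": [
+            "bedrock:InvokeModel",
+            "bedrock:InvokeModelWithResponseStream"
+          ],
+          "Resource": "*"
+        },
+        {
+          "Sid": "OpenClawBedrockDiscovery",
+          "Effect": "Allow",
+          "Action": [
+            "bedrock:ListFoundationModels",
+            "bedrock:ListInferenceProfiles"
+          ],
+          "Resource": "*"
+        }
+      ]
+    }
+    ```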
+
+    <Note>
+    You only need `AWS_PROFILE=default` if you specifically want an env marker for auto mode or status surfaces. The actual Bedrock runtime auth path uses the AWS SDK default chain, so IMDS instance-role auth works even without env markers.
+    </Note>
+
+  </Tab>
+</Tabs>

 ## Automatic model discovery

@@ -38,127 +152,52 @@ How the implicit provider is enabled:
  shared config, SSO, and IMDS instance-role auth can work even when discovery
  needed `enabled: true` to opt in.

-Config options live under `plugins.entries.amazon-bedrock.config.discovery`:
+<Note>
+For explicit `models.providers["amazon-bedrock"]` entries, OpenClaw can still resolve Bedrock auth early from AWS env markers such as `AWS_BEARER_TOKEN_BEDROCK`, without forcing full runtime auth loading. The actual model-call auth path still uses the AWS SDK default chain.
+</Note>

-```json5
-{
-  plugins: {
-    entries: {
-      "amazon-bedrock": {
-        config: {
-          discovery: {
-            enabled: true,
-            region: "us-east-1",
-            providerFilter: ["anthropic", "amazon"],
-            refreshInterval: 3600,
-            defaultContextWindow: 32000,
-            defaultMaxTokens: 4096,
+<AccordionGroup>
+  <Accordion title="Discovery config options">
+    Config options live under `plugins.entries.amazon-bedrock.config.discovery`:
+
+    ```json5
+    {
+      plugins: {
+        entries: {
+          "amazon-bedrock": {
+            config: {
+              discovery: {
+                enabled: true,
+                region: "us-east-1",
+                providerFilter: ["anthropic", "amazon"],
+                refreshInterval: 3600,
+                defaultContextWindow: 32000,
+                defaultMaxTokens: 4096,
+              },
+            },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-Notes:
+    | Option | Default | Description |
+    | ------ | ------- | ----------- |
+    | `enabled` | auto | In auto mode, OpenClaw only enables the implicit Bedrock provider when it sees a supported AWS env marker. Set `true` to force discovery. |
+    | `region` | `AWS_REGION` / `AWS_DEFAULT_REGION` / `us-east-1` | AWS region used for discovery API calls. |
+    | `providerFilter` | (all) | Limits discovery to matching Bedrock provider names (for example `anthropic`, `amazon`). |
+    | `refreshInterval` | `3600` | Cache duration in seconds. Set to `0` to disable caching. |
+    | `defaultContextWindow` | `32000` | Context window used for discovered models (override if you know your model limits). |
+    | `defaultMaxTokens` | `4096` | Max output tokens used for discovered models (override if you know your model limits). |

- `enabled` defaults to auto mode. In auto mode, OpenClaw only enables the
-  implicit Bedrock provider when it sees a supported AWS env marker.
- `region` defaults to `AWS_REGION` or `AWS_DEFAULT_REGION`, then `us-east-1`.
- `providerFilter` matches Bedrock provider names (for example `anthropic`).
- `refreshInterval` is seconds; set to `0` to disable caching.
- `defaultContextWindow` (default: `32000`) and `defaultMaxTokens` (default: `4096`)
-  are used for discovered models (override if you know your model limits).
- For explicit `models.providers["amazon-bedrock"]` entries, OpenClaw can still
-  resolve Bedrock env-marker auth early from AWS env markers such as
-  `AWS_BEARER_TOKEN_BEDROCK` without forcing full runtime auth loading. The
-  actual model-call auth path still uses the AWS SDK default chain.
-
-## Onboarding
-
-1. Ensure AWS credentials are available on the **gateway host**:
-
-```bash
-export AWS_ACCESS_KEY_ID="AKIA..."
-export AWS_SECRET_ACCESS_KEY="..."
-export AWS_REGION="us-east-1"
-# Optional:
-export AWS_SESSION_TOKEN="..."
-export AWS_PROFILE="your-profile"
-# Optional (Bedrock API key/bearer token):
-export AWS_BEARER_TOKEN_BEDROCK="..."
-```
-
-2. Add a Bedrock provider and model to your config (no `apiKey` required):
-
-```json5
-{
-  models: {
-    providers: {
-      "amazon-bedrock": {
-        baseUrl: "https://bedrock-runtime.us-east-1.amazonaws.com",
-        api: "bedrock-converse-stream",
-        auth: "aws-sdk",
-        models: [
-          {
-            id: "us.anthropic.claude-opus-4-6-v1:0",
-            name: "Claude Opus 4.6 (Bedrock)",
-            reasoning: true,
-            input: ["text", "image"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 200000,
-            maxTokens: 8192,
-          },
-        ],
-      },
-    },
-  },
-  agents: {
-    defaults: {
-      model: { primary: "amazon-bedrock/us.anthropic.claude-opus-4-6-v1:0" },
-    },
-  },
-}
-```
-
-## EC2 Instance Roles
-
-When running OpenClaw on an EC2 instance with an IAM role attached, the AWS SDK
-can use the instance metadata service (IMDS) for authentication. For Bedrock
-model discovery, OpenClaw only auto-enables the implicit provider from AWS env
-markers unless you explicitly set
-`plugins.entries.amazon-bedrock.config.discovery.enabled: true`.
-
-Recommended setup for IMDS-backed hosts:
-
- Set `plugins.entries.amazon-bedrock.config.discovery.enabled` to `true`.
- Set `plugins.entries.amazon-bedrock.config.discovery.region` (or export `AWS_REGION`).
- You do **not** need a fake API key.
- You only need `AWS_PROFILE=default` if you specifically want an env marker
-  for auto mode or status surfaces.
-
-```bash
-# Recommended: explicit discovery enable + region
-openclaw config set plugins.entries.amazon-bedrock.config.discovery.enabled true
-openclaw config set plugins.entries.amazon-bedrock.config.discovery.region us-east-1
-
-# Optional: add an env marker if you want auto mode without explicit enable
-export AWS_PROFILE=default
-export AWS_REGION=us-east-1
-```
-
-**Required IAM permissions** for the EC2 instance role:
-
- `bedrock:InvokeModel`
- `bedrock:InvokeModelWithResponseStream`
- `bedrock:ListFoundationModels` (for automatic discovery)
- `bedrock:ListInferenceProfiles` (for inference profile discovery)
-
-Or attach the managed policy `AmazonBedrockFullAccess`.
+  </Accordion>
+</AccordionGroup>

 ## Quick setup (AWS path)

+This walkthrough creates an IAM role, attaches Bedrock permissions, associates
+the instance profile, and enables OpenClaw discovery on the EC2 host.
+
 ```bash
 # 1. Create IAM role and instance profile
 aws iam create-role --role-name EC2-Bedrock-Access \
@@ -197,106 +236,127 @@ source ~/.bashrc
 openclaw models list
 ```

-## Inference profiles
+## Advanced configuration

-OpenClaw discovers **regional and global inference profiles** alongside
-foundation models. When a profile maps to a known foundation model, the
-profile inherits that model's capabilities (context window, max tokens,
-reasoning, vision) and the correct Bedrock request region is injected
-automatically. This means cross-region Claude profiles work without manual
-provider overrides.
+<AccordionGroup>
+  <Accordion title="Inference profiles">
+    OpenClaw discovers **regional and global inference profiles** alongside
+    foundation models. When a profile maps to a known foundation model, the
+    profile inherits that model's capabilities (context window, max tokens,
+    reasoning, vision) and the correct Bedrock request region is injected
+    automatically. This means cross-region Claude profiles work without manual
+    provider overrides.

-Inference profile IDs look like `us.anthropic.claude-opus-4-6-v1:0` (regional)
-or `anthropic.claude-opus-4-6-v1:0` (global). If the backing model is already
-in the discovery results, the profile inherits its full capability set;
-otherwise safe defaults apply.
+    Inference profile IDs look like `us.anthropic.claude-opus-4-6-v1:0` (regional)
+    or `anthropic.claude-opus-4-6-v1:0` (global). If the backing model is already
+    in the discovery results, the profile inherits its full capability set;
+    otherwise safe defaults apply.

-No extra configuration is needed. As long as discovery is enabled and the IAM
-principal has `bedrock:ListInferenceProfiles`, profiles appear alongside
-foundation models in `openclaw models list`.
+    No extra configuration is needed. As long as discovery is enabled and the IAM
+    principal has `bedrock:ListInferenceProfiles`, profiles appear alongside
+    foundation models in `openclaw models list`.
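+
+    As a hedged sketch, a discovered regional profile can be referenced like any
+    other model ref. This assumes the `amazon-bedrock/<profile-id>` ref format
+    used in the setup steps applies unchanged to discovered profiles:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          // Regional inference profile ID, prefixed with the provider id (assumed format)
+          model: { primary: "amazon-bedrock/us.anthropic.claude-opus-4-6-v1:0" },
+        },
+      },
+    }
+    ```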

-## Notes
+  </Accordion>

- Bedrock requires **model access** enabled in your AWS account/region.
- Automatic discovery needs the `bedrock:ListFoundationModels` and
-  `bedrock:ListInferenceProfiles` permissions.
- If you rely on auto mode, set one of the supported AWS auth env markers on the
-  gateway host. If you prefer IMDS/shared-config auth without env markers, set
-  `plugins.entries.amazon-bedrock.config.discovery.enabled: true`.
- OpenClaw surfaces the credential source in this order: `AWS_BEARER_TOKEN_BEDROCK`,
-  then `AWS_ACCESS_KEY_ID` + `AWS_SECRET_ACCESS_KEY`, then `AWS_PROFILE`, then the
-  default AWS SDK chain.
- Reasoning support depends on the model; check the Bedrock model card for
-  current capabilities.
- If you prefer a managed key flow, you can also place an OpenAI‑compatible
-  proxy in front of Bedrock and configure it as an OpenAI provider instead.
+  <Accordion title="Guardrails">
+    You can apply [Amazon Bedrock Guardrails](https://docs.aws.amazon.com/bedrock/latest/userguide/guardrails.html)
+    to all Bedrock model invocations by adding a `guardrail` object to the
+    `amazon-bedrock` plugin config. Guardrails let you enforce content filtering,
+    topic denial, word filters, sensitive information filters, and contextual
+    grounding checks.

-## Guardrails
-
-You can apply [Amazon Bedrock Guardrails](https://docs.aws.amazon.com/bedrock/latest/userguide/guardrails.html)
-to all Bedrock model invocations by adding a `guardrail` object to the
-`amazon-bedrock` plugin config. Guardrails let you enforce content filtering,
-topic denial, word filters, sensitive information filters, and contextual
-grounding checks.
-
-```json5
-{
-  plugins: {
-    entries: {
-      "amazon-bedrock": {
-        config: {
-          guardrail: {
-            guardrailIdentifier: "abc123", // guardrail ID or full ARN
-            guardrailVersion: "1", // version number or "DRAFT"
-            streamProcessingMode: "sync", // optional: "sync" or "async"
-            trace: "enabled", // optional: "enabled", "disabled", or "enabled_full"
+    ```json5
+    {
+      plugins: {
+        entries: {
+          "amazon-bedrock": {
+            config: {
+              guardrail: {
+                guardrailIdentifier: "abc123", // guardrail ID or full ARN
+                guardrailVersion: "1", // version number or "DRAFT"
+                streamProcessingMode: "sync", // optional: "sync" or "async"
+                trace: "enabled", // optional: "enabled", "disabled", or "enabled_full"
+              },
+            },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

- `guardrailIdentifier` (required) accepts a guardrail ID (e.g. `abc123`) or a
-  full ARN (e.g. `arn:aws:bedrock:us-east-1:123456789012:guardrail/abc123`).
- `guardrailVersion` (required) specifies which published version to use, or
-  `"DRAFT"` for the working draft.
- `streamProcessingMode` (optional) controls whether guardrail evaluation runs
-  synchronously (`"sync"`) or asynchronously (`"async"`) during streaming. If
-  omitted, Bedrock uses its default behavior.
- `trace` (optional) enables guardrail trace output in the API response. Set to
-  `"enabled"` or `"enabled_full"` for debugging; omit or set `"disabled"` for
-  production.
+    | Option | Required | Description |
+    | ------ | -------- | ----------- |
+    | `guardrailIdentifier` | Yes | Guardrail ID (e.g. `abc123`) or full ARN (e.g. `arn:aws:bedrock:us-east-1:123456789012:guardrail/abc123`). |
+    | `guardrailVersion` | Yes | Published version number, or `"DRAFT"` for the working draft. |
+    | `streamProcessingMode` | No | `"sync"` or `"async"` for guardrail evaluation during streaming. If omitted, Bedrock uses its default. |
+    | `trace` | No | `"enabled"` or `"enabled_full"` for debugging; omit or set `"disabled"` for production. |

-The IAM principal used by the gateway must have the `bedrock:ApplyGuardrail`
-permission in addition to the standard invoke permissions.
+    <Warning>
+    The IAM principal used by the gateway must have the `bedrock:ApplyGuardrail` permission in addition to the standard invoke permissions.
+    </Warning>

-## Embeddings for memory search
+  </Accordion>

-Bedrock can also serve as the embedding provider for
-[memory search](/concepts/memory-search). This is configured separately from the
-inference provider — set `agents.defaults.memorySearch.provider` to `"bedrock"`:
+  <Accordion title="Embeddings for memory search">
+    Bedrock can also serve as the embedding provider for
+    [memory search](/concepts/memory-search). This is configured separately from the
+    inference provider -- set `agents.defaults.memorySearch.provider` to `"bedrock"`:

-```json5
-{
-  agents: {
-    defaults: {
-      memorySearch: {
-        provider: "bedrock",
-        model: "amazon.titan-embed-text-v2:0", // default
+    ```json5
+    {
+      agents: {
+        defaults: {
+          memorySearch: {
+            provider: "bedrock",
+            model: "amazon.titan-embed-text-v2:0", // default
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-Bedrock embeddings use the same AWS SDK credential chain as inference (instance
-roles, SSO, access keys, shared config, and web identity). No API key is
-needed. When `provider` is `"auto"`, Bedrock is auto-detected if that
-credential chain resolves successfully.
+    Bedrock embeddings use the same AWS SDK credential chain as inference (instance
+    roles, SSO, access keys, shared config, and web identity). No API key is
+    needed. When `provider` is `"auto"`, Bedrock is auto-detected if that
+    credential chain resolves successfully.

-Supported embedding models include Amazon Titan Embed (v1, v2), Amazon Nova
-Embed, Cohere Embed (v3, v4), and TwelveLabs Marengo. See
-[Memory configuration reference — Bedrock](/reference/memory-config#bedrock-embedding-config)
-for the full model list and dimension options.
+    Supported embedding models include Amazon Titan Embed (v1, v2), Amazon Nova
+    Embed, Cohere Embed (v3, v4), and TwelveLabs Marengo. See
+    [Memory configuration reference -- Bedrock](/reference/memory-config#bedrock-embedding-config)
+    for the full model list and dimension options.
+
+  </Accordion>
+
+  <Accordion title="Notes and caveats">
+    - Bedrock requires **model access** enabled in your AWS account/region.
+    - Automatic discovery needs the `bedrock:ListFoundationModels` and
+      `bedrock:ListInferenceProfiles` permissions.
+    - If you rely on auto mode, set one of the supported AWS auth env markers on the
+      gateway host. If you prefer IMDS/shared-config auth without env markers, set
+      `plugins.entries.amazon-bedrock.config.discovery.enabled: true`.
+    - OpenClaw surfaces the credential source in this order: `AWS_BEARER_TOKEN_BEDROCK`,
+      then `AWS_ACCESS_KEY_ID` + `AWS_SECRET_ACCESS_KEY`, then `AWS_PROFILE`, then the
+      default AWS SDK chain.
+    - Reasoning support depends on the model; check the Bedrock model card for
+      current capabilities.
+    - If you prefer a managed key flow, you can also place an OpenAI-compatible
+      proxy in front of Bedrock and configure it as an OpenAI provider instead.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Memory search" href="/concepts/memory-search" icon="magnifying-glass">
+    Configure Bedrock embeddings for memory search.
+  </Card>
+  <Card title="Memory config reference" href="/reference/memory-config#bedrock-embedding-config" icon="database">
+    Full Bedrock embedding model list and dimension options.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    General troubleshooting and FAQ.
+  </Card>
+</CardGroup>
--- a/docs/providers/chutes.md
+++ b/docs/providers/chutes.md
@@ -13,44 +13,58 @@ read_when:
 OpenAI-compatible API. OpenClaw supports both browser OAuth and direct API-key
 auth for the bundled `chutes` provider.

- Provider: `chutes`
- API: OpenAI-compatible
- Base URL: `https://llm.chutes.ai/v1`
- Auth:
-  - OAuth via `openclaw onboard --auth-choice chutes`
-  - API key via `openclaw onboard --auth-choice chutes-api-key`
-  - Runtime env vars: `CHUTES_API_KEY`, `CHUTES_OAUTH_TOKEN`
+| Property | Value                        |
+| -------- | ---------------------------- |
+| Provider | `chutes`                     |
+| API      | OpenAI-compatible            |
+| Base URL | `https://llm.chutes.ai/v1`   |
+| Auth     | OAuth or API key (see below) |

-## Quick start
+## Getting started

-### OAuth
+<Tabs>
+  <Tab title="OAuth">
+    <Steps>
+      <Step title="Run the OAuth onboarding flow">
+        ```bash
+        openclaw onboard --auth-choice chutes
+        ```
+        OpenClaw launches the browser flow locally, or shows a URL + redirect-paste
+        flow on remote/headless hosts. OAuth tokens auto-refresh through OpenClaw auth
+        profiles.
+      </Step>
+      <Step title="Verify the default model">
+        After onboarding, the default model is set to
+        `chutes/zai-org/GLM-4.7-TEE` and the bundled Chutes catalog is
+        registered.
+      </Step>
+    </Steps>
+  </Tab>
+  <Tab title="API key">
+    <Steps>
+      <Step title="Get an API key">
+        Create a key at
+        [chutes.ai/settings/api-keys](https://chutes.ai/settings/api-keys).
+      </Step>
+      <Step title="Run the API key onboarding flow">
+        ```bash
+        openclaw onboard --auth-choice chutes-api-key
+        ```
+      </Step>
+      <Step title="Verify the default model">
+        After onboarding, the default model is set to
+        `chutes/zai-org/GLM-4.7-TEE` and the bundled Chutes catalog is
+        registered.
+      </Step>
+    </Steps>
+  </Tab>
+</Tabs>

-```bash
-openclaw onboard --auth-choice chutes
-```
-
-OpenClaw launches the browser flow locally, or shows a URL + redirect-paste
-flow on remote/headless hosts. OAuth tokens auto-refresh through OpenClaw auth
-profiles.
-
-Optional OAuth overrides:
-
- `CHUTES_CLIENT_ID`
- `CHUTES_CLIENT_SECRET`
- `CHUTES_OAUTH_REDIRECT_URI`
- `CHUTES_OAUTH_SCOPES`
-
-### API key
-
-```bash
-openclaw onboard --auth-choice chutes-api-key
-```
-
-Get your key at
-[chutes.ai/settings/api-keys](https://chutes.ai/settings/api-keys).
-
-Both auth paths register the bundled Chutes catalog and set the default model
-to `chutes/zai-org/GLM-4.7-TEE`.
+<Note>
+Both auth paths register the bundled Chutes catalog and set the default model to
+`chutes/zai-org/GLM-4.7-TEE`. Runtime environment variables: `CHUTES_API_KEY`,
+`CHUTES_OAUTH_TOKEN`.
+</Note>

 ## Discovery behavior

@@ -60,25 +74,28 @@ back to a bundled static catalog so onboarding and startup still work.

 ## Default aliases

-OpenClaw also registers three convenience aliases for the bundled Chutes
-catalog:
+OpenClaw registers three convenience aliases for the bundled Chutes catalog:

- `chutes-fast` -> `chutes/zai-org/GLM-4.7-FP8`
- `chutes-pro` -> `chutes/deepseek-ai/DeepSeek-V3.2-TEE`
- `chutes-vision` -> `chutes/chutesai/Mistral-Small-3.2-24B-Instruct-2506`
+| Alias           | Target model                                          |
+| --------------- | ----------------------------------------------------- |
+| `chutes-fast`   | `chutes/zai-org/GLM-4.7-FP8`                          |
+| `chutes-pro`    | `chutes/deepseek-ai/DeepSeek-V3.2-TEE`                |
+| `chutes-vision` | `chutes/chutesai/Mistral-Small-3.2-24B-Instruct-2506` |

 ## Built-in starter catalog

-The bundled fallback catalog includes current Chutes refs such as:
+The bundled fallback catalog includes current Chutes refs:

- `chutes/zai-org/GLM-4.7-TEE`
- `chutes/zai-org/GLM-5-TEE`
- `chutes/deepseek-ai/DeepSeek-V3.2-TEE`
- `chutes/deepseek-ai/DeepSeek-R1-0528-TEE`
- `chutes/moonshotai/Kimi-K2.5-TEE`
- `chutes/chutesai/Mistral-Small-3.2-24B-Instruct-2506`
- `chutes/Qwen/Qwen3-Coder-Next-TEE`
- `chutes/openai/gpt-oss-120b-TEE`
+| Model ref                                             |
+| ----------------------------------------------------- |
+| `chutes/zai-org/GLM-4.7-TEE`                          |
+| `chutes/zai-org/GLM-5-TEE`                            |
+| `chutes/deepseek-ai/DeepSeek-V3.2-TEE`                |
+| `chutes/deepseek-ai/DeepSeek-R1-0528-TEE`             |
+| `chutes/moonshotai/Kimi-K2.5-TEE`                     |
+| `chutes/chutesai/Mistral-Small-3.2-24B-Instruct-2506` |
+| `chutes/Qwen/Qwen3-Coder-Next-TEE`                    |
+| `chutes/openai/gpt-oss-120b-TEE`                      |

 ## Config example

@@ -96,8 +113,42 @@ The bundled fallback catalog includes current Chutes refs such as:
 }
 ```

-## Notes
+<AccordionGroup>
+  <Accordion title="OAuth overrides">
+    You can customize the OAuth flow with optional environment variables:

- OAuth help and redirect-app requirements: [Chutes OAuth docs](https://chutes.ai/docs/sign-in-with-chutes/overview)
- API-key and OAuth discovery both use the same `chutes` provider id.
- Chutes models are registered as `chutes/<model-id>`.
+    | Variable | Purpose |
+    | -------- | ------- |
+    | `CHUTES_CLIENT_ID` | Custom OAuth client ID |
+    | `CHUTES_CLIENT_SECRET` | Custom OAuth client secret |
+    | `CHUTES_OAUTH_REDIRECT_URI` | Custom redirect URI |
+    | `CHUTES_OAUTH_SCOPES` | Custom OAuth scopes |
+
+    See the [Chutes OAuth docs](https://chutes.ai/docs/sign-in-with-chutes/overview)
+    for redirect-app requirements and help.
+
+  </Accordion>
+
+  <Accordion title="Notes">
+    - API-key and OAuth discovery both use the same `chutes` provider id.
+    - Chutes models are registered as `chutes/<model-id>`.
+    - If discovery fails at startup, the bundled static catalog is used automatically.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Provider rules, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema including provider settings.
+  </Card>
+  <Card title="Chutes" href="https://chutes.ai" icon="arrow-up-right-from-square">
+    Chutes dashboard and API docs.
+  </Card>
+  <Card title="Chutes API keys" href="https://chutes.ai/settings/api-keys" icon="key">
+    Create and manage Chutes API keys.
+  </Card>
+</CardGroup>
--- a/docs/providers/claude-max-api-proxy.md
+++ b/docs/providers/claude-max-api-proxy.md
@@ -17,7 +17,7 @@ usage outside Claude Code in the past. You must decide for yourself whether to u
 it and verify Anthropic's current terms before relying on it.
 </Warning>

-## Why Use This?
+## Why use this?

 | Approach                | Cost                                                | Best For                                   |
 | ----------------------- | --------------------------------------------------- | ------------------------------------------ |
@@ -26,7 +26,7 @@ it and verify Anthropic's current terms before relying on it.

 If you have a Claude Max subscription and want to use it with OpenAI-compatible tools, this proxy may reduce cost for some workflows. API keys remain the clearer policy path for production use.

-## How It Works
+## How it works

 ```
 Your App → claude-max-api-proxy → Claude Code CLI → Anthropic (via subscription)
@@ -39,71 +39,65 @@ The proxy:
 2. Converts them to Claude Code CLI commands
 3. Returns responses in OpenAI format (streaming supported)

-## Installation
+## Getting started

-```bash
-# Requires Node.js 20+ and Claude Code CLI
-npm install -g claude-max-api-proxy
+<Steps>
+  <Step title="Install the proxy">
+    Requires Node.js 20+ and the Claude Code CLI.

-# Verify Claude CLI is authenticated
-claude --version
-```
+    ```bash
+    npm install -g claude-max-api-proxy

-## Usage
+    # Confirm the Claude CLI is installed (this does not check authentication)
+    claude --version
+    ```

-### Start the server
+  </Step>
+  <Step title="Start the server">
+    ```bash
+    claude-max-api
+    # Server runs at http://localhost:3456
+    ```
+  </Step>
+  <Step title="Test the proxy">
+    ```bash
+    # Health check
+    curl http://localhost:3456/health

-```bash
-claude-max-api
-# Server runs at http://localhost:3456
-```
+    # List models
+    curl http://localhost:3456/v1/models

-### Test it
+    # Chat completion
+    curl http://localhost:3456/v1/chat/completions \
+      -H "Content-Type: application/json" \
+      -d '{
+        "model": "claude-opus-4",
+        "messages": [{"role": "user", "content": "Hello!"}]
+      }'
+    ```

-```bash
-# Health check
-curl http://localhost:3456/health
+  </Step>
+  <Step title="Configure OpenClaw">
+    Point OpenClaw at the proxy as a custom OpenAI-compatible endpoint:

-# List models
-curl http://localhost:3456/v1/models
+    ```json5
+    {
+      env: {
+        OPENAI_API_KEY: "not-needed",
+        OPENAI_BASE_URL: "http://localhost:3456/v1",
+      },
+      agents: {
+        defaults: {
+          model: { primary: "openai/claude-opus-4" },
+        },
+      },
+    }
+    ```

-# Chat completion
-curl http://localhost:3456/v1/chat/completions \
-  -H "Content-Type: application/json" \
-  -d '{
-    "model": "claude-opus-4",
-    "messages": [{"role": "user", "content": "Hello!"}]
-  }'
-```
+  </Step>
+</Steps>

-### With OpenClaw
-
-You can point OpenClaw at the proxy as a custom OpenAI-compatible endpoint:
-
-```json5
-{
-  env: {
-    OPENAI_API_KEY: "not-needed",
-    OPENAI_BASE_URL: "http://localhost:3456/v1",
-  },
-  agents: {
-    defaults: {
-      model: { primary: "openai/claude-opus-4" },
-    },
-  },
-}
-```
-
-This path uses the same proxy-style OpenAI-compatible route as other custom
-`/v1` backends:
-
- native OpenAI-only request shaping does not apply
- no `service_tier`, no Responses `store`, no prompt-cache hints, and no
-  OpenAI reasoning-compat payload shaping
- hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
-  are not injected on the proxy URL
-
-## Available Models
+## Available models

 | Model ID          | Maps To         |
 | ----------------- | --------------- |
@@ -111,38 +105,55 @@ This path uses the same proxy-style OpenAI-compatible route as other custom
 | `claude-sonnet-4` | Claude Sonnet 4 |
 | `claude-haiku-4`  | Claude Haiku 4  |

-## Auto-Start on macOS
+## Advanced

-Create a LaunchAgent to run the proxy automatically:
+<AccordionGroup>
+  <Accordion title="Proxy-style OpenAI-compatible notes">
+    This path uses the same proxy-style OpenAI-compatible route as other custom
+    `/v1` backends:

-```bash
-cat > ~/Library/LaunchAgents/com.claude-max-api.plist << 'EOF'
-<?xml version="1.0" encoding="UTF-8"?>
-<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
-<plist version="1.0">
-<dict>
-  <key>Label</key>
-  <string>com.claude-max-api</string>
-  <key>RunAtLoad</key>
-  <true/>
-  <key>KeepAlive</key>
-  <true/>
-  <key>ProgramArguments</key>
-  <array>
-    <string>/usr/local/bin/node</string>
-    <string>/usr/local/lib/node_modules/claude-max-api-proxy/dist/server/standalone.js</string>
-  </array>
-  <key>EnvironmentVariables</key>
-  <dict>
-    <key>PATH</key>
-    <string>/usr/local/bin:/opt/homebrew/bin:~/.local/bin:/usr/bin:/bin</string>
-  </dict>
-</dict>
-</plist>
-EOF
+    - Native OpenAI-only request shaping does not apply
+    - No `service_tier`, no Responses `store`, no prompt-cache hints, and no
+      OpenAI reasoning-compat payload shaping
+    - Hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
+      are not injected on the proxy URL

-launchctl bootstrap gui/$(id -u) ~/Library/LaunchAgents/com.claude-max-api.plist
-```
+  </Accordion>
+
+  <Accordion title="Auto-start on macOS with LaunchAgent">
+    Create a LaunchAgent to run the proxy automatically:
+
+    ```bash
+    cat > ~/Library/LaunchAgents/com.claude-max-api.plist << 'EOF'
+    <?xml version="1.0" encoding="UTF-8"?>
+    <!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
+    <plist version="1.0">
+    <dict>
+      <key>Label</key>
+      <string>com.claude-max-api</string>
+      <key>RunAtLoad</key>
+      <true/>
+      <key>KeepAlive</key>
+      <true/>
+      <key>ProgramArguments</key>
+      <array>
+        <string>/usr/local/bin/node</string>
+        <string>/usr/local/lib/node_modules/claude-max-api-proxy/dist/server/standalone.js</string>
+      </array>
+      <key>EnvironmentVariables</key>
+      <dict>
+        <key>PATH</key>
+        <string>/usr/local/bin:/opt/homebrew/bin:~/.local/bin:/usr/bin:/bin</string>
+      </dict>
+    </dict>
+    </plist>
+    EOF
+
+    launchctl bootstrap gui/$(id -u) ~/Library/LaunchAgents/com.claude-max-api.plist
+    ```
+
+  </Accordion>
+</AccordionGroup>

 ## Links

@@ -157,7 +168,23 @@ launchctl bootstrap gui/$(id -u) ~/Library/LaunchAgents/com.claude-max-api.plist
 - The proxy runs locally and does not send data to any third-party servers
 - Streaming responses are fully supported

-## See Also
+<Note>
+For native Anthropic integration with Claude CLI or API keys, see [Anthropic provider](/providers/anthropic). For OpenAI/Codex subscriptions, see [OpenAI provider](/providers/openai).
+</Note>

- [Anthropic provider](/providers/anthropic) - Native OpenClaw integration with Claude CLI or API keys
- [OpenAI provider](/providers/openai) - For OpenAI/Codex subscriptions
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Anthropic provider" href="/providers/anthropic" icon="bolt">
+    Native OpenClaw integration with Claude CLI or API keys.
+  </Card>
+  <Card title="OpenAI provider" href="/providers/openai" icon="robot">
+    For OpenAI/Codex subscriptions.
+  </Card>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration" href="/gateway/configuration" icon="gear">
+    Full config reference.
+  </Card>
+</CardGroup>
--- a/docs/providers/cloudflare-ai-gateway.md
+++ b/docs/providers/cloudflare-ai-gateway.md
@@ -10,35 +10,55 @@ read_when:

 Cloudflare AI Gateway sits in front of provider APIs and lets you add analytics, caching, and controls. For Anthropic, OpenClaw uses the Anthropic Messages API through your Gateway endpoint.

- Provider: `cloudflare-ai-gateway`
- Base URL: `https://gateway.ai.cloudflare.com/v1/<account_id>/<gateway_id>/anthropic`
- Default model: `cloudflare-ai-gateway/claude-sonnet-4-5`
- API key: `CLOUDFLARE_AI_GATEWAY_API_KEY` (your provider API key for requests through the Gateway)
+| Property      | Value                                                                                    |
+| ------------- | ---------------------------------------------------------------------------------------- |
+| Provider      | `cloudflare-ai-gateway`                                                                  |
+| Base URL      | `https://gateway.ai.cloudflare.com/v1/<account_id>/<gateway_id>/anthropic`               |
+| Default model | `cloudflare-ai-gateway/claude-sonnet-4-5`                                                |
+| API key       | `CLOUDFLARE_AI_GATEWAY_API_KEY` (your provider API key for requests through the Gateway) |

-For Anthropic models, use your Anthropic API key.
+<Note>
+For Anthropic models routed through Cloudflare AI Gateway, use your **Anthropic API key** as the provider key.
+</Note>

-## Quick start
+## Getting started

-1. Set the provider API key and Gateway details:
+<Steps>
+  <Step title="Set the provider API key and Gateway details">
+    Run onboarding and choose the Cloudflare AI Gateway auth option:

-```bash
-openclaw onboard --auth-choice cloudflare-ai-gateway-api-key
-```
+    ```bash
+    openclaw onboard --auth-choice cloudflare-ai-gateway-api-key
+    ```

-2. Set a default model:
+    This prompts for your account ID, gateway ID, and API key.

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "cloudflare-ai-gateway/claude-sonnet-4-5" },
-    },
-  },
-}
-```
+  </Step>
+  <Step title="Set a default model">
+    Add the model to your OpenClaw config:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "cloudflare-ai-gateway/claude-sonnet-4-5" },
+        },
+      },
+    }
+    ```
+
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider cloudflare-ai-gateway
+    ```
+  </Step>
+</Steps>

 ## Non-interactive example

+For scripted or CI setups, pass all values on the command line:
+
 ```bash
 openclaw onboard --non-interactive \
  --mode local \
@@ -48,24 +68,49 @@ openclaw onboard --non-interactive \
  --cloudflare-ai-gateway-api-key "$CLOUDFLARE_AI_GATEWAY_API_KEY"
 ```

-## Authenticated gateways
+## Advanced configuration

-If you enabled Gateway authentication in Cloudflare, add the `cf-aig-authorization` header (this is in addition to your provider API key).
+<AccordionGroup>
+  <Accordion title="Authenticated gateways">
+    If you enabled Gateway authentication in Cloudflare, add the `cf-aig-authorization` header. This is **in addition to** your provider API key.

-```json5
-{
-  models: {
-    providers: {
-      "cloudflare-ai-gateway": {
-        headers: {
-          "cf-aig-authorization": "Bearer <cloudflare-ai-gateway-token>",
+    ```json5
+    {
+      models: {
+        providers: {
+          "cloudflare-ai-gateway": {
+            headers: {
+              "cf-aig-authorization": "Bearer <cloudflare-ai-gateway-token>",
+            },
+          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-## Environment note
+    <Tip>
+    The `cf-aig-authorization` header authenticates with the Cloudflare Gateway itself, while the provider API key (for example, your Anthropic key) authenticates with the upstream provider.
+    </Tip>

-If the Gateway runs as a daemon (launchd/systemd), make sure `CLOUDFLARE_AI_GATEWAY_API_KEY` is available to that process (for example, in `~/.openclaw/.env` or via `env.shellEnv`).
+  </Accordion>
+
+  <Accordion title="Environment note">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `CLOUDFLARE_AI_GATEWAY_API_KEY` is available to that process.
+
+    <Warning>
+    A key exported only in `~/.profile` is not visible to a launchd/systemd daemon unless that environment is also imported into the service. Set the key in `~/.openclaw/.env` or via `env.shellEnv` so the gateway process can read it.
+    </Warning>
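+
+    A minimal sketch of the `~/.openclaw/.env` approach, assuming the file takes plain `KEY=value` lines (the value shown is a placeholder):
+
+    ```bash
+    # ~/.openclaw/.env (assumed KEY=value format; substitute your provider API key)
+    CLOUDFLARE_AI_GATEWAY_API_KEY="your-key"
+    ```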
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    General troubleshooting and FAQ.
+  </Card>
+</CardGroup>
--- a/docs/providers/comfy.md
+++ b/docs/providers/comfy.md
@@ -9,13 +9,15 @@ read_when:

 # ComfyUI

-OpenClaw ships a bundled `comfy` plugin for workflow-driven ComfyUI runs.
+OpenClaw ships a bundled `comfy` plugin for workflow-driven ComfyUI runs. The plugin is entirely workflow-driven, so OpenClaw does not try to map generic `size`, `aspectRatio`, `resolution`, `durationSeconds`, or TTS-style controls onto your graph.

- Provider: `comfy`
- Models: `comfy/workflow`
- Shared surfaces: `image_generate`, `video_generate`, `music_generate`
- Auth: none for local ComfyUI; `COMFY_API_KEY` or `COMFY_CLOUD_API_KEY` for Comfy Cloud
- API: ComfyUI `/prompt` / `/history` / `/view` and Comfy Cloud `/api/*`
+| Property        | Detail                                                                           |
+| --------------- | -------------------------------------------------------------------------------- |
+| Provider        | `comfy`                                                                          |
+| Models          | `comfy/workflow`                                                                 |
+| Shared surfaces | `image_generate`, `video_generate`, `music_generate`                             |
+| Auth            | None for local ComfyUI; `COMFY_API_KEY` or `COMFY_CLOUD_API_KEY` for Comfy Cloud |
+| API             | ComfyUI `/prompt` / `/history` / `/view` and Comfy Cloud `/api/*`                |

 ## What it supports

@@ -26,14 +28,140 @@ OpenClaw ships a bundled `comfy` plugin for workflow-driven ComfyUI runs.
 - Music or audio generation through the shared `music_generate` tool
 - Output download from a configured node or all matching output nodes

-The bundled plugin is workflow-driven, so OpenClaw does not try to map generic
-`size`, `aspectRatio`, `resolution`, `durationSeconds`, or TTS-style controls
-onto your graph.
+## Getting started

-## Config layout
+Choose between running ComfyUI on your own machine or using Comfy Cloud.

-Comfy supports shared top-level connection settings plus per-capability workflow
-sections:
+<Tabs>
+  <Tab title="Local">
+    **Best for:** running your own ComfyUI instance on your machine or LAN.
+
+    <Steps>
+      <Step title="Start ComfyUI locally">
+        Make sure your local ComfyUI instance is running (defaults to `http://127.0.0.1:8188`).
+      </Step>
+      <Step title="Prepare your workflow JSON">
+        Export or create a ComfyUI workflow JSON file. Note the node IDs for the prompt input node and the output node you want OpenClaw to read from.
+      </Step>
+      <Step title="Configure the provider">
+        Set `mode: "local"` and point at your workflow file. Here is a minimal image example:
+
+        ```json5
+        {
+          models: {
+            providers: {
+              comfy: {
+                mode: "local",
+                baseUrl: "http://127.0.0.1:8188",
+                image: {
+                  workflowPath: "./workflows/flux-api.json",
+                  promptNodeId: "6",
+                  outputNodeId: "9",
+                },
+              },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Set the default model">
+        Point OpenClaw at the `comfy/workflow` model for the capability you configured:
+
+        ```json5
+        {
+          agents: {
+            defaults: {
+              imageGenerationModel: {
+                primary: "comfy/workflow",
+              },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify">
+        ```bash
+        openclaw models list --provider comfy
+        ```
+      </Step>
+    </Steps>
+
+  </Tab>
+
+  <Tab title="Comfy Cloud">
+    **Best for:** running workflows on Comfy Cloud without managing local GPU resources.
+
+    <Steps>
+      <Step title="Get an API key">
+        Sign up at [comfy.org](https://comfy.org) and generate an API key from your account dashboard.
+      </Step>
+      <Step title="Set the API key">
+        Provide your key through one of these methods:
+
+        ```bash
+        # Environment variable (preferred)
+        export COMFY_API_KEY="your-key"
+
+        # Alternative environment variable
+        export COMFY_CLOUD_API_KEY="your-key"
+
+        # Or inline in config
+        openclaw config set models.providers.comfy.apiKey "your-key"
+        ```
+      </Step>
+      <Step title="Prepare your workflow JSON">
+        Export or create a ComfyUI workflow JSON file. Note the node IDs for the prompt input node and the output node.
+      </Step>
+      <Step title="Configure the provider">
+        Set `mode: "cloud"` and point at your workflow file:
+
+        ```json5
+        {
+          models: {
+            providers: {
+              comfy: {
+                mode: "cloud",
+                image: {
+                  workflowPath: "./workflows/flux-api.json",
+                  promptNodeId: "6",
+                  outputNodeId: "9",
+                },
+              },
+            },
+          },
+        }
+        ```
+
+        <Tip>
+        Cloud mode defaults `baseUrl` to `https://cloud.comfy.org`. You only need to set `baseUrl` if you use a custom cloud endpoint.
+        </Tip>
+      </Step>
+      <Step title="Set the default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              imageGenerationModel: {
+                primary: "comfy/workflow",
+              },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify">
+        ```bash
+        openclaw models list --provider comfy
+        ```
+      </Step>
+    </Steps>
+
+  </Tab>
+</Tabs>
+
+## Configuration
+
+Comfy supports shared top-level connection settings plus per-capability workflow sections (`image`, `video`, `music`):

 ```json5
 {
@@ -63,139 +191,164 @@ sections:
 }
 ```

-Shared keys:
+### Shared keys

- `mode`: `local` or `cloud`
- `baseUrl`: defaults to `http://127.0.0.1:8188` for local or `https://cloud.comfy.org` for cloud
- `apiKey`: optional inline key alternative to env vars
- `allowPrivateNetwork`: allow a private/LAN `baseUrl` in cloud mode
+| Key                   | Type                   | Description                                                                           |
+| --------------------- | ---------------------- | ------------------------------------------------------------------------------------- |
+| `mode`                | `"local"` or `"cloud"` | Connection mode.                                                                      |
+| `baseUrl`             | string                 | Defaults to `http://127.0.0.1:8188` for local or `https://cloud.comfy.org` for cloud. |
+| `apiKey`              | string                 | Optional inline key, alternative to `COMFY_API_KEY` / `COMFY_CLOUD_API_KEY` env vars. |
+| `allowPrivateNetwork` | boolean                | Allow a private/LAN `baseUrl` in cloud mode.                                          |

-Per-capability keys under `image`, `video`, or `music`:
+### Per-capability keys

- `workflow` or `workflowPath`: required
- `promptNodeId`: required
- `promptInputName`: defaults to `text`
- `outputNodeId`: optional
- `pollIntervalMs`: optional
- `timeoutMs`: optional
+These keys apply inside the `image`, `video`, or `music` sections:

-Image and video sections also support:
+| Key                          | Required | Default  | Description                                                                  |
+| ---------------------------- | -------- | -------- | ---------------------------------------------------------------------------- |
+| `workflow` or `workflowPath` | Yes      | --       | Path to the ComfyUI workflow JSON file.                                      |
+| `promptNodeId`               | Yes      | --       | Node ID that receives the text prompt.                                       |
+| `promptInputName`            | No       | `"text"` | Input name on the prompt node.                                               |
+| `outputNodeId`               | No       | --       | Node ID to read output from. If omitted, all matching output nodes are used. |
+| `pollIntervalMs`             | No       | --       | Polling interval in milliseconds for job completion.                         |
+| `timeoutMs`                  | No       | --       | Timeout in milliseconds for the workflow run.                                |

- `inputImageNodeId`: required when you pass a reference image
- `inputImageInputName`: defaults to `image`
+The `image` and `video` sections also support:

-## Backward compatibility
+| Key                   | Required                             | Default   | Description                                         |
+| --------------------- | ------------------------------------ | --------- | --------------------------------------------------- |
+| `inputImageNodeId`    | Yes (when passing a reference image) | --        | Node ID that receives the uploaded reference image. |
+| `inputImageInputName` | No                                   | `"image"` | Input name on the image node.                       |
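+
+As an illustration, an `image` section that sets the optional keys might look like the sketch below. The interval and timeout values are arbitrary placeholders, not documented defaults:
+
+```json5
+{
+  models: {
+    providers: {
+      comfy: {
+        image: {
+          workflowPath: "./workflows/flux-api.json",
+          promptNodeId: "6",
+          promptInputName: "text", // the default, shown for clarity
+          outputNodeId: "9",
+          pollIntervalMs: 2000, // placeholder value
+          timeoutMs: 300000, // placeholder value
+        },
+      },
+    },
+  },
+}
+```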

-Existing top-level image config still works:
+## Workflow details

-```json5
-{
-  models: {
-    providers: {
-      comfy: {
-        workflowPath: "./workflows/flux-api.json",
-        promptNodeId: "6",
-        outputNodeId: "9",
-      },
-    },
-  },
-}
-```
+<AccordionGroup>
+  <Accordion title="Image workflows">
+    Set the default image model to `comfy/workflow`:

-OpenClaw treats that legacy shape as the image workflow config.
-
-## Image workflows
-
-Set the default image model:
-
-```json5
-{
-  agents: {
-    defaults: {
-      imageGenerationModel: {
-        primary: "comfy/workflow",
-      },
-    },
-  },
-}
-```
-
-Reference-image editing example:
-
-```json5
-{
-  models: {
-    providers: {
-      comfy: {
-        image: {
-          workflowPath: "./workflows/edit-api.json",
-          promptNodeId: "6",
-          inputImageNodeId: "7",
-          inputImageInputName: "image",
-          outputNodeId: "9",
+    ```json5
+    {
+      agents: {
+        defaults: {
+          imageGenerationModel: {
+            primary: "comfy/workflow",
+          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-## Video workflows
+    **Reference-image editing example:**

-Set the default video model:
+    To enable image editing with an uploaded reference image, add `inputImageNodeId` to your image config:

-```json5
-{
-  agents: {
-    defaults: {
-      videoGenerationModel: {
-        primary: "comfy/workflow",
+    ```json5
+    {
+      models: {
+        providers: {
+          comfy: {
+            image: {
+              workflowPath: "./workflows/edit-api.json",
+              promptNodeId: "6",
+              inputImageNodeId: "7",
+              inputImageInputName: "image",
+              outputNodeId: "9",
+            },
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-Comfy video workflows currently support text-to-video and image-to-video through
-the configured graph. OpenClaw does not pass input videos into Comfy workflows.
+  </Accordion>

-## Music workflows
+  <Accordion title="Video workflows">
+    Set the default video model to `comfy/workflow`:

-The bundled plugin registers a music-generation provider for workflow-defined
-audio or music outputs, surfaced through the shared `music_generate` tool:
+    ```json5
+    {
+      agents: {
+        defaults: {
+          videoGenerationModel: {
+            primary: "comfy/workflow",
+          },
+        },
+      },
+    }
+    ```

-```text
-/tool music_generate prompt="Warm ambient synth loop with soft tape texture"
-```
+    Comfy video workflows support text-to-video and image-to-video through the configured graph.

-Use the `music` config section to point at your audio workflow JSON and output
-node.
+    <Note>
+    OpenClaw does not pass input videos into Comfy workflows. Only text prompts and single reference images are supported as inputs.
+    </Note>
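+
+    For image-to-video, a `video` section might be sketched as follows. The workflow path and node IDs are hypothetical and depend on your exported graph:
+
+    ```json5
+    {
+      models: {
+        providers: {
+          comfy: {
+            video: {
+              workflowPath: "./workflows/video-api.json", // hypothetical path
+              promptNodeId: "6", // hypothetical node ID
+              inputImageNodeId: "7", // node that receives the reference image
+              outputNodeId: "9", // hypothetical node ID
+            },
+          },
+        },
+      },
+    }
+    ```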

-## Comfy Cloud
+  </Accordion>

-Use `mode: "cloud"` plus one of:
+  <Accordion title="Music workflows">
+    The bundled plugin registers a music-generation provider for workflow-defined audio or music outputs, surfaced through the shared `music_generate` tool:

- `COMFY_API_KEY`
- `COMFY_CLOUD_API_KEY`
- `models.providers.comfy.apiKey`
+    ```text
+    /tool music_generate prompt="Warm ambient synth loop with soft tape texture"
+    ```

-Cloud mode still uses the same `image`, `video`, and `music` workflow sections.
+    Use the `music` config section to point at your audio workflow JSON and output node.
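+
+    A sketch of such a section, with a hypothetical workflow path and node IDs:
+
+    ```json5
+    {
+      models: {
+        providers: {
+          comfy: {
+            music: {
+              workflowPath: "./workflows/music-api.json", // hypothetical path
+              promptNodeId: "6", // hypothetical node ID
+              outputNodeId: "9", // hypothetical node ID
+            },
+          },
+        },
+      },
+    }
+    ```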

-## Live tests
+  </Accordion>

-Opt-in live coverage exists for the bundled plugin:
+  <Accordion title="Backward compatibility">
+    Existing top-level image config (without the nested `image` section) still works:

-```bash
-OPENCLAW_LIVE_TEST=1 COMFY_LIVE_TEST=1 pnpm test:live -- extensions/comfy/comfy.live.test.ts
-```
+    ```json5
+    {
+      models: {
+        providers: {
+          comfy: {
+            workflowPath: "./workflows/flux-api.json",
+            promptNodeId: "6",
+            outputNodeId: "9",
+          },
+        },
+      },
+    }
+    ```

-The live test skips individual image, video, or music cases unless the matching
-Comfy workflow section is configured.
+    OpenClaw treats that legacy shape as the image workflow config. You do not need to migrate immediately, but the nested `image` / `video` / `music` sections are recommended for new setups.
+
+    <Tip>
+    If you only use image generation, the legacy flat config and the new nested `image` section are functionally equivalent.
+    </Tip>
+
+  </Accordion>
+
+  <Accordion title="Live tests">
+    Opt-in live coverage exists for the bundled plugin:
+
+    ```bash
+    OPENCLAW_LIVE_TEST=1 COMFY_LIVE_TEST=1 pnpm test:live -- extensions/comfy/comfy.live.test.ts
+    ```
+
+    The live test skips individual image, video, or music cases unless the matching Comfy workflow section is configured.
+
+  </Accordion>
+</AccordionGroup>

 ## Related

- [Image Generation](/tools/image-generation)
- [Video Generation](/tools/video-generation)
- [Music Generation](/tools/music-generation)
- [Provider Directory](/providers/index)
- [Configuration Reference](/gateway/configuration-reference#agent-defaults)
+<CardGroup cols={2}>
+  <Card title="Image Generation" href="/tools/image-generation" icon="image">
+    Image generation tool configuration and usage.
+  </Card>
+  <Card title="Video Generation" href="/tools/video-generation" icon="video">
+    Video generation tool configuration and usage.
+  </Card>
+  <Card title="Music Generation" href="/tools/music-generation" icon="music">
+    Music and audio generation tool setup.
+  </Card>
+  <Card title="Provider Directory" href="/providers/index" icon="layers">
+    Overview of all providers and model refs.
+  </Card>
+  <Card title="Configuration Reference" href="/gateway/configuration-reference#agent-defaults" icon="gear">
+    Full config reference including agent defaults.
+  </Card>
+</CardGroup>
--- a/docs/providers/deepgram.md
+++ b/docs/providers/deepgram.md
@@ -15,79 +15,128 @@ When enabled, OpenClaw uploads the audio file to Deepgram and injects the transc
 into the reply pipeline (`{{Transcript}}` + `[Audio]` block). This is **not streaming**;
 it uses the pre-recorded transcription endpoint.

-Website: [https://deepgram.com](https://deepgram.com)  
-Docs: [https://developers.deepgram.com](https://developers.deepgram.com)
+| Detail        | Value                                                      |
+| ------------- | ---------------------------------------------------------- |
+| Website       | [deepgram.com](https://deepgram.com)                       |
+| Docs          | [developers.deepgram.com](https://developers.deepgram.com) |
+| Auth          | `DEEPGRAM_API_KEY`                                         |
+| Default model | `nova-3`                                                   |

-## Quick start
+## Getting started

-1. Set your API key:
+<Steps>
+  <Step title="Set your API key">
+    Add your Deepgram API key to the environment:

-```
-DEEPGRAM_API_KEY=dg_...
-```
+    ```
+    DEEPGRAM_API_KEY=dg_...
+    ```

-2. Enable the provider:
-
-```json5
-{
-  tools: {
-    media: {
-      audio: {
-        enabled: true,
-        models: [{ provider: "deepgram", model: "nova-3" }],
-      },
-    },
-  },
-}
-```
-
-## Options
-
- `model`: Deepgram model id (default: `nova-3`)
- `language`: language hint (optional)
- `tools.media.audio.providerOptions.deepgram.detect_language`: enable language detection (optional)
- `tools.media.audio.providerOptions.deepgram.punctuate`: enable punctuation (optional)
- `tools.media.audio.providerOptions.deepgram.smart_format`: enable smart formatting (optional)
-
-Example with language:
-
-```json5
-{
-  tools: {
-    media: {
-      audio: {
-        enabled: true,
-        models: [{ provider: "deepgram", model: "nova-3", language: "en" }],
-      },
-    },
-  },
-}
-```
-
-Example with Deepgram options:
-
-```json5
-{
-  tools: {
-    media: {
-      audio: {
-        enabled: true,
-        providerOptions: {
-          deepgram: {
-            detect_language: true,
-            punctuate: true,
-            smart_format: true,
+  </Step>
+  <Step title="Enable the audio provider">
+    ```json5
+    {
+      tools: {
+        media: {
+          audio: {
+            enabled: true,
+            models: [{ provider: "deepgram", model: "nova-3" }],
          },
        },
-        models: [{ provider: "deepgram", model: "nova-3" }],
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Step>
+  <Step title="Send a voice note">
+    Send an audio message through any connected channel. OpenClaw transcribes it
+    via Deepgram and injects the transcript into the reply pipeline.
+  </Step>
+</Steps>
+
+## Configuration options
+
+| Option            | Path                                                         | Description                           |
+| ----------------- | ------------------------------------------------------------ | ------------------------------------- |
+| `model`           | `tools.media.audio.models[].model`                           | Deepgram model id (default: `nova-3`) |
+| `language`        | `tools.media.audio.models[].language`                        | Language hint (optional)              |
+| `detect_language` | `tools.media.audio.providerOptions.deepgram.detect_language` | Enable language detection (optional)  |
+| `punctuate`       | `tools.media.audio.providerOptions.deepgram.punctuate`       | Enable punctuation (optional)         |
+| `smart_format`    | `tools.media.audio.providerOptions.deepgram.smart_format`    | Enable smart formatting (optional)    |
+
+<Tabs>
+  <Tab title="With language hint">
+    ```json5
+    {
+      tools: {
+        media: {
+          audio: {
+            enabled: true,
+            models: [{ provider: "deepgram", model: "nova-3", language: "en" }],
+          },
+        },
+      },
+    }
+    ```
+  </Tab>
+  <Tab title="With Deepgram options">
+    ```json5
+    {
+      tools: {
+        media: {
+          audio: {
+            enabled: true,
+            providerOptions: {
+              deepgram: {
+                detect_language: true,
+                punctuate: true,
+                smart_format: true,
+              },
+            },
+            models: [{ provider: "deepgram", model: "nova-3" }],
+          },
+        },
+      },
+    }
+    ```
+  </Tab>
+</Tabs>

 ## Notes

- Authentication follows the standard provider auth order; `DEEPGRAM_API_KEY` is the simplest path.
- Override endpoints or headers with `tools.media.audio.baseUrl` and `tools.media.audio.headers` when using a proxy.
- Output follows the same audio rules as other providers (size caps, timeouts, transcript injection).
+<AccordionGroup>
+  <Accordion title="Authentication">
+    Authentication follows the standard provider auth order. `DEEPGRAM_API_KEY` is
+    the simplest path.
+  </Accordion>
+  <Accordion title="Proxy and custom endpoints">
+    Override endpoints or headers with `tools.media.audio.baseUrl` and
+    `tools.media.audio.headers` when using a proxy.
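+
+    A minimal sketch, assuming `headers` takes a name-to-value map; the proxy URL and header name are hypothetical:
+
+    ```json5
+    {
+      tools: {
+        media: {
+          audio: {
+            enabled: true,
+            baseUrl: "https://deepgram-proxy.example.com", // hypothetical proxy endpoint
+            headers: {
+              "x-proxy-token": "<token>", // hypothetical header
+            },
+            models: [{ provider: "deepgram", model: "nova-3" }],
+          },
+        },
+      },
+    }
+    ```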
+  </Accordion>
+  <Accordion title="Output behavior">
+    Output follows the same audio rules as other providers (size caps, timeouts,
+    transcript injection).
+  </Accordion>
+</AccordionGroup>
+
+<Note>
+Deepgram transcription is **pre-recorded only** (not real-time streaming). OpenClaw
+uploads the complete audio file and waits for the full transcript before injecting
+it into the conversation.
+</Note>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Media tools" href="/tools/media" icon="photo-film">
+    Audio, image, and video processing pipeline overview.
+  </Card>
+  <Card title="Configuration" href="/configuration" icon="gear">
+    Full config reference including media tool settings.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    Common issues and debugging steps.
+  </Card>
+  <Card title="FAQ" href="/help/faq" icon="circle-question">
+    Frequently asked questions about OpenClaw setup.
+  </Card>
+</CardGroup>
--- a/docs/providers/deepseek.md
+++ b/docs/providers/deepseek.md
@@ -1,4 +1,5 @@
 ---
+title: "DeepSeek"
 summary: "DeepSeek setup (auth + model selection)"
 read_when:
  - You want to use DeepSeek with OpenClaw
@@ -9,37 +10,55 @@ read_when:

 [DeepSeek](https://www.deepseek.com) provides powerful AI models with an OpenAI-compatible API.

- Provider: `deepseek`
- Auth: `DEEPSEEK_API_KEY`
- API: OpenAI-compatible
- Base URL: `https://api.deepseek.com`
+| Property | Value                      |
+| -------- | -------------------------- |
+| Provider | `deepseek`                 |
+| Auth     | `DEEPSEEK_API_KEY`         |
+| API      | OpenAI-compatible          |
+| Base URL | `https://api.deepseek.com` |

-## Quick start
+## Getting started

-Set the API key (recommended: store it for the Gateway):
+<Steps>
+  <Step title="Get your API key">
+    Create an API key at [platform.deepseek.com](https://platform.deepseek.com/api_keys).
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice deepseek-api-key
+    ```

-```bash
-openclaw onboard --auth-choice deepseek-api-key
-```
+    This prompts for your API key and sets `deepseek/deepseek-chat` as the default model.

-This will prompt for your API key and set `deepseek/deepseek-chat` as the default model.
+  </Step>
+  <Step title="Verify models are available">
+    ```bash
+    openclaw models list --provider deepseek
+    ```
+  </Step>
+</Steps>

-## Non-interactive example
+<AccordionGroup>
+  <Accordion title="Non-interactive setup">
+    For scripted or headless installations, pass all flags directly:

-```bash
-openclaw onboard --non-interactive \
-  --mode local \
-  --auth-choice deepseek-api-key \
-  --deepseek-api-key "$DEEPSEEK_API_KEY" \
-  --skip-health \
-  --accept-risk
-```
+    ```bash
+    openclaw onboard --non-interactive \
+      --mode local \
+      --auth-choice deepseek-api-key \
+      --deepseek-api-key "$DEEPSEEK_API_KEY" \
+      --skip-health \
+      --accept-risk
+    ```

-## Environment note
+  </Accordion>
+</AccordionGroup>

+<Warning>
 If the Gateway runs as a daemon (launchd/systemd), make sure `DEEPSEEK_API_KEY`
 is available to that process (for example, in `~/.openclaw/.env` or via
 `env.shellEnv`).
+</Warning>

 ## Built-in catalog

@@ -48,6 +67,30 @@ is available to that process (for example, in `~/.openclaw/.env` or via
 | `deepseek/deepseek-chat`     | DeepSeek Chat     | text  | 131,072 | 8,192      | Default model; DeepSeek V3.2 non-thinking surface |
 | `deepseek/deepseek-reasoner` | DeepSeek Reasoner | text  | 131,072 | 65,536     | Reasoning-enabled V3.2 surface                    |

+<Tip>
 Both bundled models currently advertise streaming usage compatibility in source.
+</Tip>

-Get your API key at [platform.deepseek.com](https://platform.deepseek.com/api_keys).
+## Config example
+
+```json5
+{
+  env: { DEEPSEEK_API_KEY: "sk-..." },
+  agents: {
+    defaults: {
+      model: { primary: "deepseek/deepseek-chat" },
+    },
+  },
+}
+```
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config reference for agents, models, and providers.
+  </Card>
+</CardGroup>
--- a/docs/providers/fal.md
+++ b/docs/providers/fal.md
@@ -11,42 +11,51 @@ read_when:

 OpenClaw ships a bundled `fal` provider for hosted image and video generation.

- Provider: `fal`
- Auth: `FAL_KEY` (canonical; `FAL_API_KEY` also works as a fallback)
- API: fal model endpoints
+| Property | Value                                                         |
+| -------- | ------------------------------------------------------------- |
+| Provider | `fal`                                                         |
+| Auth     | `FAL_KEY` (canonical; `FAL_API_KEY` also works as a fallback) |
+| API      | fal model endpoints                                           |

-## Quick start
+## Getting started

-1. Set the API key:
-
-```bash
-openclaw onboard --auth-choice fal-api-key
-```
-
-2. Set a default image model:
-
-```json5
-{
-  agents: {
-    defaults: {
-      imageGenerationModel: {
-        primary: "fal/fal-ai/flux/dev",
+<Steps>
+  <Step title="Set the API key">
+    ```bash
+    openclaw onboard --auth-choice fal-api-key
+    ```
+  </Step>
+  <Step title="Set a default image model">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          imageGenerationModel: {
+            primary: "fal/fal-ai/flux/dev",
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Step>
+</Steps>

 ## Image generation

 The bundled `fal` image-generation provider defaults to
 `fal/fal-ai/flux/dev`.

- Generate: up to 4 images per request
- Edit mode: enabled, 1 reference image
- Supports `size`, `aspectRatio`, and `resolution`
- Current edit caveat: the fal image edit endpoint does **not** support
-  `aspectRatio` overrides
+| Capability     | Value                      |
+| -------------- | -------------------------- |
+| Max images     | 4 per request              |
+| Edit mode      | Enabled, 1 reference image |
+| Size overrides | Supported                  |
+| Aspect ratio   | Supported                  |
+| Resolution     | Supported                  |
+
+<Warning>
+The fal image edit endpoint does **not** support `aspectRatio` overrides.
+</Warning>

 To use fal as the default image provider:

@@ -67,25 +76,70 @@ To use fal as the default image provider:
 The bundled `fal` video-generation provider defaults to
 `fal/fal-ai/minimax/video-01-live`.

- Modes: text-to-video and single-image reference flows
- Runtime: queue-backed submit/status/result flow for long-running jobs
+| Capability | Value                                                        |
+| ---------- | ------------------------------------------------------------ |
+| Modes      | Text-to-video, single-image reference                        |
+| Runtime    | Queue-backed submit/status/result flow for long-running jobs |

-To use fal as the default video provider:
+<AccordionGroup>
+  <Accordion title="Available video models">
+    **HeyGen video-agent:**

-```json5
-{
-  agents: {
-    defaults: {
-      videoGenerationModel: {
-        primary: "fal/fal-ai/minimax/video-01-live",
+    - `fal/fal-ai/heygen/v2/video-agent`
+
+    **Seedance 2.0:**
+
+    - `fal/bytedance/seedance-2.0/fast/text-to-video`
+    - `fal/bytedance/seedance-2.0/fast/image-to-video`
+    - `fal/bytedance/seedance-2.0/text-to-video`
+    - `fal/bytedance/seedance-2.0/image-to-video`
+
+  </Accordion>
+
+  <Accordion title="Seedance 2.0 config example">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          videoGenerationModel: {
+            primary: "fal/bytedance/seedance-2.0/fast/text-to-video",
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>
+
+  <Accordion title="HeyGen video-agent config example">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          videoGenerationModel: {
+            primary: "fal/fal-ai/heygen/v2/video-agent",
+          },
+        },
+      },
+    }
+    ```
+  </Accordion>
+</AccordionGroup>
+
+<Tip>
+Use `openclaw models list --provider fal` to see the full list of available fal
+models, including any recently added entries.
+</Tip>

 ## Related

- [Image Generation](/tools/image-generation)
- [Video Generation](/tools/video-generation)
- [Configuration Reference](/gateway/configuration-reference#agent-defaults)
+<CardGroup cols={2}>
+  <Card title="Image generation" href="/tools/image-generation" icon="image">
+    Shared image tool parameters and provider selection.
+  </Card>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference#agent-defaults" icon="gear">
+    Agent defaults including image and video model selection.
+  </Card>
+</CardGroup>
--- a/docs/providers/fireworks.md
+++ b/docs/providers/fireworks.md
@@ -1,4 +1,5 @@
 ---
+title: "Fireworks"
 summary: "Fireworks setup (auth + model selection)"
 read_when:
  - You want to use Fireworks with OpenClaw
@@ -7,26 +8,38 @@ read_when:

 # Fireworks

-[Fireworks](https://fireworks.ai) exposes open-weight and routed models through an OpenAI-compatible API. OpenClaw now includes a bundled Fireworks provider plugin.
+[Fireworks](https://fireworks.ai) exposes open-weight and routed models through an OpenAI-compatible API. OpenClaw includes a bundled Fireworks provider plugin.

- Provider: `fireworks`
- Auth: `FIREWORKS_API_KEY`
- API: OpenAI-compatible chat/completions
- Base URL: `https://api.fireworks.ai/inference/v1`
- Default model: `fireworks/accounts/fireworks/routers/kimi-k2p5-turbo`
+| Property      | Value                                                  |
+| ------------- | ------------------------------------------------------ |
+| Provider      | `fireworks`                                            |
+| Auth          | `FIREWORKS_API_KEY`                                    |
+| API           | OpenAI-compatible chat/completions                     |
+| Base URL      | `https://api.fireworks.ai/inference/v1`                |
+| Default model | `fireworks/accounts/fireworks/routers/kimi-k2p5-turbo` |

-## Quick start
+## Getting started

-Set up Fireworks auth through onboarding:
+<Steps>
+  <Step title="Set up Fireworks auth through onboarding">
+    ```bash
+    openclaw onboard --auth-choice fireworks-api-key
+    ```

-```bash
-openclaw onboard --auth-choice fireworks-api-key
-```
+    This stores your Fireworks key in OpenClaw config and sets the Fire Pass starter model as the default.

-This stores your Fireworks key in OpenClaw config and sets the Fire Pass starter model as the default.
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider fireworks
+    ```
+  </Step>
+</Steps>

 ## Non-interactive example

+For scripted or CI setups, pass all values on the command line:
+
 ```bash
 openclaw onboard --non-interactive \
  --mode local \
@@ -36,24 +49,20 @@ openclaw onboard --non-interactive \
  --accept-risk
 ```

-## Environment note
-
-If the Gateway runs outside your interactive shell, make sure `FIREWORKS_API_KEY`
-is available to that process too. A key sitting only in `~/.profile` will not
-help a launchd/systemd daemon unless that environment is imported there as well.
-
 ## Built-in catalog

 | Model ref                                              | Name                        | Input      | Context | Max output | Notes                                      |
 | ------------------------------------------------------ | --------------------------- | ---------- | ------- | ---------- | ------------------------------------------ |
 | `fireworks/accounts/fireworks/routers/kimi-k2p5-turbo` | Kimi K2.5 Turbo (Fire Pass) | text,image | 256,000 | 256,000    | Default bundled starter model on Fireworks |

+<Tip>
+If Fireworks publishes a newer model such as a fresh Qwen or Gemma release, you can switch to it directly by using its Fireworks model id without waiting for a bundled catalog update.
+</Tip>
+
 ## Custom Fireworks model ids

 OpenClaw accepts dynamic Fireworks model ids too. Use the exact model or router id shown by Fireworks and prefix it with `fireworks/`.

-Example:
-
 ```json5
 {
  agents: {
@@ -66,4 +75,34 @@ Example:
 }
 ```

-If Fireworks publishes a newer model such as a fresh Qwen or Gemma release, you can switch to it directly by using its Fireworks model id without waiting for a bundled catalog update.
+<AccordionGroup>
+  <Accordion title="How model id prefixing works">
+    Every Fireworks model ref in OpenClaw starts with `fireworks/` followed by the exact id or router path from the Fireworks platform. For example:
+
+    - Router model: `fireworks/accounts/fireworks/routers/kimi-k2p5-turbo`
+    - Direct model: `fireworks/accounts/fireworks/models/<model-name>`
+
+    OpenClaw strips the `fireworks/` prefix when building the API request and sends the remaining path to the Fireworks endpoint.
+
+  </Accordion>
+
+  <Accordion title="Environment note">
+    If the Gateway runs outside your interactive shell, make sure `FIREWORKS_API_KEY` is available to that process too.
+
+    <Warning>
+    A key sitting only in `~/.profile` will not help a launchd/systemd daemon unless that environment is imported there as well. Set the key in `~/.openclaw/.env` or via `env.shellEnv` to ensure the gateway process can read it.
+    </Warning>
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    General troubleshooting and FAQ.
+  </Card>
+</CardGroup>
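
Note on the fireworks.md change above: the "How model id prefixing works" accordion says OpenClaw strips the `fireworks/` prefix and sends the remaining path to the Fireworks endpoint. A minimal sketch of that rule follows. The function name `fireworks_api_model_id` is hypothetical and this is not OpenClaw's actual implementation; it only mirrors the documented behavior with a shell parameter expansion.

```bash
#!/usr/bin/env bash
# Hypothetical helper (an assumption for illustration, not OpenClaw code):
# turn an OpenClaw model ref into the id that would be sent to Fireworks.
fireworks_api_model_id() {
  local ref="$1"
  case "$ref" in
    fireworks/*)
      # Remove only the leading "fireworks/" segment; later occurrences
      # of "fireworks" inside the path are left untouched.
      printf '%s\n' "${ref#fireworks/}"
      ;;
    *)
      echo "not a fireworks model ref: $ref" >&2
      return 1
      ;;
  esac
}

fireworks_api_model_id "fireworks/accounts/fireworks/routers/kimi-k2p5-turbo"
# prints: accounts/fireworks/routers/kimi-k2p5-turbo
```

Only the first path segment is removed, which is why the router path keeps its own `accounts/fireworks/...` portion.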
--- a/docs/providers/github-copilot.md
+++ b/docs/providers/github-copilot.md
@@ -8,73 +8,124 @@ title: "GitHub Copilot"

 # GitHub Copilot

-## What is GitHub Copilot?
-
 GitHub Copilot is GitHub's AI coding assistant. It provides access to Copilot
 models for your GitHub account and plan. OpenClaw can use Copilot as a model
 provider in two different ways.

 ## Two ways to use Copilot in OpenClaw

-### 1) Built-in GitHub Copilot provider (`github-copilot`)
+<Tabs>
+  <Tab title="Built-in provider (github-copilot)">
+    Use the native device-login flow to obtain a GitHub token, then exchange it for
+    Copilot API tokens when OpenClaw runs. This is the **default** and simplest path
+    because it does not require VS Code.

-Use the native device-login flow to obtain a GitHub token, then exchange it for
-Copilot API tokens when OpenClaw runs. This is the **default** and simplest path
-because it does not require VS Code.
+    <Steps>
+      <Step title="Run the login command">
+        ```bash
+        openclaw models auth login-github-copilot
+        ```

-### 2) Copilot Proxy plugin (`copilot-proxy`)
+        You will be prompted to visit a URL and enter a one-time code. Keep the
+        terminal open until it completes.
+      </Step>
+      <Step title="Set a default model">
+        ```bash
+        openclaw models set github-copilot/gpt-4o
+        ```

-Use the **Copilot Proxy** VS Code extension as a local bridge. OpenClaw talks to
-the proxy’s `/v1` endpoint and uses the model list you configure there. Choose
-this when you already run Copilot Proxy in VS Code or need to route through it.
-You must enable the plugin and keep the VS Code extension running.
+        Or in config:

-Use GitHub Copilot as a model provider (`github-copilot`). The login command runs
-the GitHub device flow, saves an auth profile, and updates your config to use that
-profile.
+        ```json5
+        {
+          agents: { defaults: { model: { primary: "github-copilot/gpt-4o" } } },
+        }
+        ```
+      </Step>
+    </Steps>

-## CLI setup
-
-```bash
-openclaw models auth login-github-copilot
-```
-
-You'll be prompted to visit a URL and enter a one-time code. Keep the terminal
-open until it completes.
-
-### Optional flags
+  </Tab>
+
+  <Tab title="Copilot Proxy plugin (copilot-proxy)">
+    Use the **Copilot Proxy** VS Code extension as a local bridge. OpenClaw talks to
+    the proxy's `/v1` endpoint and uses the model list you configure there.
+
+    <Note>
+    Choose this when you already run Copilot Proxy in VS Code or need to route
+    through it. You must enable the plugin and keep the VS Code extension running.
+    </Note>
+
+  </Tab>
+</Tabs>
+
+## Optional flags
+
+| Flag            | Description                                         |
+| --------------- | --------------------------------------------------- |
+| `--yes`         | Skip the confirmation prompt                        |
+| `--set-default` | Also apply the provider's recommended default model |

 ```bash
+# Skip confirmation
 openclaw models auth login-github-copilot --yes
-```

-To also apply the provider's recommended default model in one step, use the
-generic auth command instead:
-
-```bash
+# Login and set the default model in one step
 openclaw models auth login --provider github-copilot --method device --set-default
 ```

-## Set a default model
+<AccordionGroup>
+  <Accordion title="Interactive TTY required">
+    The device-login flow requires an interactive TTY. Run it directly in a
+    terminal, not in a non-interactive script or CI pipeline.
+  </Accordion>

-```bash
-openclaw models set github-copilot/gpt-4o
-```
+  <Accordion title="Model availability depends on your plan">
+    Copilot model availability depends on your GitHub plan. If a model is
+    rejected, try another ID (for example `github-copilot/gpt-4.1`).
+  </Accordion>

-### Config snippet
+  <Accordion title="Transport selection">
+    Claude model IDs use the Anthropic Messages transport automatically. GPT,
+    o-series, and Gemini models keep the OpenAI Responses transport. OpenClaw
+    selects the correct transport based on the model ref.
+  </Accordion>

-```json5
-{
-  agents: { defaults: { model: { primary: "github-copilot/gpt-4o" } } },
-}
-```
+  <Accordion title="Environment variable resolution order">
+    OpenClaw resolves Copilot auth from environment variables in the following
+    priority order:

-## Notes
+    | Priority | Variable               | Notes                              |
+    | -------- | ---------------------- | ---------------------------------- |
+    | 1        | `COPILOT_GITHUB_TOKEN` | Highest priority, Copilot-specific |
+    | 2        | `GH_TOKEN`             | GitHub CLI token (fallback)        |
+    | 3        | `GITHUB_TOKEN`         | Standard GitHub token (lowest)     |

- Requires an interactive TTY; run it directly in a terminal.
- Copilot model availability depends on your plan; if a model is rejected, try
-  another ID (for example `github-copilot/gpt-4.1`).
- Claude model IDs use the Anthropic Messages transport automatically; GPT, o-series,
-  and Gemini models keep the OpenAI Responses transport.
- The login stores a GitHub token in the auth profile store and exchanges it for a
-  Copilot API token when OpenClaw runs.
+    When multiple variables are set, OpenClaw uses the highest-priority one.
+    The device-login flow (`openclaw models auth login-github-copilot`) stores
+    its token in the auth profile store and takes precedence over all environment
+    variables.
+
+  </Accordion>
+
+  <Accordion title="Token storage">
+    The login stores a GitHub token in the auth profile store and exchanges it
+    for a Copilot API token when OpenClaw runs. You do not need to manage the
+    token manually.
+  </Accordion>
+</AccordionGroup>
+
+<Warning>
+The device-login flow requires an interactive TTY. Run the login command directly
+in a terminal, not inside a headless script or CI job.
+</Warning>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="OAuth and auth" href="/gateway/authentication" icon="key">
+    Auth details and credential reuse rules.
+  </Card>
+</CardGroup>
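
Note on the github-copilot.md change above: the new accordions describe two selection rules, the credential priority order (auth profile store, then `COPILOT_GITHUB_TOKEN`, `GH_TOKEN`, `GITHUB_TOKEN`) and the transport choice by model family. The sketch below restates both under stated assumptions. The function names, the `claude-*` pattern used to recognize Claude model IDs, and the transport labels `anthropic-messages` and `openai-responses` are all hypothetical; OpenClaw's real logic may differ.

```bash
#!/usr/bin/env bash
# Hypothetical helpers (assumptions for illustration, not OpenClaw code).

# Print the first credential found, following the documented priority.
# $1 is an optional token from the auth profile store, which the docs
# say takes precedence over all environment variables.
resolve_copilot_token() {
  local profile_token="${1:-}"
  if [ -n "$profile_token" ]; then
    printf '%s\n' "$profile_token"
  elif [ -n "${COPILOT_GITHUB_TOKEN:-}" ]; then
    printf '%s\n' "$COPILOT_GITHUB_TOKEN"
  elif [ -n "${GH_TOKEN:-}" ]; then
    printf '%s\n' "$GH_TOKEN"
  elif [ -n "${GITHUB_TOKEN:-}" ]; then
    printf '%s\n' "$GITHUB_TOKEN"
  else
    return 1  # no credential available
  fi
}

# Map a model ref to a transport label. Claude IDs are assumed to start
# with "claude-"; everything else (GPT, o-series, Gemini) keeps the
# OpenAI Responses transport, as the docs describe.
pick_copilot_transport() {
  local model_id="${1#github-copilot/}"
  case "$model_id" in
    claude-*) printf 'anthropic-messages\n' ;;
    *)        printf 'openai-responses\n' ;;
  esac
}

pick_copilot_transport "github-copilot/gpt-4o"
# prints: openai-responses
```

Taking the profile token as an explicit argument keeps the sketch independent of how the auth profile store is actually read.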
--- a/docs/providers/glm.md
+++ b/docs/providers/glm.md
@@ -3,7 +3,7 @@ summary: "GLM model family overview + how to use it in OpenClaw"
 read_when:
  - You want GLM models in OpenClaw
  - You need the model naming convention and setup
-title: "GLM Models"
+title: "GLM (Zhipu)"
 ---

 # GLM models
@@ -11,26 +11,42 @@ title: "GLM Models"
 GLM is a **model family** (not a company) available through the Z.AI platform. In OpenClaw, GLM
 models are accessed via the `zai` provider and model IDs like `zai/glm-5`.

-## CLI setup
+## Getting started

-```bash
-# Generic API-key setup with endpoint auto-detection
-openclaw onboard --auth-choice zai-api-key
+<Steps>
+  <Step title="Choose an auth route and run onboarding">
+    Pick the onboarding choice that matches your Z.AI plan and region:

-# Coding Plan Global, recommended for Coding Plan users
-openclaw onboard --auth-choice zai-coding-global
+    | Auth choice         | Best for                                           |
+    | ------------------- | -------------------------------------------------- |
+    | `zai-api-key`       | Generic API-key setup with endpoint auto-detection |
+    | `zai-coding-global` | Coding Plan users (global)                         |
+    | `zai-coding-cn`     | Coding Plan users (China region)                   |
+    | `zai-global`        | General API (global)                               |
+    | `zai-cn`            | General API (China region)                         |

-# Coding Plan CN (China region), recommended for Coding Plan users
-openclaw onboard --auth-choice zai-coding-cn
+    ```bash
+    # Example: generic auto-detect
+    openclaw onboard --auth-choice zai-api-key

-# General API
-openclaw onboard --auth-choice zai-global
+    # Example: Coding Plan global
+    openclaw onboard --auth-choice zai-coding-global
+    ```

-# General API CN (China region)
-openclaw onboard --auth-choice zai-cn
-```
+  </Step>
+  <Step title="Set GLM as the default model">
+    ```bash
+    openclaw config set agents.defaults.model.primary "zai/glm-5.1"
+    ```
+  </Step>
+  <Step title="Verify models are available">
+    ```bash
+    openclaw models list --provider zai
+    ```
+  </Step>
+</Steps>

-## Config snippet
+## Config example

 ```json5
 {
@@ -39,30 +55,56 @@ openclaw onboard --auth-choice zai-cn
 }
 ```

+<Tip>
 `zai-api-key` lets OpenClaw detect the matching Z.AI endpoint from the key and
 apply the correct base URL automatically. Use the explicit regional choices when
 you want to force a specific Coding Plan or general API surface.
+</Tip>

-## Current bundled GLM models
+## Bundled GLM models

 OpenClaw currently seeds the bundled `zai` provider with these GLM refs:

- `glm-5.1`
- `glm-5`
- `glm-5-turbo`
- `glm-5v-turbo`
- `glm-4.7`
- `glm-4.7-flash`
- `glm-4.7-flashx`
- `glm-4.6`
- `glm-4.6v`
- `glm-4.5`
- `glm-4.5-air`
- `glm-4.5-flash`
- `glm-4.5v`
+| Model           | Model            |
+| --------------- | ---------------- |
+| `glm-5.1`       | `glm-4.7`        |
+| `glm-5`         | `glm-4.7-flash`  |
+| `glm-5-turbo`   | `glm-4.7-flashx` |
+| `glm-5v-turbo`  | `glm-4.6`        |
+| `glm-4.5`       | `glm-4.6v`       |
+| `glm-4.5-air`   |                  |
+| `glm-4.5-flash` |                  |
+| `glm-4.5v`      |                  |

-## Notes
+<Note>
+The default bundled model ref is `zai/glm-5.1`. GLM versions and availability
+can change; check Z.AI's docs for the latest.
+</Note>

- GLM versions and availability can change; check Z.AI's docs for the latest.
- Default bundled model ref is `zai/glm-5.1`.
- For provider details, see [/providers/zai](/providers/zai).
+## Advanced notes
+
+<AccordionGroup>
+  <Accordion title="Endpoint auto-detection">
+    When you use the `zai-api-key` auth choice, OpenClaw inspects the key format
+    to determine the correct Z.AI base URL. Explicit regional choices
+    (`zai-coding-global`, `zai-coding-cn`, `zai-global`, `zai-cn`) override
+    auto-detection and pin the endpoint directly.
+  </Accordion>
+
+  <Accordion title="Provider details">
+    GLM models are served by the `zai` runtime provider. For full provider
+    configuration, regional endpoints, and additional capabilities, see
+    [Z.AI provider docs](/providers/zai).
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Z.AI provider" href="/providers/zai" icon="server">
+    Full Z.AI provider configuration and regional endpoints.
+  </Card>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+</CardGroup>
--- a/docs/providers/google.md
+++ b/docs/providers/google.md
@@ -17,74 +17,114 @@ Gemini Grounding.
 - API: Google Gemini API
 - Alternative provider: `google-gemini-cli` (OAuth)

-## Quick start
+## Getting started

-1. Set the API key:
+Choose your preferred auth method and follow the setup steps.

-```bash
-openclaw onboard --auth-choice gemini-api-key
-```
+<Tabs>
+  <Tab title="API key">
+    **Best for:** standard Gemini API access through Google AI Studio.

-2. Set a default model:
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice gemini-api-key
+        ```

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "google/gemini-3.1-pro-preview" },
-    },
-  },
-}
-```
+        Or pass the key directly:

-## Non-interactive example
+        ```bash
+        openclaw onboard --non-interactive \
+          --mode local \
+          --auth-choice gemini-api-key \
+          --gemini-api-key "$GEMINI_API_KEY"
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "google/gemini-3.1-pro-preview" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider google
+        ```
+      </Step>
+    </Steps>

-```bash
-openclaw onboard --non-interactive \
-  --mode local \
-  --auth-choice gemini-api-key \
-  --gemini-api-key "$GEMINI_API_KEY"
-```
+    <Tip>
+    The environment variables `GEMINI_API_KEY` and `GOOGLE_API_KEY` are both accepted. Use whichever you already have configured.
+    </Tip>

-## OAuth (Gemini CLI)
+  </Tab>

-An alternative provider `google-gemini-cli` uses PKCE OAuth instead of an API
-key. This is an unofficial integration; some users report account
-restrictions. Use at your own risk.
+  <Tab title="Gemini CLI (OAuth)">
+    **Best for:** reusing an existing Gemini CLI login via PKCE OAuth instead of a separate API key.

- Default model: `google-gemini-cli/gemini-3-flash-preview`
- Alias: `gemini-cli`
- Install prerequisite: local Gemini CLI available as `gemini`
-  - Homebrew: `brew install gemini-cli`
-  - npm: `npm install -g @google/gemini-cli`
- Login:
+    <Warning>
+    The `google-gemini-cli` provider is an unofficial integration. Some users
+    report account restrictions when using OAuth this way. Use at your own risk.
+    </Warning>

-```bash
-openclaw models auth login --provider google-gemini-cli --set-default
-```
+    <Steps>
+      <Step title="Install the Gemini CLI">
+        The local `gemini` command must be available on `PATH`.

-Environment variables:
+        ```bash
+        # Homebrew
+        brew install gemini-cli

- `OPENCLAW_GEMINI_OAUTH_CLIENT_ID`
- `OPENCLAW_GEMINI_OAUTH_CLIENT_SECRET`
+        # or npm
+        npm install -g @google/gemini-cli
+        ```

-(Or the `GEMINI_CLI_*` variants.)
+        OpenClaw supports both Homebrew installs and global npm installs, including
+        common Windows/npm layouts.
+      </Step>
+      <Step title="Log in via OAuth">
+        ```bash
+        openclaw models auth login --provider google-gemini-cli --set-default
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider google-gemini-cli
+        ```
+      </Step>
+    </Steps>

-If Gemini CLI OAuth requests fail after login, set
-`GOOGLE_CLOUD_PROJECT` or `GOOGLE_CLOUD_PROJECT_ID` on the gateway host and
-retry.
+    - Default model: `google-gemini-cli/gemini-3-flash-preview`
+    - Alias: `gemini-cli`

-If login fails before the browser flow starts, make sure the local `gemini`
-command is installed and on `PATH`. OpenClaw supports both Homebrew installs
-and global npm installs, including common Windows/npm layouts.
+    **Environment variables:**

-Gemini CLI JSON usage notes:
+    - `OPENCLAW_GEMINI_OAUTH_CLIENT_ID`
+    - `OPENCLAW_GEMINI_OAUTH_CLIENT_SECRET`

- Reply text comes from the CLI JSON `response` field.
- Usage falls back to `stats` when the CLI leaves `usage` empty.
- `stats.cached` is normalized into OpenClaw `cacheRead`.
- If `stats.input` is missing, OpenClaw derives input tokens from
-  `stats.input_tokens - stats.cached`.
+    (Or the `GEMINI_CLI_*` variants.)
+
+    <Note>
+    If Gemini CLI OAuth requests fail after login, set `GOOGLE_CLOUD_PROJECT` or
+    `GOOGLE_CLOUD_PROJECT_ID` on the gateway host and retry.
+    </Note>
+
+    <Note>
+    If login fails before the browser flow starts, make sure the local `gemini`
+    command is installed and on `PATH`.
+    </Note>
+
+    The OAuth-only `google-gemini-cli` provider is a separate text-inference
+    surface. Image generation, media understanding, and Gemini Grounding stay on
+    the `google` provider id.
+
+  </Tab>
+</Tabs>

 ## Capabilities

@@ -100,37 +140,12 @@ Gemini CLI JSON usage notes:
 | Thinking/reasoning     | Yes (Gemini 3.1+) |
 | Gemma 4 models         | Yes               |

-Gemma 4 models (for example `gemma-4-26b-a4b-it`) support thinking mode. OpenClaw rewrites `thinkingBudget` to a supported Google `thinkingLevel` for Gemma 4. Setting thinking to `off` preserves thinking disabled instead of mapping to `MINIMAL`.
-
-## Direct Gemini cache reuse
-
-For direct Gemini API runs (`api: "google-generative-ai"`), OpenClaw now
-passes a configured `cachedContent` handle through to Gemini requests.
-
- Configure per-model or global params with either
-  `cachedContent` or legacy `cached_content`
- If both are present, `cachedContent` wins
- Example value: `cachedContents/prebuilt-context`
- Gemini cache-hit usage is normalized into OpenClaw `cacheRead` from
-  upstream `cachedContentTokenCount`
-
-Example:
-
-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "google/gemini-2.5-pro": {
-          params: {
-            cachedContent: "cachedContents/prebuilt-context",
-          },
-        },
-      },
-    },
-  },
-}
-```
+<Tip>
+Gemma 4 models (for example `gemma-4-26b-a4b-it`) support thinking mode. OpenClaw
+rewrites `thinkingBudget` to a supported Google `thinkingLevel` for Gemma 4.
+Setting thinking to `off` keeps thinking disabled instead of mapping it to
+`MINIMAL`.
+</Tip>

 ## Image generation

@@ -142,10 +157,6 @@ The bundled `google` image-generation provider defaults to
 - Edit mode: enabled, up to 5 input images
 - Geometry controls: `size`, `aspectRatio`, and `resolution`

-The OAuth-only `google-gemini-cli` provider is a separate text-inference
-surface. Image generation, media understanding, and Gemini Grounding stay on
-the `google` provider id.
-
 To use Google as the default image provider:

 ```json5
@@ -160,8 +171,9 @@ To use Google as the default image provider:
 }
 ```

-See [Image Generation](/tools/image-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Image Generation](/tools/image-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

 ## Video generation

@@ -187,8 +199,9 @@ To use Google as the default video provider:
 }
 ```

-See [Video Generation](/tools/video-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Video Generation](/tools/video-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

 ## Music generation

@@ -216,11 +229,74 @@ To use Google as the default music provider:
 }
 ```

-See [Music Generation](/tools/music-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Music Generation](/tools/music-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

-## Environment note
+## Advanced configuration

-If the Gateway runs as a daemon (launchd/systemd), make sure `GEMINI_API_KEY`
-is available to that process (for example, in `~/.openclaw/.env` or via
-`env.shellEnv`).
+<AccordionGroup>
+  <Accordion title="Direct Gemini cache reuse">
+    For direct Gemini API runs (`api: "google-generative-ai"`), OpenClaw
+    passes a configured `cachedContent` handle through to Gemini requests.
+
+    - Configure per-model or global params with either
+      `cachedContent` or legacy `cached_content`
+    - If both are present, `cachedContent` wins
+    - Example value: `cachedContents/prebuilt-context`
+    - Gemini cache-hit usage is normalized into OpenClaw `cacheRead` from
+      upstream `cachedContentTokenCount`
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "google/gemini-2.5-pro": {
+              params: {
+                cachedContent: "cachedContents/prebuilt-context",
+              },
+            },
+          },
+        },
+      },
+    }
+    ```
+
+  </Accordion>
+
+  <Accordion title="Gemini CLI JSON usage notes">
+    When using the `google-gemini-cli` OAuth provider, OpenClaw normalizes
+    the CLI JSON output as follows:
+
+    - Reply text comes from the CLI JSON `response` field.
+    - Usage falls back to `stats` when the CLI leaves `usage` empty.
+    - `stats.cached` is normalized into OpenClaw `cacheRead`.
+    - If `stats.input` is missing, OpenClaw derives input tokens from
+      `stats.input_tokens - stats.cached`.
+
+  </Accordion>
+
+  <Accordion title="Environment and daemon setup">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `GEMINI_API_KEY`
+    is available to that process (for example, in `~/.openclaw/.env` or via
+    `env.shellEnv`).
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Image generation" href="/tools/image-generation" icon="image">
+    Shared image tool parameters and provider selection.
+  </Card>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="Music generation" href="/tools/music-generation" icon="music">
+    Shared music tool parameters and provider selection.
+  </Card>
+</CardGroup>
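
Note on the google.md change above: the "Gemini CLI JSON usage notes" accordion says `stats.cached` becomes `cacheRead`, and that a missing `stats.input` is derived as `stats.input_tokens - stats.cached`. The arithmetic can be sketched as below. The function name and its positional arguments are hypothetical, and the JSON parsing step is deliberately left out; this is an illustration of the documented rule, not OpenClaw's implementation.

```bash
#!/usr/bin/env bash
# Hypothetical helper (an assumption for illustration, not OpenClaw code).
# Args: $1 = stats.input (empty string when missing)
#       $2 = stats.input_tokens
#       $3 = stats.cached
normalize_gemini_cli_usage() {
  local input="${1:-}" input_tokens="${2:-0}" cached="${3:-0}"
  if [ -z "$input" ]; then
    # stats.input is missing: derive it from the total minus cached tokens.
    input=$(( input_tokens - cached ))
  fi
  # stats.cached is reported as cacheRead.
  printf 'input=%d cacheRead=%d\n' "$input" "$cached"
}

normalize_gemini_cli_usage "" 1200 200    # prints: input=1000 cacheRead=200
normalize_gemini_cli_usage 900 1200 200   # prints: input=900 cacheRead=200
```

When `stats.input` is present it is used as-is, so the subtraction only applies to the fallback case.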
--- a/docs/providers/groq.md
+++ b/docs/providers/groq.md
@@ -12,33 +12,37 @@ read_when:
 (Llama, Gemma, Mistral, and more) using custom LPU hardware. OpenClaw connects
 to Groq through its OpenAI-compatible API.

- Provider: `groq`
- Auth: `GROQ_API_KEY`
- API: OpenAI-compatible
+| Property | Value             |
+| -------- | ----------------- |
+| Provider | `groq`            |
+| Auth     | `GROQ_API_KEY`    |
+| API      | OpenAI-compatible |

-## Quick start
+## Getting started

-1. Get an API key from [console.groq.com/keys](https://console.groq.com/keys).
+<Steps>
+  <Step title="Get an API key">
+    Create an API key at [console.groq.com/keys](https://console.groq.com/keys).
+  </Step>
+  <Step title="Set the API key">
+    ```bash
+    export GROQ_API_KEY="gsk_..."
+    ```
+  </Step>
+  <Step title="Set a default model">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "groq/llama-3.3-70b-versatile" },
+        },
+      },
+    }
+    ```
+  </Step>
+</Steps>

-2. Set the API key:
-
-```bash
-export GROQ_API_KEY="gsk_..."
-```
-
-3. Set a default model:
-
-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "groq/llama-3.3-70b-versatile" },
-    },
-  },
-}
-```
-
-## Config file example
+### Config file example

 ```json5
 {
@@ -51,6 +55,24 @@ export GROQ_API_KEY="gsk_..."
 }
 ```

+## Available models
+
+Groq's model catalog changes frequently. Run `openclaw models list | grep groq`
+to see currently available models, or check
+[console.groq.com/docs/models](https://console.groq.com/docs/models).
+
+| Model                       | Notes                              |
+| --------------------------- | ---------------------------------- |
+| **Llama 3.3 70B Versatile** | General-purpose, large context     |
+| **Llama 3.1 8B Instant**    | Fast, lightweight                  |
+| **Gemma 2 9B**              | Compact, efficient                 |
+| **Mixtral 8x7B**            | MoE architecture, strong reasoning |
+
+<Tip>
+Use `openclaw models list --provider groq` for the most up-to-date list of
+models available on your account.
+</Tip>
+
 ## Audio transcription

 Groq also provides fast Whisper-based audio transcription. When configured as a
@@ -70,36 +92,43 @@ surface.
 }
 ```

-## Environment note
+<AccordionGroup>
+  <Accordion title="Audio transcription details">
+    | Property           | Value                                     |
+    | ------------------ | ----------------------------------------- |
+    | Shared config path | `tools.media.audio`                       |
+    | Default base URL   | `https://api.groq.com/openai/v1`          |
+    | Default model      | `whisper-large-v3-turbo`                  |
+    | API endpoint       | OpenAI-compatible `/audio/transcriptions` |
+  </Accordion>

-If the Gateway runs as a daemon (launchd/systemd), make sure `GROQ_API_KEY` is
-available to that process (for example, in `~/.openclaw/.env` or via
-`env.shellEnv`).
+  <Accordion title="Environment note">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `GROQ_API_KEY` is
+    available to that process (for example, in `~/.openclaw/.env` or via
+    `env.shellEnv`).

-## Audio notes
+    <Warning>
+    Keys set only in your interactive shell are not visible to daemon-managed
+    gateway processes. Use `~/.openclaw/.env` or `env.shellEnv` config for
+    persistent availability.
+    </Warning>

- Shared config path: `tools.media.audio`
- Default Groq audio base URL: `https://api.groq.com/openai/v1`
- Default Groq audio model: `whisper-large-v3-turbo`
- Groq audio transcription uses the OpenAI-compatible `/audio/transcriptions`
-  path
+  </Accordion>
+</AccordionGroup>

-## Available models
+## Related

-Groq's model catalog changes frequently. Run `openclaw models list | grep groq`
-to see currently available models, or check
-[console.groq.com/docs/models](https://console.groq.com/docs/models).
-
-Popular choices include:
-
- **Llama 3.3 70B Versatile** - general-purpose, large context
- **Llama 3.1 8B Instant** - fast, lightweight
- **Gemma 2 9B** - compact, efficient
- **Mixtral 8x7B** - MoE architecture, strong reasoning
-
-## Links
-
- [Groq Console](https://console.groq.com)
- [API Documentation](https://console.groq.com/docs)
- [Model List](https://console.groq.com/docs/models)
- [Pricing](https://groq.com/pricing)
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema including provider and audio settings.
+  </Card>
+  <Card title="Groq Console" href="https://console.groq.com" icon="arrow-up-right-from-square">
+    Groq dashboard, API docs, and pricing.
+  </Card>
+  <Card title="Groq model list" href="https://console.groq.com/docs/models" icon="list">
+    Official Groq model catalog.
+  </Card>
+</CardGroup>
--- a/docs/providers/huggingface.md
+++ b/docs/providers/huggingface.md
@@ -15,29 +15,49 @@ title: "Hugging Face (Inference)"
 - API: OpenAI-compatible (`https://router.huggingface.co/v1`)
 - Billing: Single HF token; [pricing](https://huggingface.co/docs/inference-providers/pricing) follows provider rates with a free tier.

-## Quick start
+## Getting started

-1. Create a fine-grained token at [Hugging Face → Settings → Tokens](https://huggingface.co/settings/tokens/new?ownUserPermissions=inference.serverless.write&tokenType=fineGrained) with the **Make calls to Inference Providers** permission.
-2. Run onboarding and choose **Hugging Face** in the provider dropdown, then enter your API key when prompted:
+<Steps>
+  <Step title="Create a fine-grained token">
+    Go to [Hugging Face → Settings → Tokens](https://huggingface.co/settings/tokens/new?ownUserPermissions=inference.serverless.write&tokenType=fineGrained) and create a new fine-grained token.

-```bash
-openclaw onboard --auth-choice huggingface-api-key
-```
+    <Warning>
+    The token must have the **Make calls to Inference Providers** permission enabled; otherwise API requests will be rejected.
+    </Warning>

-3. In the **Default Hugging Face model** dropdown, pick the model you want (the list is loaded from the Inference API when you have a valid token; otherwise a built-in list is shown). Your choice is saved as the default model.
-4. You can also set or change the default model later in config:
+  </Step>
+  <Step title="Run onboarding">
+    Choose **Hugging Face** in the provider dropdown, then enter your API key when prompted:

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "huggingface/deepseek-ai/DeepSeek-R1" },
-    },
-  },
-}
-```
+    ```bash
+    openclaw onboard --auth-choice huggingface-api-key
+    ```

-## Non-interactive example
+  </Step>
+  <Step title="Select a default model">
+    In the **Default Hugging Face model** dropdown, pick the model you want. The list is loaded from the Inference API when you have a valid token; otherwise a built-in list is shown. Your choice is saved as the default model.
+
+    You can also set or change the default model later in config:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "huggingface/deepseek-ai/DeepSeek-R1" },
+        },
+      },
+    }
+    ```
+
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider huggingface
+    ```
+  </Step>
+</Steps>
+
+### Non-interactive setup

 ```bash
 openclaw onboard --non-interactive \
@@ -48,56 +68,10 @@ openclaw onboard --non-interactive \

 This will set `huggingface/deepseek-ai/DeepSeek-R1` as the default model.

-## Environment note
-
-If the Gateway runs as a daemon (launchd/systemd), make sure `HUGGINGFACE_HUB_TOKEN` or `HF_TOKEN`
-is available to that process (for example, in `~/.openclaw/.env` or via
-`env.shellEnv`).
-
-## Model discovery and onboarding dropdown
-
-OpenClaw discovers models by calling the **Inference endpoint directly**:
-
-```bash
-GET https://router.huggingface.co/v1/models
-```
-
-(Optional: send `Authorization: Bearer $HUGGINGFACE_HUB_TOKEN` or `$HF_TOKEN` for the full list; some endpoints return a subset without auth.) The response is OpenAI-style `{ "object": "list", "data": [ { "id": "Qwen/Qwen3-8B", "owned_by": "Qwen", ... }, ... ] }`.
-
-When you configure a Hugging Face API key (via onboarding, `HUGGINGFACE_HUB_TOKEN`, or `HF_TOKEN`), OpenClaw uses this GET to discover available chat-completion models. During **interactive setup**, after you enter your token you see a **Default Hugging Face model** dropdown populated from that list (or the built-in catalog if the request fails). At runtime (e.g. Gateway startup), when a key is present, OpenClaw again calls **GET** `https://router.huggingface.co/v1/models` to refresh the catalog. The list is merged with a built-in catalog (for metadata like context window and cost). If the request fails or no key is set, only the built-in catalog is used.
-
-## Model names and editable options
-
- **Name from API:** The model display name is **hydrated from GET /v1/models** when the API returns `name`, `title`, or `display_name`; otherwise it is derived from the model id (e.g. `deepseek-ai/DeepSeek-R1` → “DeepSeek R1”).
- **Override display name:** You can set a custom label per model in config so it appears the way you want in the CLI and UI:
-
-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1 (fast)" },
-        "huggingface/deepseek-ai/DeepSeek-R1:cheapest": { alias: "DeepSeek R1 (cheap)" },
-      },
-    },
-  },
-}
-```
-
- **Policy suffixes:** OpenClaw's bundled Hugging Face docs and helpers currently treat these two suffixes as the built-in policy variants:
-  - **`:fastest`** — highest throughput.
-  - **`:cheapest`** — lowest cost per output token.
-
-  You can add these as separate entries in `models.providers.huggingface.models` or set `model.primary` with the suffix. You can also set your default provider order in [Inference Provider settings](https://hf.co/settings/inference-providers) (no suffix = use that order).
-
- **Config merge:** Existing entries in `models.providers.huggingface.models` (e.g. in `models.json`) are kept when config is merged. So any custom `name`, `alias`, or model options you set there are preserved.
-
-## Model IDs and configuration examples
+## Model IDs

 Model refs use the form `huggingface/<org>/<model>` (Hub-style IDs). The list below is from **GET** `https://router.huggingface.co/v1/models`; your catalog may include more.

-**Example IDs (from the inference endpoint):**
-
 | Model                  | Ref (prefix with `huggingface/`)    |
 | ---------------------- | ----------------------------------- |
 | DeepSeek R1            | `deepseek-ai/DeepSeek-R1`           |
@@ -111,83 +85,153 @@ Model refs use the form `huggingface/<org>/<model>` (Hub-style IDs). The list be
 | GLM 4.7                | `zai-org/GLM-4.7`                   |
 | Kimi K2.5              | `moonshotai/Kimi-K2.5`              |

-You can append `:fastest` or `:cheapest` to the model id. Set your default order in [Inference Provider settings](https://hf.co/settings/inference-providers); see [Inference Providers](https://huggingface.co/docs/inference-providers) and **GET** `https://router.huggingface.co/v1/models` for the full list.
+<Tip>
+You can append `:fastest` or `:cheapest` to any model id. Set your default order in [Inference Provider settings](https://hf.co/settings/inference-providers); see [Inference Providers](https://huggingface.co/docs/inference-providers) and **GET** `https://router.huggingface.co/v1/models` for the full list.
+</Tip>

-### Complete configuration examples
+## Advanced details

-**Primary DeepSeek R1 with Qwen fallback:**
+<AccordionGroup>
+  <Accordion title="Model discovery and onboarding dropdown">
+    OpenClaw discovers models by calling the **Inference endpoint directly**:

-```json5
-{
-  agents: {
-    defaults: {
-      model: {
-        primary: "huggingface/deepseek-ai/DeepSeek-R1",
-        fallbacks: ["huggingface/Qwen/Qwen3-8B"],
+    ```bash
+    GET https://router.huggingface.co/v1/models
+    ```
+
+    (Optional: send `Authorization: Bearer $HUGGINGFACE_HUB_TOKEN` or `$HF_TOKEN` for the full list; some endpoints return a subset without auth.) The response is OpenAI-style `{ "object": "list", "data": [ { "id": "Qwen/Qwen3-8B", "owned_by": "Qwen", ... }, ... ] }`.
+
+    When you configure a Hugging Face API key (via onboarding, `HUGGINGFACE_HUB_TOKEN`, or `HF_TOKEN`), OpenClaw uses this GET to discover available chat-completion models. During **interactive setup**, after you enter your token you see a **Default Hugging Face model** dropdown populated from that list (or the built-in catalog if the request fails). At runtime (e.g. Gateway startup), when a key is present, OpenClaw again calls **GET** `https://router.huggingface.co/v1/models` to refresh the catalog. The list is merged with a built-in catalog (for metadata like context window and cost). If the request fails or no key is set, only the built-in catalog is used.
+
+  </Accordion>
+
+  <Accordion title="Model names, aliases, and policy suffixes">
+    - **Name from API:** The model display name is **hydrated from GET /v1/models** when the API returns `name`, `title`, or `display_name`; otherwise it is derived from the model id (e.g. `deepseek-ai/DeepSeek-R1` becomes "DeepSeek R1").
+    - **Override display name:** You can set a custom label per model in config so it appears the way you want in the CLI and UI:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1 (fast)" },
+            "huggingface/deepseek-ai/DeepSeek-R1:cheapest": { alias: "DeepSeek R1 (cheap)" },
+          },
+        },
      },
-      models: {
-        "huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1" },
-        "huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
+    }
+    ```
+
+    - **Policy suffixes:** OpenClaw's bundled Hugging Face docs and helpers currently treat these two suffixes as the built-in policy variants:
+      - **`:fastest`** — highest throughput.
+      - **`:cheapest`** — lowest cost per output token.
+
+      You can add these as separate entries in `models.providers.huggingface.models` or set `model.primary` with the suffix. You can also set your default provider order in [Inference Provider settings](https://hf.co/settings/inference-providers) (no suffix = use that order).
+
+    - **Config merge:** Existing entries in `models.providers.huggingface.models` (e.g. in `models.json`) are kept when config is merged, so any custom `name`, `alias`, or model options you set there are preserved.
+
+  </Accordion>
+
+  <Accordion title="Environment and daemon setup">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `HUGGINGFACE_HUB_TOKEN` or `HF_TOKEN` is available to that process (for example, in `~/.openclaw/.env` or via `env.shellEnv`).
+
+    <Note>
+    OpenClaw accepts both `HUGGINGFACE_HUB_TOKEN` and `HF_TOKEN` as env var aliases. Either one works; if both are set, `HUGGINGFACE_HUB_TOKEN` takes precedence.
+    </Note>
+
+  </Accordion>
+
+  <Accordion title="Config: DeepSeek R1 with Qwen fallback">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: {
+            primary: "huggingface/deepseek-ai/DeepSeek-R1",
+            fallbacks: ["huggingface/Qwen/Qwen3-8B"],
+          },
+          models: {
+            "huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1" },
+            "huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>

-**Qwen as default, with :cheapest and :fastest variants:**
-
-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "huggingface/Qwen/Qwen3-8B" },
-      models: {
-        "huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
-        "huggingface/Qwen/Qwen3-8B:cheapest": { alias: "Qwen3 8B (cheapest)" },
-        "huggingface/Qwen/Qwen3-8B:fastest": { alias: "Qwen3 8B (fastest)" },
+  <Accordion title="Config: Qwen with cheapest and fastest variants">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "huggingface/Qwen/Qwen3-8B" },
+          models: {
+            "huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
+            "huggingface/Qwen/Qwen3-8B:cheapest": { alias: "Qwen3 8B (cheapest)" },
+            "huggingface/Qwen/Qwen3-8B:fastest": { alias: "Qwen3 8B (fastest)" },
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>

-**DeepSeek + Llama + GPT-OSS with aliases:**
-
-```json5
-{
-  agents: {
-    defaults: {
-      model: {
-        primary: "huggingface/deepseek-ai/DeepSeek-V3.2",
-        fallbacks: [
-          "huggingface/meta-llama/Llama-3.3-70B-Instruct",
-          "huggingface/openai/gpt-oss-120b",
-        ],
+  <Accordion title="Config: DeepSeek + Llama + GPT-OSS with aliases">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: {
+            primary: "huggingface/deepseek-ai/DeepSeek-V3.2",
+            fallbacks: [
+              "huggingface/meta-llama/Llama-3.3-70B-Instruct",
+              "huggingface/openai/gpt-oss-120b",
+            ],
+          },
+          models: {
+            "huggingface/deepseek-ai/DeepSeek-V3.2": { alias: "DeepSeek V3.2" },
+            "huggingface/meta-llama/Llama-3.3-70B-Instruct": { alias: "Llama 3.3 70B" },
+            "huggingface/openai/gpt-oss-120b": { alias: "GPT-OSS 120B" },
+          },
+        },
      },
-      models: {
-        "huggingface/deepseek-ai/DeepSeek-V3.2": { alias: "DeepSeek V3.2" },
-        "huggingface/meta-llama/Llama-3.3-70B-Instruct": { alias: "Llama 3.3 70B" },
-        "huggingface/openai/gpt-oss-120b": { alias: "GPT-OSS 120B" },
-      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>

-**Multiple Qwen and DeepSeek models with policy suffixes:**
-
-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest" },
-      models: {
-        "huggingface/Qwen/Qwen2.5-7B-Instruct": { alias: "Qwen2.5 7B" },
-        "huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest": { alias: "Qwen2.5 7B (cheap)" },
-        "huggingface/deepseek-ai/DeepSeek-R1:fastest": { alias: "DeepSeek R1 (fast)" },
-        "huggingface/meta-llama/Llama-3.1-8B-Instruct": { alias: "Llama 3.1 8B" },
+  <Accordion title="Config: Multiple Qwen and DeepSeek with policy suffixes">
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest" },
+          models: {
+            "huggingface/Qwen/Qwen2.5-7B-Instruct": { alias: "Qwen2.5 7B" },
+            "huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest": { alias: "Qwen2.5 7B (cheap)" },
+            "huggingface/deepseek-ai/DeepSeek-R1:fastest": { alias: "DeepSeek R1 (fast)" },
+            "huggingface/meta-llama/Llama-3.1-8B-Instruct": { alias: "Llama 3.1 8B" },
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Model selection" href="/concepts/models" icon="brain">
+    How to choose and configure models.
+  </Card>
+  <Card title="Inference Providers docs" href="https://huggingface.co/docs/inference-providers" icon="book">
+    Official Hugging Face Inference Providers documentation.
+  </Card>
+  <Card title="Configuration" href="/gateway/configuration" icon="gear">
+    Full config reference.
+  </Card>
+</CardGroup>
--- a/docs/providers/inferrs.md
+++ b/docs/providers/inferrs.md
@@ -16,27 +16,27 @@ OpenAI-compatible `/v1` API. OpenClaw works with `inferrs` through the generic
 `inferrs` is currently best treated as a custom self-hosted OpenAI-compatible
 backend, not a dedicated OpenClaw provider plugin.

-## Quick start
+## Getting started

-1. Start `inferrs` with a model.
-
-Example:
-
-```bash
-inferrs serve google/gemma-4-E2B-it \
-  --host 127.0.0.1 \
-  --port 8080 \
-  --device metal
-```
-
-2. Verify the server is reachable.
-
-```bash
-curl http://127.0.0.1:8080/health
-curl http://127.0.0.1:8080/v1/models
-```
-
-3. Add an explicit OpenClaw provider entry and point your default model at it.
+<Steps>
+  <Step title="Start inferrs with a model">
+    ```bash
+    inferrs serve google/gemma-4-E2B-it \
+      --host 127.0.0.1 \
+      --port 8080 \
+      --device metal
+    ```
+  </Step>
+  <Step title="Verify the server is reachable">
+    ```bash
+    curl http://127.0.0.1:8080/health
+    curl http://127.0.0.1:8080/v1/models
+    ```
+  </Step>
+  <Step title="Add an OpenClaw provider entry">
+    Add an explicit provider entry and point your default model at it. See the full config example below.
+  </Step>
+</Steps>

 ## Full config example

@@ -81,93 +81,130 @@ This example uses Gemma 4 on a local `inferrs` server.
 }
 ```

-## Why `requiresStringContent` matters
+## Advanced

-Some `inferrs` Chat Completions routes accept only string
-`messages[].content`, not structured content-part arrays.
+<AccordionGroup>
+  <Accordion title="Why requiresStringContent matters">
+    Some `inferrs` Chat Completions routes accept only string
+    `messages[].content`, not structured content-part arrays.

-If OpenClaw runs fail with an error like:
+    <Warning>
+    If OpenClaw runs fail with an error like:

-```text
-messages[1].content: invalid type: sequence, expected a string
-```
+    ```text
+    messages[1].content: invalid type: sequence, expected a string
+    ```

-set:
+    set `compat.requiresStringContent: true` in your model entry.
+    </Warning>

-```json5
-compat: {
-  requiresStringContent: true
-}
-```
+    ```json5
+    compat: {
+      requiresStringContent: true
+    }
+    ```

-OpenClaw will flatten pure text content parts into plain strings before sending
-the request.
+    OpenClaw will flatten pure text content parts into plain strings before sending
+    the request.

-## Gemma and tool-schema caveat
+  </Accordion>

-Some current `inferrs` + Gemma combinations accept small direct
-`/v1/chat/completions` requests but still fail on full OpenClaw agent-runtime
-turns.
+  <Accordion title="Gemma and tool-schema caveat">
+    Some current `inferrs` + Gemma combinations accept small direct
+    `/v1/chat/completions` requests but still fail on full OpenClaw agent-runtime
+    turns.

-If that happens, try this first:
+    If that happens, try this first:

-```json5
-compat: {
-  requiresStringContent: true,
-  supportsTools: false
-}
-```
+    ```json5
+    compat: {
+      requiresStringContent: true,
+      supportsTools: false
+    }
+    ```

-That disables OpenClaw's tool schema surface for the model and can reduce prompt
-pressure on stricter local backends.
+    That disables OpenClaw's tool schema surface for the model and can reduce prompt
+    pressure on stricter local backends.

-If tiny direct requests still work but normal OpenClaw agent turns continue to
-crash inside `inferrs`, the remaining issue is usually upstream model/server
-behavior rather than OpenClaw's transport layer.
+    If tiny direct requests still work but normal OpenClaw agent turns continue to
+    crash inside `inferrs`, the remaining issue is usually upstream model/server
+    behavior rather than OpenClaw's transport layer.

-## Manual smoke test
+  </Accordion>

-Once configured, test both layers:
+  <Accordion title="Manual smoke test">
+    Once configured, test both layers:

-```bash
-curl http://127.0.0.1:8080/v1/chat/completions \
-  -H 'content-type: application/json' \
-  -d '{"model":"google/gemma-4-E2B-it","messages":[{"role":"user","content":"What is 2 + 2?"}],"stream":false}'
+    ```bash
+    curl http://127.0.0.1:8080/v1/chat/completions \
+      -H 'content-type: application/json' \
+      -d '{"model":"google/gemma-4-E2B-it","messages":[{"role":"user","content":"What is 2 + 2?"}],"stream":false}'
+    ```

-openclaw infer model run \
-  --model inferrs/google/gemma-4-E2B-it \
-  --prompt "What is 2 + 2? Reply with one short sentence." \
-  --json
-```
+    ```bash
+    openclaw infer model run \
+      --model inferrs/google/gemma-4-E2B-it \
+      --prompt "What is 2 + 2? Reply with one short sentence." \
+      --json
+    ```

-If the first command works but the second fails, use the troubleshooting notes
-below.
+    If the first command works but the second fails, check the troubleshooting section below.
+
+  </Accordion>
+
+  <Accordion title="Proxy-style behavior">
+    `inferrs` is treated as a proxy-style OpenAI-compatible `/v1` backend, not a
+    native OpenAI endpoint.
+
+    - Native OpenAI-only request shaping does not apply here
+    - No `service_tier`, no Responses `store`, no prompt-cache hints, and no
+      OpenAI reasoning-compat payload shaping
+    - Hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
+      are not injected on custom `inferrs` base URLs
+
+  </Accordion>
+</AccordionGroup>

 ## Troubleshooting

- `curl /v1/models` fails: `inferrs` is not running, not reachable, or not
-  bound to the expected host/port.
- `messages[].content ... expected a string`: set
-  `compat.requiresStringContent: true`.
- Direct tiny `/v1/chat/completions` calls pass, but `openclaw infer model run`
-  fails: try `compat.supportsTools: false`.
- OpenClaw no longer gets schema errors, but `inferrs` still crashes on larger
-  agent turns: treat it as an upstream `inferrs` or model limitation and reduce
-  prompt pressure or switch local backend/model.
+<AccordionGroup>
+  <Accordion title="curl /v1/models fails">
+    `inferrs` is not running, not reachable, or not bound to the expected
+    host/port. Make sure the server is started and listening on the address you
+    configured.
+  </Accordion>

-## Proxy-style behavior
+  <Accordion title="messages[].content expected a string">
+    Set `compat.requiresStringContent: true` in the model entry. See the
+    `requiresStringContent` section above for details.
+  </Accordion>

-`inferrs` is treated as a proxy-style OpenAI-compatible `/v1` backend, not a
-native OpenAI endpoint.
+  <Accordion title="Direct /v1/chat/completions calls pass but openclaw infer model run fails">
+    Try setting `compat.supportsTools: false` to disable the tool schema surface.
+    See the Gemma tool-schema caveat above.
+  </Accordion>

- native OpenAI-only request shaping does not apply here
- no `service_tier`, no Responses `store`, no prompt-cache hints, and no
-  OpenAI reasoning-compat payload shaping
- hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
-  are not injected on custom `inferrs` base URLs
+  <Accordion title="inferrs still crashes on larger agent turns">
+    If OpenClaw no longer gets schema errors but `inferrs` still crashes on larger
+    agent turns, treat it as an upstream `inferrs` or model limitation. Reduce
+    prompt pressure or switch to a different local backend or model.
+  </Accordion>
+</AccordionGroup>
+
+<Tip>
+For general help, see [Troubleshooting](/help/troubleshooting) and [FAQ](/help/faq).
+</Tip>

 ## See also

- [Local models](/gateway/local-models)
- [Gateway troubleshooting](/gateway/troubleshooting#local-openai-compatible-backend-passes-direct-probes-but-agent-runs-fail)
- [Model providers](/concepts/model-providers)
+<CardGroup cols={2}>
+  <Card title="Local models" href="/gateway/local-models" icon="server">
+    Running OpenClaw against local model servers.
+  </Card>
+  <Card title="Gateway troubleshooting" href="/gateway/troubleshooting#local-openai-compatible-backend-passes-direct-probes-but-agent-runs-fail" icon="wrench">
+    Debugging local OpenAI-compatible backends that pass probes but fail agent runs.
+  </Card>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+</CardGroup>
--- a/docs/providers/kilocode.md
+++ b/docs/providers/kilocode.md
@@ -1,5 +1,5 @@
 ---
-title: "Kilo Gateway"
+title: "Kilocode"
 summary: "Use Kilo Gateway's unified API to access many models in OpenClaw"
 read_when:
  - You want a single API key for many LLMs
@@ -11,25 +11,73 @@ read_when:
 Kilo Gateway provides a **unified API** that routes requests to many models behind a single
 endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.

-## Getting an API key
+| Property | Value                              |
+| -------- | ---------------------------------- |
+| Provider | `kilocode`                         |
+| Auth     | `KILOCODE_API_KEY`                 |
+| API      | OpenAI-compatible                  |
+| Base URL | `https://api.kilo.ai/api/gateway/` |

-1. Go to [app.kilo.ai](https://app.kilo.ai)
-2. Sign in or create an account
-3. Navigate to API Keys and generate a new key
+## Getting started

-## CLI setup
+<Steps>
+  <Step title="Create an account">
+    Go to [app.kilo.ai](https://app.kilo.ai), sign in or create an account, then navigate to API Keys and generate a new key.
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice kilocode-api-key
+    ```

-```bash
-openclaw onboard --auth-choice kilocode-api-key
-```
+    Or set the environment variable directly:

-Or set the environment variable:
+    ```bash
+    export KILOCODE_API_KEY="<your-kilocode-api-key>" # pragma: allowlist secret
+    ```

-```bash
-export KILOCODE_API_KEY="<your-kilocode-api-key>" # pragma: allowlist secret
-```
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider kilocode
+    ```
+  </Step>
+</Steps>

-## Config snippet
+## Default model
+
+The default model is `kilocode/kilo/auto`, a provider-owned smart-routing
+model managed by Kilo Gateway.
+
+<Note>
+OpenClaw treats `kilocode/kilo/auto` as the stable default ref, but does not
+publish a source-backed task-to-upstream-model mapping for that route. Exact
+upstream routing behind `kilocode/kilo/auto` is owned by Kilo Gateway, not
+hard-coded in OpenClaw.
+</Note>
+
+## Available models
+
+OpenClaw dynamically discovers available models from the Kilo Gateway at startup. Use
+`/models kilocode` to see the full list of models available with your account.
+
+Any model available on the gateway can be used with the `kilocode/` prefix:
+
+| Model ref                              | Notes                              |
+| -------------------------------------- | ---------------------------------- |
+| `kilocode/kilo/auto`                   | Default — smart routing            |
+| `kilocode/anthropic/claude-sonnet-4`   | Anthropic via Kilo                 |
+| `kilocode/openai/gpt-5.4`              | OpenAI via Kilo                    |
+| `kilocode/google/gemini-3-pro-preview` | Google via Kilo                    |
+| ...and many more                       | Use `/models kilocode` to list all |
+
+<Tip>
+At startup, OpenClaw queries `GET https://api.kilo.ai/api/gateway/models` and merges
+discovered models ahead of the static fallback catalog. The bundled fallback always
+includes `kilocode/kilo/auto` (`Kilo Auto`) with `input: ["text", "image"]`,
+`reasoning: true`, `contextWindow: 1000000`, and `maxTokens: 128000`.
+</Tip>
+
+## Config example

 ```json5
 {
@@ -42,48 +90,47 @@ export KILOCODE_API_KEY="<your-kilocode-api-key>" # pragma: allowlist secret
 }
 ```

-## Default model
+<AccordionGroup>
+  <Accordion title="Transport and compatibility">
+    Kilo Gateway is documented in source as OpenRouter-compatible, so it stays on
+    the proxy-style OpenAI-compatible path rather than native OpenAI request shaping.

-The default model is `kilocode/kilo/auto`, a provider-owned smart-routing
-model managed by Kilo Gateway.
+    - Gemini-backed Kilo refs stay on the proxy-Gemini path, so OpenClaw keeps
+      Gemini thought-signature sanitation there without enabling native Gemini
+      replay validation or bootstrap rewrites.
+    - Kilo Gateway authenticates requests with your API key, sent as a Bearer token.

-OpenClaw treats `kilocode/kilo/auto` as the stable default ref, but does not
-publish a source-backed task-to-upstream-model mapping for that route.
+  </Accordion>

-## Available models
+  <Accordion title="Stream wrapper and reasoning">
+    Kilo's shared stream wrapper adds the provider app header and normalizes
+    proxy reasoning payloads for supported concrete model refs.

-OpenClaw dynamically discovers available models from the Kilo Gateway at startup. Use
-`/models kilocode` to see the full list of models available with your account.
+    <Warning>
+    `kilocode/kilo/auto` and other refs flagged as not supporting proxy reasoning
+    skip reasoning injection. If you need reasoning support, use a concrete model
+    ref such as `kilocode/anthropic/claude-sonnet-4`.
+    </Warning>

-Any model available on the gateway can be used with the `kilocode/` prefix:
+  </Accordion>

-```
-kilocode/kilo/auto              (default - smart routing)
-kilocode/anthropic/claude-sonnet-4
-kilocode/openai/gpt-5.4
-kilocode/google/gemini-3-pro-preview
-...and many more
-```
+  <Accordion title="Troubleshooting">
+    - If model discovery fails at startup, OpenClaw falls back to the bundled static catalog containing `kilocode/kilo/auto`.
+    - Confirm your API key is valid and that your Kilo account has the desired models enabled.
+    - When the Gateway runs as a daemon, ensure `KILOCODE_API_KEY` is available to that process (for example in `~/.openclaw/.env` or via `env.shellEnv`).
+  </Accordion>
+</AccordionGroup>

-## Notes
+## Related

- Model refs are `kilocode/<model-id>` (e.g., `kilocode/anthropic/claude-sonnet-4`).
- Default model: `kilocode/kilo/auto`
- Base URL: `https://api.kilo.ai/api/gateway/`
- Bundled fallback catalog always includes `kilocode/kilo/auto` (`Kilo Auto`) with
-  `input: ["text", "image"]`, `reasoning: true`, `contextWindow: 1000000`,
-  and `maxTokens: 128000`
- At startup, OpenClaw tries `GET https://api.kilo.ai/api/gateway/models` and
-  merges discovered models ahead of the static fallback catalog
- Exact upstream routing behind `kilocode/kilo/auto` is owned by Kilo Gateway,
-  not hard-coded in OpenClaw
- Kilo Gateway is documented in source as OpenRouter-compatible, so it stays on
-  the proxy-style OpenAI-compatible path rather than native OpenAI request shaping
- Gemini-backed Kilo refs stay on the proxy-Gemini path, so OpenClaw keeps
-  Gemini thought-signature sanitation there without enabling native Gemini
-  replay validation or bootstrap rewrites.
- Kilo's shared stream wrapper adds the provider app header and normalizes
-  proxy reasoning payloads for supported concrete model refs. `kilocode/kilo/auto`
-  and other proxy-reasoning-unsupported hints skip that reasoning injection.
- For more model/provider options, see [/concepts/model-providers](/concepts/model-providers).
- Kilo Gateway uses a Bearer token with your API key under the hood.
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration" icon="gear">
+    Full OpenClaw configuration reference.
+  </Card>
+  <Card title="Kilo Gateway" href="https://app.kilo.ai" icon="arrow-up-right-from-square">
+    Kilo Gateway dashboard, API keys, and account management.
+  </Card>
+</CardGroup>
--- a/docs/providers/litellm.md
+++ b/docs/providers/litellm.md
@@ -10,40 +10,55 @@ read_when:

 [LiteLLM](https://litellm.ai) is an open-source LLM gateway that provides a unified API to 100+ model providers. Route OpenClaw through LiteLLM to get centralized cost tracking, logging, and the flexibility to switch backends without changing your OpenClaw config.

-## Why use LiteLLM with OpenClaw?
+<Tip>
+**Why use LiteLLM with OpenClaw?**

 - **Cost tracking** — See exactly what OpenClaw spends across all models
 - **Model routing** — Switch between Claude, GPT-4, Gemini, Bedrock without config changes
 - **Virtual keys** — Create keys with spend limits for OpenClaw
 - **Logging** — Full request/response logs for debugging
 - **Fallbacks** — Automatic failover if your primary provider is down
+  </Tip>

 ## Quick start

-### Via onboarding
+<Tabs>
+  <Tab title="Onboarding (recommended)">
+    **Best for:** fastest path to a working LiteLLM setup.

-```bash
-openclaw onboard --auth-choice litellm-api-key
-```
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice litellm-api-key
+        ```
+      </Step>
+    </Steps>

-### Manual setup
+  </Tab>

-1. Start LiteLLM Proxy:
+  <Tab title="Manual setup">
+    **Best for:** full control over installation and config.

-```bash
-pip install 'litellm[proxy]'
-litellm --model claude-opus-4-6
-```
+    <Steps>
+      <Step title="Start LiteLLM Proxy">
+        ```bash
+        pip install 'litellm[proxy]'
+        litellm --model claude-opus-4-6
+        ```
+      </Step>
+      <Step title="Point OpenClaw to LiteLLM">
+        ```bash
+        export LITELLM_API_KEY="your-litellm-key"

-2. Point OpenClaw to LiteLLM:
+        openclaw
+        ```

-```bash
-export LITELLM_API_KEY="your-litellm-key"
+        That's it. OpenClaw now routes through LiteLLM.
+      </Step>
+    </Steps>

-openclaw
-```
-
-That's it. OpenClaw now routes through LiteLLM.
+  </Tab>
+</Tabs>

 ## Configuration

@@ -92,68 +107,91 @@ export LITELLM_API_KEY="sk-litellm-key"
 }
 ```

-## Virtual keys
+## Advanced topics

-Create a dedicated key for OpenClaw with spend limits:
+<AccordionGroup>
+  <Accordion title="Virtual keys">
+    Create a dedicated key for OpenClaw with spend limits:

-```bash
-curl -X POST "http://localhost:4000/key/generate" \
-  -H "Authorization: Bearer $LITELLM_MASTER_KEY" \
-  -H "Content-Type: application/json" \
-  -d '{
-    "key_alias": "openclaw",
-    "max_budget": 50.00,
-    "budget_duration": "monthly"
-  }'
-```
+    ```bash
+    curl -X POST "http://localhost:4000/key/generate" \
+      -H "Authorization: Bearer $LITELLM_MASTER_KEY" \
+      -H "Content-Type: application/json" \
+      -d '{
+        "key_alias": "openclaw",
+        "max_budget": 50.00,
+        "budget_duration": "monthly"
+      }'
+    ```

-Use the generated key as `LITELLM_API_KEY`.
+    Use the generated key as `LITELLM_API_KEY`.

-## Model routing
+  </Accordion>

-LiteLLM can route model requests to different backends. Configure in your LiteLLM `config.yaml`:
+  <Accordion title="Model routing">
+    LiteLLM can route model requests to different backends. Configure in your LiteLLM `config.yaml`:

-```yaml
-model_list:
-  - model_name: claude-opus-4-6
-    litellm_params:
-      model: claude-opus-4-6
-      api_key: os.environ/ANTHROPIC_API_KEY
+    ```yaml
+    model_list:
+      - model_name: claude-opus-4-6
+        litellm_params:
+          model: claude-opus-4-6
+          api_key: os.environ/ANTHROPIC_API_KEY

-  - model_name: gpt-4o
-    litellm_params:
-      model: gpt-4o
-      api_key: os.environ/OPENAI_API_KEY
-```
+      - model_name: gpt-4o
+        litellm_params:
+          model: gpt-4o
+          api_key: os.environ/OPENAI_API_KEY
+    ```

-OpenClaw keeps requesting `claude-opus-4-6` — LiteLLM handles the routing.
+    OpenClaw keeps requesting `claude-opus-4-6` — LiteLLM handles the routing.

-## Viewing usage
+  </Accordion>

-Check LiteLLM's dashboard or API:
+  <Accordion title="Viewing usage">
+    Check LiteLLM's dashboard or API:

-```bash
-# Key info
-curl "http://localhost:4000/key/info" \
-  -H "Authorization: Bearer sk-litellm-key"
+    ```bash
+    # Key info
+    curl "http://localhost:4000/key/info" \
+      -H "Authorization: Bearer sk-litellm-key"

-# Spend logs
-curl "http://localhost:4000/spend/logs" \
-  -H "Authorization: Bearer $LITELLM_MASTER_KEY"
-```
+    # Spend logs
+    curl "http://localhost:4000/spend/logs" \
+      -H "Authorization: Bearer $LITELLM_MASTER_KEY"
+    ```

-## Notes
+  </Accordion>

- LiteLLM runs on `http://localhost:4000` by default
- OpenClaw connects through LiteLLM's proxy-style OpenAI-compatible `/v1`
-  endpoint
- Native OpenAI-only request shaping does not apply through LiteLLM:
-  no `service_tier`, no Responses `store`, no prompt-cache hints, and no
-  OpenAI reasoning-compat payload shaping
- Hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
-  are not injected on custom LiteLLM base URLs
+  <Accordion title="Proxy behavior notes">
+    - LiteLLM runs on `http://localhost:4000` by default
+    - OpenClaw connects through LiteLLM's proxy-style OpenAI-compatible `/v1`
+      endpoint
+    - Native OpenAI-only request shaping does not apply through LiteLLM:
+      no `service_tier`, no Responses `store`, no prompt-cache hints, and no
+      OpenAI reasoning-compat payload shaping
+    - Hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
+      are not injected on custom LiteLLM base URLs
+  </Accordion>
+</AccordionGroup>

-## See also
+<Note>
+For general provider configuration and failover behavior, see [Model Providers](/concepts/model-providers).
+</Note>

- [LiteLLM Docs](https://docs.litellm.ai)
- [Model Providers](/concepts/model-providers)
+## Related
+
+<CardGroup cols={2}>
+  <Card title="LiteLLM Docs" href="https://docs.litellm.ai" icon="book">
+    Official LiteLLM documentation and API reference.
+  </Card>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration" href="/gateway/configuration" icon="gear">
+    Full config reference.
+  </Card>
+  <Card title="Model selection" href="/concepts/models" icon="brain">
+    How to choose and configure models.
+  </Card>
+</CardGroup>
--- a/docs/providers/minimax.md
+++ b/docs/providers/minimax.md
@@ -12,31 +12,212 @@ OpenClaw's MiniMax provider defaults to **MiniMax M2.7**.

 MiniMax also provides:

- bundled speech synthesis via T2A v2
- bundled image understanding via `MiniMax-VL-01`
- bundled music generation via `music-2.5+`
- bundled `web_search` through the MiniMax Coding Plan search API
+- Bundled speech synthesis via T2A v2
+- Bundled image understanding via `MiniMax-VL-01`
+- Bundled music generation via `music-2.5+`
+- Bundled `web_search` through the MiniMax Coding Plan search API

 Provider split:

- `minimax`: API-key text provider, plus bundled image generation, image understanding, speech, and web search
- `minimax-portal`: OAuth text provider, plus bundled image generation and image understanding
+| Provider ID      | Auth    | Capabilities                                                    |
+| ---------------- | ------- | --------------------------------------------------------------- |
+| `minimax`        | API key | Text, image generation, image understanding, speech, web search |
+| `minimax-portal` | OAuth   | Text, image generation, image understanding                     |

 ## Model lineup

- `MiniMax-M2.7`: default hosted reasoning model.
- `MiniMax-M2.7-highspeed`: faster M2.7 reasoning tier.
- `image-01`: image generation model (generate and image-to-image editing).
+| Model                    | Type             | Description                              |
+| ------------------------ | ---------------- | ---------------------------------------- |
+| `MiniMax-M2.7`           | Chat (reasoning) | Default hosted reasoning model           |
+| `MiniMax-M2.7-highspeed` | Chat (reasoning) | Faster M2.7 reasoning tier               |
+| `MiniMax-VL-01`          | Vision           | Image understanding model                |
+| `image-01`               | Image generation | Text-to-image and image-to-image editing |
+| `music-2.5+`             | Music generation | Default music model                      |
+| `music-2.5`              | Music generation | Previous music generation tier           |
+| `music-2.0`              | Music generation | Legacy music generation tier             |
+| `MiniMax-Hailuo-2.3`     | Video generation | Text-to-video and image reference flows  |

-## Image generation
+## Getting started
+
+Choose your preferred auth method and follow the setup steps.
+
+<Tabs>
+  <Tab title="OAuth (Coding Plan)">
+    **Best for:** quick setup with MiniMax Coding Plan via OAuth, no API key required.
+
+    <Tabs>
+      <Tab title="International">
+        <Steps>
+          <Step title="Run onboarding">
+            ```bash
+            openclaw onboard --auth-choice minimax-global-oauth
+            ```
+
+            This authenticates against `api.minimax.io`.
+          </Step>
+          <Step title="Verify the model is available">
+            ```bash
+            openclaw models list --provider minimax-portal
+            ```
+          </Step>
+        </Steps>
+      </Tab>
+      <Tab title="China">
+        <Steps>
+          <Step title="Run onboarding">
+            ```bash
+            openclaw onboard --auth-choice minimax-cn-oauth
+            ```
+
+            This authenticates against `api.minimaxi.com`.
+          </Step>
+          <Step title="Verify the model is available">
+            ```bash
+            openclaw models list --provider minimax-portal
+            ```
+          </Step>
+        </Steps>
+      </Tab>
+    </Tabs>
+
+    <Note>
+    OAuth setups use the `minimax-portal` provider id. Model refs follow the form `minimax-portal/MiniMax-M2.7`.
+    </Note>
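+
+    For example, a minimal sketch that pins the OAuth-backed model as the default, assuming onboarding has already stored the OAuth credentials:
+
+    ```json5
+    {
+      // OAuth setups use the `minimax-portal` provider id, not `minimax`.
+      agents: { defaults: { model: { primary: "minimax-portal/MiniMax-M2.7" } } },
+    }
+    ```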
+
+    <Tip>
+    Referral link for MiniMax Coding Plan (10% off): [MiniMax Coding Plan](https://platform.minimax.io/subscribe/coding-plan?code=DbXJTRClnb&source=link)
+    </Tip>
+
+  </Tab>
+
+  <Tab title="API key">
+    **Best for:** hosted MiniMax with Anthropic-compatible API.
+
+    <Tabs>
+      <Tab title="International">
+        <Steps>
+          <Step title="Run onboarding">
+            ```bash
+            openclaw onboard --auth-choice minimax-global-api
+            ```
+
+            This configures `api.minimax.io` as the base URL.
+          </Step>
+          <Step title="Verify the model is available">
+            ```bash
+            openclaw models list --provider minimax
+            ```
+          </Step>
+        </Steps>
+      </Tab>
+      <Tab title="China">
+        <Steps>
+          <Step title="Run onboarding">
+            ```bash
+            openclaw onboard --auth-choice minimax-cn-api
+            ```
+
+            This configures `api.minimaxi.com` as the base URL.
+          </Step>
+          <Step title="Verify the model is available">
+            ```bash
+            openclaw models list --provider minimax
+            ```
+          </Step>
+        </Steps>
+      </Tab>
+    </Tabs>
+
+    ### Config example
+
+    ```json5
+    {
+      env: { MINIMAX_API_KEY: "sk-..." },
+      agents: { defaults: { model: { primary: "minimax/MiniMax-M2.7" } } },
+      models: {
+        mode: "merge",
+        providers: {
+          minimax: {
+            baseUrl: "https://api.minimax.io/anthropic",
+            apiKey: "${MINIMAX_API_KEY}",
+            api: "anthropic-messages",
+            models: [
+              {
+                id: "MiniMax-M2.7",
+                name: "MiniMax M2.7",
+                reasoning: true,
+                input: ["text", "image"],
+                cost: { input: 0.3, output: 1.2, cacheRead: 0.06, cacheWrite: 0.375 },
+                contextWindow: 204800,
+                maxTokens: 131072,
+              },
+              {
+                id: "MiniMax-M2.7-highspeed",
+                name: "MiniMax M2.7 Highspeed",
+                reasoning: true,
+                input: ["text", "image"],
+                cost: { input: 0.6, output: 2.4, cacheRead: 0.06, cacheWrite: 0.375 },
+                contextWindow: 204800,
+                maxTokens: 131072,
+              },
+            ],
+          },
+        },
+      },
+    }
+    ```
+
+    <Warning>
+    On the Anthropic-compatible streaming path, OpenClaw disables MiniMax thinking by default unless you explicitly set `thinking` yourself. MiniMax's streaming endpoint emits `reasoning_content` in OpenAI-style delta chunks instead of native Anthropic thinking blocks, which can leak internal reasoning into visible output if left enabled implicitly.
+    </Warning>
+
+    <Note>
+    API-key setups use the `minimax` provider id. Model refs follow the form `minimax/MiniMax-M2.7`.
+    </Note>
+
+  </Tab>
+</Tabs>
+
+## Configure via `openclaw configure`
+
+Use the interactive config wizard to set up MiniMax without editing JSON:
+
+<Steps>
+  <Step title="Launch the wizard">
+    ```bash
+    openclaw configure
+    ```
+  </Step>
+  <Step title="Select Model/auth">
+    Choose **Model/auth** from the menu.
+  </Step>
+  <Step title="Choose a MiniMax auth option">
+    Pick one of the available MiniMax options:
+
+    | Auth choice | Description |
+    | --- | --- |
+    | `minimax-global-oauth` | International OAuth (Coding Plan) |
+    | `minimax-cn-oauth` | China OAuth (Coding Plan) |
+    | `minimax-global-api` | International API key |
+    | `minimax-cn-api` | China API key |
+
+  </Step>
+  <Step title="Pick your default model">
+    Select your default model when prompted.
+  </Step>
+</Steps>
+
+## Capabilities
+
+### Image generation

 The MiniMax plugin registers the `image-01` model for the `image_generate` tool. It supports:

- **Text-to-image generation** with aspect ratio control.
- **Image-to-image editing** (subject reference) with aspect ratio control.
- Up to **9 output images** per request.
- Up to **1 reference image** per edit request.
- Supported aspect ratios: `1:1`, `16:9`, `4:3`, `3:2`, `2:3`, `3:4`, `9:16`, `21:9`.
+- **Text-to-image generation** with aspect ratio control
+- **Image-to-image editing** (subject reference) with aspect ratio control
+- Up to **9 output images** per request
+- Up to **1 reference image** per edit request
+- Supported aspect ratios: `1:1`, `16:9`, `4:3`, `3:2`, `2:3`, `3:4`, `9:16`, `21:9`

 To use MiniMax for image generation, set it as the image generation provider:

@@ -64,10 +245,11 @@ The built-in bundled MiniMax text catalog itself stays text-only metadata until
 that explicit provider config exists. Image understanding is exposed separately
 through the plugin-owned `MiniMax-VL-01` media provider.

-See [Image Generation](/tools/image-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Image Generation](/tools/image-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

-## Music generation
+### Music generation

 The bundled `minimax` plugin also registers music generation through the shared
 `music_generate` tool.
@@ -92,10 +274,11 @@ To use MiniMax as the default music provider:
 }
 ```

-See [Music Generation](/tools/music-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Music Generation](/tools/music-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

-## Video generation
+### Video generation

 The bundled `minimax` plugin also registers video generation through the shared
 `video_generate` tool.
@@ -118,21 +301,24 @@ To use MiniMax as the default video provider:
 }
 ```

-See [Video Generation](/tools/video-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Video Generation](/tools/video-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

-## Image understanding
+### Image understanding

 The MiniMax plugin registers image understanding separately from the text
 catalog:

- `minimax`: default image model `MiniMax-VL-01`
- `minimax-portal`: default image model `MiniMax-VL-01`
+| Provider ID      | Default image model |
+| ---------------- | ------------------- |
+| `minimax`        | `MiniMax-VL-01`     |
+| `minimax-portal` | `MiniMax-VL-01`     |

 That is why automatic media routing can use MiniMax image understanding even
 when the bundled text-provider catalog still shows text-only M2.7 chat refs.

-## Web search
+### Web search

 The MiniMax plugin also registers `web_search` through the MiniMax Coding Plan
 search API.
@@ -146,136 +332,66 @@ search API.
 - Search stays on provider id `minimax`; OAuth CN/global setup can still steer region indirectly through `models.providers.minimax-portal.baseUrl`

 Config lives under `plugins.entries.minimax.config.webSearch.*`.
-See [MiniMax Search](/tools/minimax-search).

-## Choose a setup
+<Note>
+See [MiniMax Search](/tools/minimax-search) for full web search configuration and usage.
+</Note>

-### MiniMax OAuth (Coding Plan) - recommended
+## Advanced configuration

-**Best for:** quick setup with MiniMax Coding Plan via OAuth, no API key required.
+<AccordionGroup>
+  <Accordion title="Configuration options">
+    | Option | Description |
+    | --- | --- |
+    | `models.providers.minimax.baseUrl` | Prefer `https://api.minimax.io/anthropic` (Anthropic-compatible); `https://api.minimax.io/v1` is optional for OpenAI-compatible payloads |
+    | `models.providers.minimax.api` | Prefer `anthropic-messages`; `openai-completions` is optional for OpenAI-compatible payloads |
+    | `models.providers.minimax.apiKey` | MiniMax API key (`MINIMAX_API_KEY`) |
+    | `models.providers.minimax.models` | Define `id`, `name`, `reasoning`, `contextWindow`, `maxTokens`, `cost` |
+    | `agents.defaults.models` | Alias models you want in the allowlist |
+    | `models.mode` | Keep `merge` if you want to add MiniMax alongside built-ins |
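+
+    The table lists an optional OpenAI-compatible pairing. A sketch of that variant, assuming the same model metadata as the Anthropic-compatible example, might look like this (only `baseUrl` and `api` differ; the key value is a placeholder):
+
+    ```json5
+    {
+      env: { MINIMAX_API_KEY: "sk-..." },
+      models: {
+        mode: "merge", // keep built-in providers alongside MiniMax
+        providers: {
+          minimax: {
+            baseUrl: "https://api.minimax.io/v1", // OpenAI-compatible endpoint
+            apiKey: "${MINIMAX_API_KEY}",
+            api: "openai-completions",
+            models: [
+              {
+                id: "MiniMax-M2.7",
+                name: "MiniMax M2.7",
+                reasoning: true,
+                contextWindow: 204800,
+                maxTokens: 131072,
+              },
+            ],
+          },
+        },
+      },
+    }
+    ```
+
+    The Anthropic-compatible pairing remains the preferred default; this variant is only for cases where you need OpenAI-compatible payloads.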
+  </Accordion>

-Authenticate with the explicit regional OAuth choice:
+  <Accordion title="Thinking defaults">
+    On `api: "anthropic-messages"`, OpenClaw injects `thinking: { type: "disabled" }` unless thinking is already explicitly set in params/config.

-```bash
-openclaw onboard --auth-choice minimax-global-oauth
-# or
-openclaw onboard --auth-choice minimax-cn-oauth
-```
+    This prevents MiniMax's streaming endpoint from emitting `reasoning_content` in OpenAI-style delta chunks, which would leak internal reasoning into visible output.

-Choice mapping:
+  </Accordion>

- `minimax-global-oauth`: International users (`api.minimax.io`)
- `minimax-cn-oauth`: Users in China (`api.minimaxi.com`)
+  <Accordion title="Fast mode">
+    `/fast on` or `params.fastMode: true` rewrites `MiniMax-M2.7` to `MiniMax-M2.7-highspeed` on the Anthropic-compatible stream path.
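+
+    If you want the faster tier for every turn rather than toggling fast mode, one option is to pin the highspeed ref as the primary model. A minimal sketch, assuming an API-key setup (use the `minimax-portal/` prefix for OAuth):
+
+    ```json5
+    {
+      // Always use the highspeed tier instead of relying on the fast-mode rewrite.
+      agents: { defaults: { model: { primary: "minimax/MiniMax-M2.7-highspeed" } } },
+    }
+    ```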
+  </Accordion>

-See the MiniMax plugin package README in the OpenClaw repo for details.
+  <Accordion title="Fallback example">
+    **Best for:** keeping your strongest latest-generation model as primary and failing over to MiniMax M2.7. The example below uses Opus as a concrete primary; swap in your preferred latest-generation model.

-### MiniMax M2.7 (API key)
-
-**Best for:** hosted MiniMax with Anthropic-compatible API.
-
-Configure via CLI:
-
- Interactive onboarding:
-
-```bash
-openclaw onboard --auth-choice minimax-global-api
-# or
-openclaw onboard --auth-choice minimax-cn-api
-```
-
- `minimax-global-api`: International users (`api.minimax.io`)
- `minimax-cn-api`: Users in China (`api.minimaxi.com`)
-
-```json5
-{
-  env: { MINIMAX_API_KEY: "sk-..." },
-  agents: { defaults: { model: { primary: "minimax/MiniMax-M2.7" } } },
-  models: {
-    mode: "merge",
-    providers: {
-      minimax: {
-        baseUrl: "https://api.minimax.io/anthropic",
-        apiKey: "${MINIMAX_API_KEY}",
-        api: "anthropic-messages",
-        models: [
-          {
-            id: "MiniMax-M2.7",
-            name: "MiniMax M2.7",
-            reasoning: true,
-            input: ["text", "image"],
-            cost: { input: 0.3, output: 1.2, cacheRead: 0.06, cacheWrite: 0.375 },
-            contextWindow: 204800,
-            maxTokens: 131072,
+    ```json5
+    {
+      env: { MINIMAX_API_KEY: "sk-..." },
+      agents: {
+        defaults: {
+          models: {
+            "anthropic/claude-opus-4-6": { alias: "primary" },
+            "minimax/MiniMax-M2.7": { alias: "minimax" },
          },
-          {
-            id: "MiniMax-M2.7-highspeed",
-            name: "MiniMax M2.7 Highspeed",
-            reasoning: true,
-            input: ["text", "image"],
-            cost: { input: 0.6, output: 2.4, cacheRead: 0.06, cacheWrite: 0.375 },
-            contextWindow: 204800,
-            maxTokens: 131072,
+          model: {
+            primary: "anthropic/claude-opus-4-6",
+            fallbacks: ["minimax/MiniMax-M2.7"],
          },
-        ],
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-On the Anthropic-compatible streaming path, OpenClaw now disables MiniMax
-thinking by default unless you explicitly set `thinking` yourself. MiniMax's
-streaming endpoint emits `reasoning_content` in OpenAI-style delta chunks
-instead of native Anthropic thinking blocks, which can leak internal reasoning
-into visible output if left enabled implicitly.
+  </Accordion>

-### MiniMax M2.7 as fallback (example)
-
-**Best for:** keep your strongest latest-generation model as primary, fail over to MiniMax M2.7.
-Example below uses Opus as a concrete primary; swap to your preferred latest-gen primary model.
-
-```json5
-{
-  env: { MINIMAX_API_KEY: "sk-..." },
-  agents: {
-    defaults: {
-      models: {
-        "anthropic/claude-opus-4-6": { alias: "primary" },
-        "minimax/MiniMax-M2.7": { alias: "minimax" },
-      },
-      model: {
-        primary: "anthropic/claude-opus-4-6",
-        fallbacks: ["minimax/MiniMax-M2.7"],
-      },
-    },
-  },
-}
-```
-
-## Configure via `openclaw configure`
-
-Use the interactive config wizard to set MiniMax without editing JSON:
-
-1. Run `openclaw configure`.
-2. Select **Model/auth**.
-3. Choose a **MiniMax** auth option.
-4. Pick your default model when prompted.
-
-Current MiniMax auth choices in the wizard/CLI:
-
- `minimax-global-oauth`
- `minimax-cn-oauth`
- `minimax-global-api`
- `minimax-cn-api`
-
-## Configuration options
-
- `models.providers.minimax.baseUrl`: prefer `https://api.minimax.io/anthropic` (Anthropic-compatible); `https://api.minimax.io/v1` is optional for OpenAI-compatible payloads.
- `models.providers.minimax.api`: prefer `anthropic-messages`; `openai-completions` is optional for OpenAI-compatible payloads.
- `models.providers.minimax.apiKey`: MiniMax API key (`MINIMAX_API_KEY`).
- `models.providers.minimax.models`: define `id`, `name`, `reasoning`, `contextWindow`, `maxTokens`, `cost`.
- `agents.defaults.models`: alias models you want in the allowlist.
- `models.mode`: keep `merge` if you want to add MiniMax alongside built-ins.
+  <Accordion title="Coding Plan usage details">
+    - Coding Plan usage API: `https://api.minimaxi.com/v1/api/openplatform/coding_plan/remains` (requires a coding plan key).
+    - OpenClaw normalizes MiniMax coding-plan usage to the same `% left` display used by other providers. MiniMax's raw `usage_percent` / `usagePercent` fields are remaining quota, not consumed quota, so OpenClaw inverts them. Count-based fields win when present.
+    - When the API returns `model_remains`, OpenClaw prefers the chat-model entry, derives the window label from `start_time` / `end_time` when needed, and includes the selected model name in the plan label so coding-plan windows are easier to distinguish.
+    - Usage snapshots treat `minimax`, `minimax-cn`, and `minimax-portal` as the same MiniMax quota surface, and prefer stored MiniMax OAuth before falling back to Coding Plan key env vars.
+  </Accordion>
+</AccordionGroup>

 ## Notes

@@ -284,56 +400,67 @@ Current MiniMax auth choices in the wizard/CLI:
  - OAuth setup: `minimax-portal/<model>`
 - Default chat model: `MiniMax-M2.7`
 - Alternate chat model: `MiniMax-M2.7-highspeed`
- On `api: "anthropic-messages"`, OpenClaw injects
-  `thinking: { type: "disabled" }` unless thinking is already explicitly set in
-  params/config.
- `/fast on` or `params.fastMode: true` rewrites `MiniMax-M2.7` to
-  `MiniMax-M2.7-highspeed` on the Anthropic-compatible stream path.
- Onboarding and direct API-key setup write explicit model definitions with
-  `input: ["text", "image"]` for both M2.7 variants
- The bundled provider catalog currently exposes the chat refs as text-only
-  metadata until explicit MiniMax provider config exists
- Coding Plan usage API: `https://api.minimaxi.com/v1/api/openplatform/coding_plan/remains` (requires a coding plan key).
- OpenClaw normalizes MiniMax coding-plan usage to the same `% left` display
-  used by other providers. MiniMax's raw `usage_percent` / `usagePercent`
-  fields are remaining quota, not consumed quota, so OpenClaw inverts them.
-  Count-based fields win when present. When the API returns `model_remains`,
-  OpenClaw prefers the chat-model entry, derives the window label from
-  `start_time` / `end_time` when needed, and includes the selected model name
-  in the plan label so coding-plan windows are easier to distinguish.
- Usage snapshots treat `minimax`, `minimax-cn`, and `minimax-portal` as the
-  same MiniMax quota surface, and prefer stored MiniMax OAuth before falling
-  back to Coding Plan key env vars.
- Update pricing values in `models.json` if you need exact cost tracking.
- Referral link for MiniMax Coding Plan (10% off): [https://platform.minimax.io/subscribe/coding-plan?code=DbXJTRClnb&source=link](https://platform.minimax.io/subscribe/coding-plan?code=DbXJTRClnb&source=link)
- See [/concepts/model-providers](/concepts/model-providers) for provider rules.
- Use `openclaw models list` to confirm the current provider id, then switch with
-  `openclaw models set minimax/MiniMax-M2.7` or
-  `openclaw models set minimax-portal/MiniMax-M2.7`.
+- Onboarding and direct API-key setup write explicit model definitions with `input: ["text", "image"]` for both M2.7 variants
+- The bundled provider catalog currently exposes the chat refs as text-only metadata until explicit MiniMax provider config exists
+- Update pricing values in `models.json` if you need exact cost tracking
+- Use `openclaw models list` to confirm the current provider id, then switch with `openclaw models set minimax/MiniMax-M2.7` or `openclaw models set minimax-portal/MiniMax-M2.7`
+
+<Tip>
+Referral link for MiniMax Coding Plan (10% off): [MiniMax Coding Plan](https://platform.minimax.io/subscribe/coding-plan?code=DbXJTRClnb&source=link)
+</Tip>
+
+<Note>
+See [Model providers](/concepts/model-providers) for provider rules.
+</Note>

 ## Troubleshooting

-### "Unknown model: minimax/MiniMax-M2.7"
+<AccordionGroup>
+  <Accordion title='"Unknown model: minimax/MiniMax-M2.7"'>
+    This usually means the **MiniMax provider is not configured** (no matching provider entry and no MiniMax auth profile/env key found). A fix for this detection is in **2026.1.12**. Fix by:

-This usually means the **MiniMax provider isn’t configured** (no matching
-provider entry and no MiniMax auth profile/env key found). A fix for this
-detection is in **2026.1.12**. Fix by:
+    - Upgrading to **2026.1.12** (or running from source `main`), then restarting the gateway.
+    - Running `openclaw configure` and selecting a **MiniMax** auth option, or
+    - Adding the matching `models.providers.minimax` or `models.providers.minimax-portal` block manually, or
+    - Setting `MINIMAX_API_KEY`, `MINIMAX_OAUTH_TOKEN`, or a MiniMax auth profile so the matching provider can be injected.

- Upgrading to **2026.1.12** (or run from source `main`), then restarting the gateway.
- Running `openclaw configure` and selecting a **MiniMax** auth option, or
- Adding the matching `models.providers.minimax` or
-  `models.providers.minimax-portal` block manually, or
- Setting `MINIMAX_API_KEY`, `MINIMAX_OAUTH_TOKEN`, or a MiniMax auth profile
-  so the matching provider can be injected.
+    Model ids are **case-sensitive**, so make sure the ref matches exactly:

-Make sure the model id is **case‑sensitive**:
+    - API-key path: `minimax/MiniMax-M2.7` or `minimax/MiniMax-M2.7-highspeed`
+    - OAuth path: `minimax-portal/MiniMax-M2.7` or `minimax-portal/MiniMax-M2.7-highspeed`

- API-key path: `minimax/MiniMax-M2.7` or `minimax/MiniMax-M2.7-highspeed`
- OAuth path: `minimax-portal/MiniMax-M2.7` or
-  `minimax-portal/MiniMax-M2.7-highspeed`
+    Then recheck with:

-Then recheck with:
+    ```bash
+    openclaw models list
+    ```

-```bash
-openclaw models list
-```
+  </Accordion>
+</AccordionGroup>
+
+<Note>
+More help: [Troubleshooting](/help/troubleshooting) and [FAQ](/help/faq).
+</Note>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Image generation" href="/tools/image-generation" icon="image">
+    Shared image tool parameters and provider selection.
+  </Card>
+  <Card title="Music generation" href="/tools/music-generation" icon="music">
+    Shared music tool parameters and provider selection.
+  </Card>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="MiniMax Search" href="/tools/minimax-search" icon="magnifying-glass">
+    Web search configuration via MiniMax Coding Plan.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    General troubleshooting and FAQ.
+  </Card>
+</CardGroup>
--- a/docs/providers/mistral.md
+++ b/docs/providers/mistral.md
@@ -12,22 +12,42 @@ OpenClaw supports Mistral for both text/image model routing (`mistral/...`) and
 audio transcription via Voxtral in media understanding.
 Mistral can also be used for memory embeddings (`memorySearch.provider = "mistral"`).

-## CLI setup
+- Provider: `mistral`
+- Auth: `MISTRAL_API_KEY`
+- API: Mistral Chat Completions (`https://api.mistral.ai/v1`)

-```bash
-openclaw onboard --auth-choice mistral-api-key
-# or non-interactive
-openclaw onboard --mistral-api-key "$MISTRAL_API_KEY"
-```
+## Getting started

-## Config snippet (LLM provider)
+<Steps>
+  <Step title="Get your API key">
+    Create an API key in the [Mistral Console](https://console.mistral.ai/).
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice mistral-api-key
+    ```

-```json5
-{
-  env: { MISTRAL_API_KEY: "sk-..." },
-  agents: { defaults: { model: { primary: "mistral/mistral-large-latest" } } },
-}
-```
+    Or pass the key directly:
+
+    ```bash
+    openclaw onboard --mistral-api-key "$MISTRAL_API_KEY"
+    ```
+
+  </Step>
+  <Step title="Set a default model">
+    ```json5
+    {
+      env: { MISTRAL_API_KEY: "sk-..." },
+      agents: { defaults: { model: { primary: "mistral/mistral-large-latest" } } },
+    }
+    ```
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider mistral
+    ```
+  </Step>
+</Steps>

 ## Built-in LLM catalog

@@ -43,7 +63,9 @@ OpenClaw currently ships this bundled Mistral catalog:
 | `mistral/devstral-medium-latest` | text        | 262,144 | 32,768     | Devstral 2                                                       |
 | `mistral/magistral-small`        | text        | 128,000 | 40,000     | Reasoning-enabled                                                |

-## Config snippet (audio transcription with Voxtral)
+## Audio transcription (Voxtral)
+
+Use Voxtral for audio transcription through the media understanding pipeline.

 ```json5
 {
@@ -58,22 +80,55 @@ OpenClaw currently ships this bundled Mistral catalog:
 }
 ```

-## Adjustable reasoning (`mistral-small-latest`)
+<Tip>
+The media transcription path uses `/v1/audio/transcriptions`. The default audio model for Mistral is `voxtral-mini-latest`.
+</Tip>

-`mistral/mistral-small-latest` maps to Mistral Small 4 and supports [adjustable reasoning](https://docs.mistral.ai/capabilities/reasoning/adjustable) on the Chat Completions API via `reasoning_effort` (`none` minimizes extra thinking in the output; `high` surfaces full thinking traces before the final answer).
+## Advanced configuration

-OpenClaw maps the session **thinking** level to Mistral’s API:
+<AccordionGroup>
+  <Accordion title="Adjustable reasoning (mistral-small-latest)">
+    `mistral/mistral-small-latest` maps to Mistral Small 4 and supports [adjustable reasoning](https://docs.mistral.ai/capabilities/reasoning/adjustable) on the Chat Completions API via `reasoning_effort` (`none` minimizes extra thinking in the output; `high` surfaces full thinking traces before the final answer).

- **off** / **minimal** → `none`
- **low** / **medium** / **high** / **xhigh** / **adaptive** → `high`
+    OpenClaw maps the session **thinking** level to Mistral's API:

-Other bundled Mistral catalog models do not use this parameter; keep using `magistral-*` models when you want Mistral’s native reasoning-first behavior.
+    | OpenClaw thinking level                                    | Mistral `reasoning_effort` |
+    | ---------------------------------------------------------- | -------------------------- |
+    | **off** / **minimal**                                      | `none`                     |
+    | **low** / **medium** / **high** / **xhigh** / **adaptive** | `high`                     |

-## Notes
+    <Note>
+    Other bundled Mistral catalog models do not use this parameter. Keep using `magistral-*` models when you want Mistral's native reasoning-first behavior.
+    </Note>

- Mistral auth uses `MISTRAL_API_KEY`.
- Provider base URL defaults to `https://api.mistral.ai/v1`.
- Onboarding default model is `mistral/mistral-large-latest`.
- Media-understanding default audio model for Mistral is `voxtral-mini-latest`.
- Media transcription path uses `/v1/audio/transcriptions`.
- Memory embeddings path uses `/v1/embeddings` (default model: `mistral-embed`).
+  </Accordion>
+
+  <Accordion title="Memory embeddings">
+    Mistral can serve memory embeddings via `/v1/embeddings` (default model: `mistral-embed`).
+
+    ```json5
+    {
+      memorySearch: { provider: "mistral" },
+    }
+    ```
+
+  </Accordion>
+
+  <Accordion title="Auth and base URL">
+    - Mistral auth uses `MISTRAL_API_KEY`.
+    - Provider base URL defaults to `https://api.mistral.ai/v1`.
+    - Onboarding default model is `mistral/mistral-large-latest`.
+    - Mistral uses Bearer auth with your API key.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Media understanding" href="/tools/media-understanding" icon="microphone">
+    Audio transcription setup and provider selection.
+  </Card>
+</CardGroup>
--- a/docs/providers/moonshot.md
+++ b/docs/providers/moonshot.md
@@ -13,138 +13,215 @@ Moonshot provides the Kimi API with OpenAI-compatible endpoints. Configure the
 provider and set the default model to `moonshot/kimi-k2.5`, or use
 Kimi Coding with `kimi/kimi-code`.

-Current Kimi K2 model IDs:
+<Warning>
+Moonshot and Kimi Coding are **separate providers**. Keys are not interchangeable, endpoints differ, and model refs differ (`moonshot/...` vs `kimi/...`).
+</Warning>
+
+## Built-in model catalog

 [//]: # "moonshot-kimi-k2-ids:start"

- `kimi-k2.5`
- `kimi-k2-thinking`
- `kimi-k2-thinking-turbo`
- `kimi-k2-turbo`
+| Model ref                         | Name                   | Reasoning | Input       | Context | Max output |
+| --------------------------------- | ---------------------- | --------- | ----------- | ------- | ---------- |
+| `moonshot/kimi-k2.5`              | Kimi K2.5              | No        | text, image | 262,144 | 262,144    |
+| `moonshot/kimi-k2-thinking`       | Kimi K2 Thinking       | Yes       | text        | 262,144 | 262,144    |
+| `moonshot/kimi-k2-thinking-turbo` | Kimi K2 Thinking Turbo | Yes       | text        | 262,144 | 262,144    |
+| `moonshot/kimi-k2-turbo`          | Kimi K2 Turbo          | No        | text        | 256,000 | 16,384     |

 [//]: # "moonshot-kimi-k2-ids:end"

-```bash
-openclaw onboard --auth-choice moonshot-api-key
-# or
-openclaw onboard --auth-choice moonshot-api-key-cn
-```
+## Getting started

-Kimi Coding:
+Choose your provider and follow the setup steps.

-```bash
-openclaw onboard --auth-choice kimi-code-api-key
-```
+<Tabs>
+  <Tab title="Moonshot API">
+    **Best for:** Kimi K2 models via the Moonshot Open Platform.

-Note: Moonshot and Kimi Coding are separate providers. Keys are not interchangeable, endpoints differ, and model refs differ (Moonshot uses `moonshot/...`, Kimi Coding uses `kimi/...`).
+    <Steps>
+      <Step title="Choose your endpoint region">
+        | Auth choice            | Endpoint                       | Region        |
+        | ---------------------- | ------------------------------ | ------------- |
+        | `moonshot-api-key`     | `https://api.moonshot.ai/v1`   | International |
+        | `moonshot-api-key-cn`  | `https://api.moonshot.cn/v1`   | China         |
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice moonshot-api-key
+        ```

-Kimi web search uses the Moonshot plugin too:
+        Or for the China endpoint:

-```bash
-openclaw configure --section web
-```
+        ```bash
+        openclaw onboard --auth-choice moonshot-api-key-cn
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "moonshot/kimi-k2.5" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider moonshot
+        ```
+      </Step>
+    </Steps>

-Choose **Kimi** in the web-search section to store
-`plugins.entries.moonshot.config.webSearch.*`.
+    ### Config example

-## Config snippet (Moonshot API)
-
-```json5
-{
-  env: { MOONSHOT_API_KEY: "sk-..." },
-  agents: {
-    defaults: {
-      model: { primary: "moonshot/kimi-k2.5" },
+    ```json5
+    {
+      env: { MOONSHOT_API_KEY: "sk-..." },
+      agents: {
+        defaults: {
+          model: { primary: "moonshot/kimi-k2.5" },
+          models: {
+            // moonshot-kimi-k2-aliases:start
+            "moonshot/kimi-k2.5": { alias: "Kimi K2.5" },
+            "moonshot/kimi-k2-thinking": { alias: "Kimi K2 Thinking" },
+            "moonshot/kimi-k2-thinking-turbo": { alias: "Kimi K2 Thinking Turbo" },
+            "moonshot/kimi-k2-turbo": { alias: "Kimi K2 Turbo" },
+            // moonshot-kimi-k2-aliases:end
+          },
+        },
+      },
      models: {
-        // moonshot-kimi-k2-aliases:start
-        "moonshot/kimi-k2.5": { alias: "Kimi K2.5" },
-        "moonshot/kimi-k2-thinking": { alias: "Kimi K2 Thinking" },
-        "moonshot/kimi-k2-thinking-turbo": { alias: "Kimi K2 Thinking Turbo" },
-        "moonshot/kimi-k2-turbo": { alias: "Kimi K2 Turbo" },
-        // moonshot-kimi-k2-aliases:end
+        mode: "merge",
+        providers: {
+          moonshot: {
+            baseUrl: "https://api.moonshot.ai/v1",
+            apiKey: "${MOONSHOT_API_KEY}",
+            api: "openai-completions",
+            models: [
+              // moonshot-kimi-k2-models:start
+              {
+                id: "kimi-k2.5",
+                name: "Kimi K2.5",
+                reasoning: false,
+                input: ["text", "image"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 262144,
+              },
+              {
+                id: "kimi-k2-thinking",
+                name: "Kimi K2 Thinking",
+                reasoning: true,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 262144,
+              },
+              {
+                id: "kimi-k2-thinking-turbo",
+                name: "Kimi K2 Thinking Turbo",
+                reasoning: true,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 262144,
+              },
+              {
+                id: "kimi-k2-turbo",
+                name: "Kimi K2 Turbo",
+                reasoning: false,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 256000,
+                maxTokens: 16384,
+              },
+              // moonshot-kimi-k2-models:end
+            ],
+          },
+        },
      },
-    },
-  },
-  models: {
-    mode: "merge",
-    providers: {
-      moonshot: {
-        baseUrl: "https://api.moonshot.ai/v1",
-        apiKey: "${MOONSHOT_API_KEY}",
-        api: "openai-completions",
-        models: [
-          // moonshot-kimi-k2-models:start
-          {
-            id: "kimi-k2.5",
-            name: "Kimi K2.5",
-            reasoning: false,
-            input: ["text", "image"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 262144,
-          },
-          {
-            id: "kimi-k2-thinking",
-            name: "Kimi K2 Thinking",
-            reasoning: true,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 262144,
-          },
-          {
-            id: "kimi-k2-thinking-turbo",
-            name: "Kimi K2 Thinking Turbo",
-            reasoning: true,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 262144,
-          },
-          {
-            id: "kimi-k2-turbo",
-            name: "Kimi K2 Turbo",
-            reasoning: false,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 256000,
-            maxTokens: 16384,
-          },
-          // moonshot-kimi-k2-models:end
-        ],
-      },
-    },
-  },
-}
-```
+    }
+    ```

-## Kimi Coding
+  </Tab>

-```json5
-{
-  env: { KIMI_API_KEY: "sk-..." },
-  agents: {
-    defaults: {
-      model: { primary: "kimi/kimi-code" },
-      models: {
-        "kimi/kimi-code": { alias: "Kimi" },
+  <Tab title="Kimi Coding">
+    **Best for:** code-focused tasks via the Kimi Coding endpoint.
+
+    <Note>
+    Kimi Coding uses a different API key and provider prefix (`kimi/...`) than Moonshot (`moonshot/...`). Legacy model ref `kimi/k2p5` remains accepted as a compatibility id.
+    </Note>
+
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice kimi-code-api-key
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "kimi/kimi-code" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider kimi
+        ```
+      </Step>
+    </Steps>
+
+    ### Config example
+
+    ```json5
+    {
+      env: { KIMI_API_KEY: "sk-..." },
+      agents: {
+        defaults: {
+          model: { primary: "kimi/kimi-code" },
+          models: {
+            "kimi/kimi-code": { alias: "Kimi" },
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+
+  </Tab>
+</Tabs>

 ## Kimi web search

 OpenClaw also ships **Kimi** as a `web_search` provider, backed by Moonshot web
 search.

-Interactive setup can prompt for:
+<Steps>
+  <Step title="Run interactive web search setup">
+    ```bash
+    openclaw configure --section web
+    ```

- the Moonshot API region:
-  - `https://api.moonshot.ai/v1`
-  - `https://api.moonshot.cn/v1`
- the default Kimi web-search model (defaults to `kimi-k2.5`)
+    Choose **Kimi** in the web-search section to store
+    `plugins.entries.moonshot.config.webSearch.*`.
+
+  </Step>
+  <Step title="Configure the web search region and model">
+    Interactive setup prompts for:
+
+    | Setting             | Options                                                              |
+    | ------------------- | -------------------------------------------------------------------- |
+    | API region          | `https://api.moonshot.ai/v1` (international) or `https://api.moonshot.cn/v1` (China) |
+    | Web search model    | Defaults to `kimi-k2.5`                                             |
+
+  </Step>
+</Steps>

 Config lives under `plugins.entries.moonshot.config.webSearch`:

@@ -173,52 +250,82 @@ Config lives under `plugins.entries.moonshot.config.webSearch`:
 }
 ```

-## Notes
+## Advanced

- Moonshot model refs use `moonshot/<modelId>`. Kimi Coding model refs use `kimi/<modelId>`.
- Current Kimi Coding default model ref is `kimi/kimi-code`. Legacy `kimi/k2p5` remains accepted as a compatibility model id.
- Kimi web search uses `KIMI_API_KEY` or `MOONSHOT_API_KEY`, and defaults to `https://api.moonshot.ai/v1` with model `kimi-k2.5`.
- Native Moonshot endpoints (`https://api.moonshot.ai/v1` and
-  `https://api.moonshot.cn/v1`) advertise streaming usage compatibility on the
-  shared `openai-completions` transport. OpenClaw now keys that off endpoint
-  capabilities, so compatible custom provider ids targeting the same native
-  Moonshot hosts inherit the same streaming-usage behavior.
- Override pricing and context metadata in `models.providers` if needed.
- If Moonshot publishes different context limits for a model, adjust
-  `contextWindow` accordingly.
- Use `https://api.moonshot.ai/v1` for the international endpoint, and `https://api.moonshot.cn/v1` for the China endpoint.
- Onboarding choices:
-  - `moonshot-api-key` for `https://api.moonshot.ai/v1`
-  - `moonshot-api-key-cn` for `https://api.moonshot.cn/v1`
+<AccordionGroup>
+  <Accordion title="Native thinking mode">
+    Moonshot Kimi supports binary native thinking:

-## Native thinking mode (Moonshot)
+    - `thinking: { type: "enabled" }`
+    - `thinking: { type: "disabled" }`

-Moonshot Kimi supports binary native thinking:
+    Configure it per model via `agents.defaults.models.<provider/model>.params`:

- `thinking: { type: "enabled" }`
- `thinking: { type: "disabled" }`
-
-Configure it per model via `agents.defaults.models.<provider/model>.params`:
-
-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "moonshot/kimi-k2.5": {
-          params: {
-            thinking: { type: "disabled" },
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "moonshot/kimi-k2.5": {
+              params: {
+                thinking: { type: "disabled" },
+              },
+            },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-OpenClaw also maps runtime `/think` levels for Moonshot:
+    OpenClaw also maps runtime `/think` levels for Moonshot:

- `/think off` -> `thinking.type=disabled`
- any non-off thinking level -> `thinking.type=enabled`
+    | `/think` level       | Moonshot behavior          |
+    | -------------------- | -------------------------- |
+    | `/think off`         | `thinking.type=disabled`   |
+    | Any non-off level    | `thinking.type=enabled`    |

-When Moonshot thinking is enabled, `tool_choice` must be `auto` or `none`. OpenClaw normalizes incompatible `tool_choice` values to `auto` for compatibility.
+    <Warning>
+    When Moonshot thinking is enabled, `tool_choice` must be `auto` or `none`. OpenClaw normalizes incompatible `tool_choice` values to `auto` for compatibility.
+    </Warning>
+
+  </Accordion>
+
+  <Accordion title="Streaming usage compatibility">
+    Native Moonshot endpoints (`https://api.moonshot.ai/v1` and
+    `https://api.moonshot.cn/v1`) advertise streaming usage compatibility on the
+    shared `openai-completions` transport. OpenClaw keys that off endpoint
+    capabilities, so compatible custom provider ids targeting the same native
+    Moonshot hosts inherit the same streaming-usage behavior.
+  </Accordion>
+
+  <Accordion title="Endpoint and model ref reference">
+    | Provider    | Model ref prefix | Endpoint                     | Auth env var                         |
+    | ----------- | ---------------- | ---------------------------- | ------------------------------------ |
+    | Moonshot    | `moonshot/`      | `https://api.moonshot.ai/v1` | `MOONSHOT_API_KEY`                   |
+    | Moonshot CN | `moonshot/`      | `https://api.moonshot.cn/v1` | `MOONSHOT_API_KEY`                   |
+    | Kimi Coding | `kimi/`          | Kimi Coding endpoint         | `KIMI_API_KEY`                       |
+    | Web search  | N/A              | Same as Moonshot API region  | `KIMI_API_KEY` or `MOONSHOT_API_KEY` |
+
+    - Kimi web search uses `KIMI_API_KEY` or `MOONSHOT_API_KEY`, and defaults to `https://api.moonshot.ai/v1` with model `kimi-k2.5`.
+    - Override pricing and context metadata in `models.providers` if needed.
+    - If Moonshot publishes different context limits for a model, adjust `contextWindow` accordingly.
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Web search" href="/tools/web-search" icon="magnifying-glass">
+    Configuring web search providers including Kimi.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema for providers, models, and plugins.
+  </Card>
+  <Card title="Moonshot Open Platform" href="https://platform.moonshot.ai" icon="globe">
+    Moonshot API key management and documentation.
+  </Card>
+</CardGroup>
--- a/docs/providers/nvidia.md
+++ b/docs/providers/nvidia.md
@@ -8,21 +8,35 @@ title: "NVIDIA"

 # NVIDIA

-NVIDIA provides an OpenAI-compatible API at `https://integrate.api.nvidia.com/v1` for open models for free. Authenticate with an API key from [build.nvidia.com](https://build.nvidia.com/settings/api-keys).
+NVIDIA provides a free OpenAI-compatible API for open models at
+`https://integrate.api.nvidia.com/v1`. Authenticate with an API key from
+[build.nvidia.com](https://build.nvidia.com/settings/api-keys).

-## CLI setup
+## Getting started

-Export the key once, then run onboarding and set an NVIDIA model:
+<Steps>
+  <Step title="Get your API key">
+    Create an API key at [build.nvidia.com](https://build.nvidia.com/settings/api-keys).
+  </Step>
+  <Step title="Export the key and run onboarding">
+    ```bash
+    export NVIDIA_API_KEY="nvapi-..."
+    openclaw onboard --auth-choice skip
+    ```
+  </Step>
+  <Step title="Set an NVIDIA model">
+    ```bash
+    openclaw models set nvidia/nvidia/nemotron-3-super-120b-a12b
+    ```
+  </Step>
+</Steps>

-```bash
-export NVIDIA_API_KEY="nvapi-..."
-openclaw onboard --auth-choice skip
-openclaw models set nvidia/nvidia/nemotron-3-super-120b-a12b
-```
+<Warning>
+If you pass `--token` instead of the env var, the value lands in shell history and
+`ps` output. Prefer the `NVIDIA_API_KEY` environment variable when possible.
+</Warning>

-If you still pass `--token`, remember it lands in shell history and `ps` output; prefer the env var when possible.
-
-## Config snippet
+## Config example

 ```json5
 {
@@ -43,7 +57,7 @@ If you still pass `--token`, remember it lands in shell history and `ps` output;
 }
 ```

-## Model IDs
+## Built-in catalog

 | Model ref                                  | Name                         | Context | Max output |
 | ------------------------------------------ | ---------------------------- | ------- | ---------- |
@@ -52,8 +66,38 @@ If you still pass `--token`, remember it lands in shell history and `ps` output;
 | `nvidia/minimaxai/minimax-m2.5`            | Minimax M2.5                 | 196,608 | 8,192      |
 | `nvidia/z-ai/glm5`                         | GLM 5                        | 202,752 | 8,192      |

-## Notes
+## Advanced notes

- OpenAI-compatible `/v1` endpoint; use an API key from [build.nvidia.com](https://build.nvidia.com/).
- Provider auto-enables when `NVIDIA_API_KEY` is set.
- The bundled catalog is static; costs default to `0` in source.
+<AccordionGroup>
+  <Accordion title="Auto-enable behavior">
+    The provider auto-enables when the `NVIDIA_API_KEY` environment variable is set.
+    No explicit provider config is required beyond the key.
+  </Accordion>
+
+  <Accordion title="Catalog and pricing">
+    The bundled catalog is static. Costs default to `0` in source since NVIDIA
+    currently offers free API access for the listed models.
+  </Accordion>
+
+  <Accordion title="OpenAI-compatible endpoint">
+    NVIDIA uses the standard `/v1` completions endpoint. Any OpenAI-compatible
+    tooling should work out of the box with the NVIDIA base URL.
+  </Accordion>
+</AccordionGroup>
+
+<Tip>
+NVIDIA models are currently free to use. Check
+[build.nvidia.com](https://build.nvidia.com/) for the latest availability and
+rate-limit details.
+</Tip>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config reference for agents, models, and providers.
+  </Card>
+</CardGroup>
--- a/docs/providers/ollama.md
+++ b/docs/providers/ollama.md
@@ -14,122 +14,154 @@ Ollama is a local LLM runtime that makes it easy to run open-source models on yo
 **Remote Ollama users**: Do not use the `/v1` OpenAI-compatible URL (`http://host:11434/v1`) with OpenClaw. This breaks tool calling and models may output raw tool JSON as plain text. Use the native Ollama API URL instead: `baseUrl: "http://host:11434"` (no `/v1`).
 </Warning>

-## Quick start
+## Getting started

-### Onboarding (recommended)
+Choose your preferred setup method and mode.

-The fastest way to set up Ollama is through onboarding:
+<Tabs>
+  <Tab title="Onboarding (recommended)">
+    **Best for:** fastest path to a working Ollama setup with automatic model discovery.

-```bash
-openclaw onboard
-```
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard
+        ```

-Select **Ollama** from the provider list. Onboarding will:
+        Select **Ollama** from the provider list.
+      </Step>
+      <Step title="Choose your mode">
+        - **Cloud + Local** — cloud-hosted models and local models together
+        - **Local** — local models only

-1. Ask for the Ollama base URL where your instance can be reached (default `http://127.0.0.1:11434`).
-2. Let you choose **Cloud + Local** (cloud models and local models) or **Local** (local models only).
-3. Open a browser sign-in flow if you choose **Cloud + Local** and are not signed in to ollama.com.
-4. Discover available models and suggest defaults.
-5. Auto-pull the selected model if it is not available locally.
+        If you choose **Cloud + Local** and are not signed in to ollama.com, onboarding opens a browser sign-in flow.
+      </Step>
+      <Step title="Select a model">
+        Onboarding discovers available models and suggests defaults. It auto-pulls the selected model if it is not available locally.
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider ollama
+        ```
+      </Step>
+    </Steps>

-Non-interactive mode is also supported:
+    ### Non-interactive mode

-```bash
-openclaw onboard --non-interactive \
-  --auth-choice ollama \
-  --accept-risk
-```
+    ```bash
+    openclaw onboard --non-interactive \
+      --auth-choice ollama \
+      --accept-risk
+    ```

-Optionally specify a custom base URL or model:
+    Optionally specify a custom base URL or model:

-```bash
-openclaw onboard --non-interactive \
-  --auth-choice ollama \
-  --custom-base-url "http://ollama-host:11434" \
-  --custom-model-id "qwen3.5:27b" \
-  --accept-risk
-```
+    ```bash
+    openclaw onboard --non-interactive \
+      --auth-choice ollama \
+      --custom-base-url "http://ollama-host:11434" \
+      --custom-model-id "qwen3.5:27b" \
+      --accept-risk
+    ```

-### Manual setup
+  </Tab>

-1. Install Ollama: [https://ollama.com/download](https://ollama.com/download)
+  <Tab title="Manual setup">
+    **Best for:** full control over installation, model pulls, and config.

-2. Pull a local model if you want local inference:
+    <Steps>
+      <Step title="Install Ollama">
+        Download from [ollama.com/download](https://ollama.com/download).
+      </Step>
+      <Step title="Pull a local model">
+        ```bash
+        ollama pull gemma4
+        # or
+        ollama pull gpt-oss:20b
+        # or
+        ollama pull llama3.3
+        ```
+      </Step>
+      <Step title="Sign in for cloud models (optional)">
+        If you want cloud models too:

-```bash
-ollama pull gemma4
-# or
-ollama pull gpt-oss:20b
-# or
-ollama pull llama3.3
-```
+        ```bash
+        ollama signin
+        ```
+      </Step>
+      <Step title="Enable Ollama for OpenClaw">
+        Set any value for the API key (Ollama does not require a real key):

-3. If you want cloud models too, sign in:
+        ```bash
+        # Set environment variable
+        export OLLAMA_API_KEY="ollama-local"

-```bash
-ollama signin
-```
+        # Or configure in your config file
+        openclaw config set models.providers.ollama.apiKey "ollama-local"
+        ```
+      </Step>
+      <Step title="Inspect and set your model">
+        ```bash
+        openclaw models list
+        openclaw models set ollama/gemma4
+        ```

-4. Run onboarding and choose `Ollama`:
+        Or set the default in config:

-```bash
-openclaw onboard
-```
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "ollama/gemma4" },
+            },
+          },
+        }
+        ```
+      </Step>
+    </Steps>

- `Local`: local models only
- `Cloud + Local`: local models plus cloud models
- Cloud models such as `kimi-k2.5:cloud`, `minimax-m2.7:cloud`, and `glm-5.1:cloud` do **not** require a local `ollama pull`
+  </Tab>
+</Tabs>

-OpenClaw currently suggests:
+## Cloud models

- local default: `gemma4`
- cloud defaults: `kimi-k2.5:cloud`, `minimax-m2.7:cloud`, `glm-5.1:cloud`
+<Tabs>
+  <Tab title="Cloud + Local">
+    **Cloud + Local** mode lets you run cloud-hosted models alongside your local models. Examples include `kimi-k2.5:cloud`, `minimax-m2.7:cloud`, and `glm-5.1:cloud`; these do **not** require a local `ollama pull`.

-5. If you prefer manual setup, enable Ollama for OpenClaw directly (any value works; Ollama doesn't require a real key):
+    Select **Cloud + Local** mode during setup. The wizard checks whether you are signed in and opens a browser sign-in flow when needed. If authentication cannot be verified, the wizard falls back to local model defaults.

-```bash
-# Set environment variable
-export OLLAMA_API_KEY="ollama-local"
+    You can also sign in directly at [ollama.com/signin](https://ollama.com/signin).

-# Or configure in your config file
-openclaw config set models.providers.ollama.apiKey "ollama-local"
-```
+    OpenClaw currently suggests these cloud defaults: `kimi-k2.5:cloud`, `minimax-m2.7:cloud`, `glm-5.1:cloud`.

-6. Inspect or switch models:
+  </Tab>

-```bash
-openclaw models list
-openclaw models set ollama/gemma4
-```
+  <Tab title="Local only">
+    In local-only mode, OpenClaw discovers models from the local Ollama instance. No cloud sign-in is needed.

-7. Or set the default in config:
+    OpenClaw currently suggests `gemma4` as the local default.

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "ollama/gemma4" },
-    },
-  },
-}
-```
+  </Tab>
+</Tabs>

 ## Model discovery (implicit provider)

-When you set `OLLAMA_API_KEY` (or an auth profile) and **do not** define `models.providers.ollama`, OpenClaw discovers models from the local Ollama instance at `http://127.0.0.1:11434`:
+When you set `OLLAMA_API_KEY` (or an auth profile) and **do not** define `models.providers.ollama`, OpenClaw discovers models from the local Ollama instance at `http://127.0.0.1:11434`.

- Queries `/api/tags`
- Uses best-effort `/api/show` lookups to read `contextWindow` and detect capabilities (including vision) when available
- Models with a `vision` capability reported by `/api/show` are marked as image-capable (`input: ["text", "image"]`), so OpenClaw auto-injects images into the prompt for those models
- Marks `reasoning` with a model-name heuristic (`r1`, `reasoning`, `think`)
- Sets `maxTokens` to the default Ollama max-token cap used by OpenClaw
- Sets all costs to `0`
+| Behavior             | Detail                                                                                                                                                              |
+| -------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| Catalog query        | Queries `/api/tags`                                                                                                                                                 |
+| Capability detection | Uses best-effort `/api/show` lookups to read `contextWindow` and detect capabilities (including vision)                                                             |
+| Vision models        | Models with a `vision` capability reported by `/api/show` are marked as image-capable (`input: ["text", "image"]`), so OpenClaw auto-injects images into the prompt |
+| Reasoning detection  | Marks `reasoning` with a model-name heuristic (`r1`, `reasoning`, `think`)                                                                                          |
+| Token limits         | Sets `maxTokens` to the default Ollama max-token cap used by OpenClaw                                                                                               |
+| Costs                | Sets all costs to `0`                                                                                                                                               |

 This avoids manual model entries while keeping the catalog aligned with the local Ollama instance.

-To see what models are available:
-
 ```bash
+# See what models are available
 ollama list
 openclaw models list
 ```
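
 The reasoning heuristic in the discovery table above can be sketched in shell. This is a minimal illustration, not OpenClaw's implementation: the function name `is_reasoning_model`, the case-insensitive comparison, and the plain substring match are all assumptions.

 ```bash
 # Hypothetical helper: flag a model as reasoning-capable from its name.
 # Assumption: a case-insensitive substring match on r1 / reasoning / think.
 is_reasoning_model() {
   # Lowercase the name so "DeepSeek-R1" and "deepseek-r1" are treated alike.
   name=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
   case "$name" in
     *r1*|*reasoning*|*think*) return 0 ;;
     *) return 1 ;;
   esac
 }

 # Classify two model names taken from this page.
 for model in "deepseek-r1:32b" "llama3.3"; do
   if is_reasoning_model "$model"; then
     echo "$model: reasoning"
   else
     echo "$model: standard"
   fi
 done
 ```

 A substring match like this can flag any name that happens to contain `r1`. If the automatic flag is wrong for a model, an explicit provider entry with `reasoning: false` is one way to override it.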
@@ -142,74 +174,79 @@ ollama pull mistral

 The new model will be automatically discovered and available to use.

-If you set `models.providers.ollama` explicitly, auto-discovery is skipped and you must define models manually (see below).
+<Note>
+If you set `models.providers.ollama` explicitly, auto-discovery is skipped and you must define models manually. See the explicit config section below.
+</Note>

 ## Configuration

-### Basic setup (implicit discovery)
+<Tabs>
+  <Tab title="Basic (implicit discovery)">
+    The simplest way to enable Ollama is via environment variable:

-The simplest way to enable Ollama is via environment variable:
+    ```bash
+    export OLLAMA_API_KEY="ollama-local"
+    ```

-```bash
-export OLLAMA_API_KEY="ollama-local"
-```
+    <Tip>
+    If `OLLAMA_API_KEY` is set, you can omit `apiKey` in the provider entry and OpenClaw will fill it for availability checks.
+    </Tip>

-### Explicit setup (manual models)
+  </Tab>

-Use explicit config when:
+  <Tab title="Explicit (manual models)">
+    Use explicit config when Ollama runs on another host/port, you want to force specific context windows or model lists, or you want fully manual model definitions.

- Ollama runs on another host/port.
- You want to force specific context windows or model lists.
- You want fully manual model definitions.
-
-```json5
-{
-  models: {
-    providers: {
-      ollama: {
-        baseUrl: "http://ollama-host:11434",
-        apiKey: "ollama-local",
-        api: "ollama",
-        models: [
-          {
-            id: "gpt-oss:20b",
-            name: "GPT-OSS 20B",
-            reasoning: false,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 8192,
-            maxTokens: 8192 * 10
+    ```json5
+    {
+      models: {
+        providers: {
+          ollama: {
+            baseUrl: "http://ollama-host:11434",
+            apiKey: "ollama-local",
+            api: "ollama",
+            models: [
+              {
+                id: "gpt-oss:20b",
+                name: "GPT-OSS 20B",
+                reasoning: false,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 8192,
+                maxTokens: 81920
+              }
+            ]
          }
-        ]
+        }
      }
    }
-  }
-}
-```
+    ```

-If `OLLAMA_API_KEY` is set, you can omit `apiKey` in the provider entry and OpenClaw will fill it for availability checks.
+  </Tab>

-### Custom base URL (explicit config)
+  <Tab title="Custom base URL">
+    If Ollama is running on a different host or port, set `baseUrl` explicitly. Explicit config disables auto-discovery, so define models manually:

-If Ollama is running on a different host or port (explicit config disables auto-discovery, so define models manually):
-
-```json5
-{
-  models: {
-    providers: {
-      ollama: {
-        apiKey: "ollama-local",
-        baseUrl: "http://ollama-host:11434", // No /v1 - use native Ollama API URL
-        api: "ollama", // Set explicitly to guarantee native tool-calling behavior
+    ```json5
+    {
+      models: {
+        providers: {
+          ollama: {
+            apiKey: "ollama-local",
+            baseUrl: "http://ollama-host:11434", // No /v1 - use native Ollama API URL
+            api: "ollama", // Set explicitly to guarantee native tool-calling behavior
+          },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-<Warning>
-Do not add `/v1` to the URL. The `/v1` path uses OpenAI-compatible mode, where tool calling is not reliable. Use the base Ollama URL without a path suffix.
-</Warning>
+    <Warning>
+    Do not add `/v1` to the URL. The `/v1` path uses OpenAI-compatible mode, where tool calling is not reliable. Use the base Ollama URL without a path suffix.
+    </Warning>
+
+  </Tab>
+</Tabs>

 ### Model selection

@@ -228,26 +265,17 @@ Once configured, all your Ollama models are available:
 }
 ```

-## Cloud models
-
-Cloud models let you run cloud-hosted models (for example `kimi-k2.5:cloud`, `minimax-m2.7:cloud`, `glm-5.1:cloud`) alongside your local models.
-
-To use cloud models, select **Cloud + Local** mode during setup. The wizard checks whether you are signed in and opens a browser sign-in flow when needed. If authentication cannot be verified, the wizard falls back to local model defaults.
-
-You can also sign in directly at [ollama.com/signin](https://ollama.com/signin).
-
 ## Ollama Web Search

-OpenClaw also supports **Ollama Web Search** as a bundled `web_search`
-provider.
+OpenClaw supports **Ollama Web Search** as a bundled `web_search` provider.

- It uses your configured Ollama host (`models.providers.ollama.baseUrl` when
-  set, otherwise `http://127.0.0.1:11434`).
- It is key-free.
- It requires Ollama to be running and signed in with `ollama signin`.
+| Property    | Detail                                                                                                            |
+| ----------- | ----------------------------------------------------------------------------------------------------------------- |
+| Host        | Uses your configured Ollama host (`models.providers.ollama.baseUrl` when set, otherwise `http://127.0.0.1:11434`) |
+| Auth        | Key-free                                                                                                          |
+| Requirement | Ollama must be running and signed in with `ollama signin`                                                         |

-Choose **Ollama Web Search** during `openclaw onboard` or
-`openclaw configure --section web`, or set:
+Choose **Ollama Web Search** during `openclaw onboard` or `openclaw configure --section web`, or set:

 ```json5
 {
@@ -261,120 +289,193 @@ Choose **Ollama Web Search** during `openclaw onboard` or
 }
 ```

+<Note>
 For the full setup and behavior details, see [Ollama Web Search](/tools/ollama-search).
+</Note>
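
 The host fallback described in the table above can be sketched in shell. The helper name `resolve_ollama_host` is hypothetical and only illustrates the "configured `baseUrl`, otherwise local default" rule; it does not read OpenClaw's config.

 ```bash
 # Hypothetical helper: pick the Ollama host for web search.
 # $1 is the value of models.providers.ollama.baseUrl, or empty when unset.
 resolve_ollama_host() {
   configured_base_url="$1"
   if [ -n "$configured_base_url" ]; then
     printf '%s\n' "$configured_base_url"
   else
     # Default local Ollama address documented on this page.
     printf '%s\n' "http://127.0.0.1:11434"
   fi
 }

 resolve_ollama_host "http://ollama-host:11434"
 resolve_ollama_host ""
 ```

 The resolved value can be passed to a reachability check such as `curl "$(resolve_ollama_host "")/api/tags"` before enabling the provider.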

-## Advanced
+## Advanced configuration

-### Reasoning models
+<AccordionGroup>
+  <Accordion title="Legacy OpenAI-compatible mode">
+    <Warning>
+    **Tool calling is not reliable in OpenAI-compatible mode.** Use this mode only if you need OpenAI format for a proxy and do not depend on native tool calling behavior.
+    </Warning>

-OpenClaw treats models with names such as `deepseek-r1`, `reasoning`, or `think` as reasoning-capable by default:
+    If you need to use the OpenAI-compatible endpoint instead (for example, behind a proxy that only supports OpenAI format), set `api: "openai-completions"` explicitly:

-```bash
-ollama pull deepseek-r1:32b
-```
-
-### Model Costs
-
-Ollama is free and runs locally, so all model costs are set to $0.
-
-### Streaming Configuration
-
-OpenClaw's Ollama integration uses the **native Ollama API** (`/api/chat`) by default, which fully supports streaming and tool calling simultaneously. No special configuration is needed.
-
-#### Legacy OpenAI-Compatible Mode
-
-<Warning>
-**Tool calling is not reliable in OpenAI-compatible mode.** Use this mode only if you need OpenAI format for a proxy and do not depend on native tool calling behavior.
-</Warning>
-
-If you need to use the OpenAI-compatible endpoint instead (e.g., behind a proxy that only supports OpenAI format), set `api: "openai-completions"` explicitly:
-
-```json5
-{
-  models: {
-    providers: {
-      ollama: {
-        baseUrl: "http://ollama-host:11434/v1",
-        api: "openai-completions",
-        injectNumCtxForOpenAICompat: true, // default: true
-        apiKey: "ollama-local",
-        models: [...]
+    ```json5
+    {
+      models: {
+        providers: {
+          ollama: {
+            baseUrl: "http://ollama-host:11434/v1",
+            api: "openai-completions",
+            injectNumCtxForOpenAICompat: true, // default: true
+            apiKey: "ollama-local",
+            models: [...]
+          }
+        }
      }
    }
-  }
-}
-```
+    ```

-This mode may not support streaming + tool calling simultaneously. You may need to disable streaming with `params: { streaming: false }` in model config.
+    This mode may not support streaming and tool calling simultaneously. You may need to disable streaming with `params: { streaming: false }` in model config.

-When `api: "openai-completions"` is used with Ollama, OpenClaw injects `options.num_ctx` by default so Ollama does not silently fall back to a 4096 context window. If your proxy/upstream rejects unknown `options` fields, disable this behavior:
+    When `api: "openai-completions"` is used with Ollama, OpenClaw injects `options.num_ctx` by default so Ollama does not silently fall back to a 4096 context window. If your proxy/upstream rejects unknown `options` fields, disable this behavior:

-```json5
-{
-  models: {
-    providers: {
-      ollama: {
-        baseUrl: "http://ollama-host:11434/v1",
-        api: "openai-completions",
-        injectNumCtxForOpenAICompat: false,
-        apiKey: "ollama-local",
-        models: [...]
+    ```json5
+    {
+      models: {
+        providers: {
+          ollama: {
+            baseUrl: "http://ollama-host:11434/v1",
+            api: "openai-completions",
+            injectNumCtxForOpenAICompat: false,
+            apiKey: "ollama-local",
+            models: [...]
+          }
+        }
      }
    }
-  }
-}
-```
+    ```

-### Context windows
+  </Accordion>

-For auto-discovered models, OpenClaw uses the context window reported by Ollama when available, otherwise it falls back to the default Ollama context window used by OpenClaw. You can override `contextWindow` and `maxTokens` in explicit provider config.
+  <Accordion title="Context windows">
+    For auto-discovered models, OpenClaw uses the context window reported by Ollama when available, otherwise it falls back to the default Ollama context window used by OpenClaw.
+
+    You can override `contextWindow` and `maxTokens` in explicit provider config:
+
+    ```json5
+    {
+      models: {
+        providers: {
+          ollama: {
+            models: [
+              {
+                id: "llama3.3",
+                contextWindow: 131072,
+                maxTokens: 65536,
+              }
+            ]
+          }
+        }
+      }
+    }
+    ```
+
+  </Accordion>
+
+  <Accordion title="Reasoning models">
+    OpenClaw treats models whose names match its reasoning heuristic (for example `deepseek-r1`, or names containing `reasoning` or `think`) as reasoning-capable by default.
+
+    ```bash
+    ollama pull deepseek-r1:32b
+    ```
+
+    No additional configuration is needed; OpenClaw marks them automatically.
+
+  </Accordion>
+
+  <Accordion title="Model costs">
+    Ollama is free and runs locally, so all model costs are set to $0. This applies to both auto-discovered and manually defined models.
+  </Accordion>
+
+  <Accordion title="Memory embeddings">
+    The bundled Ollama plugin registers a memory embedding provider for
+    [memory search](/concepts/memory). It uses the configured Ollama base URL
+    and API key.
+
+    | Property      | Value               |
+    | ------------- | ------------------- |
+    | Default model | `nomic-embed-text`  |
+    | Auto-pull     | Yes — the embedding model is pulled automatically if not present locally |
+
+    To select Ollama as the memory search embedding provider:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          memorySearch: { provider: "ollama" },
+        },
+      },
+    }
+    ```
+
+  </Accordion>
+
+  <Accordion title="Streaming configuration">
+    OpenClaw's Ollama integration uses the **native Ollama API** (`/api/chat`) by default, which fully supports streaming and tool calling simultaneously. No special configuration is needed.
+
+    <Tip>
+    If you need to use the OpenAI-compatible endpoint, see the "Legacy OpenAI-compatible mode" section above. Streaming and tool calling may not work simultaneously in that mode.
+    </Tip>
+
+  </Accordion>
+</AccordionGroup>

 ## Troubleshooting

-### Ollama not detected
+<AccordionGroup>
+  <Accordion title="Ollama not detected">
+    Make sure Ollama is running, that `OLLAMA_API_KEY` (or an auth profile) is set, and that you did **not** define an explicit `models.providers.ollama` entry:

-Make sure Ollama is running and that you set `OLLAMA_API_KEY` (or an auth profile), and that you did **not** define an explicit `models.providers.ollama` entry:
+    ```bash
+    ollama serve
+    ```

-```bash
-ollama serve
-```
+    Verify that the API is accessible:

-And that the API is accessible:
+    ```bash
+    curl http://localhost:11434/api/tags
+    ```

-```bash
-curl http://localhost:11434/api/tags
-```
+  </Accordion>

-### No models available
+  <Accordion title="No models available">
+    If your model is not listed, either pull the model locally or define it explicitly in `models.providers.ollama`.

-If your model is not listed, either:
+    ```bash
+    ollama list  # See what's installed
+    ollama pull gemma4
+    ollama pull gpt-oss:20b
+    ollama pull llama3.3     # Or another model
+    ```

- Pull the model locally, or
- Define the model explicitly in `models.providers.ollama`.
+  </Accordion>

-To add models:
+  <Accordion title="Connection refused">
+    Check that Ollama is running on the correct port:

-```bash
-ollama list  # See what's installed
-ollama pull gemma4
-ollama pull gpt-oss:20b
-ollama pull llama3.3     # Or another model
-```
+    ```bash
+    # Check if Ollama is running
+    ps aux | grep ollama

-### Connection refused
+    # Or restart Ollama
+    ollama serve
+    ```

-Check that Ollama is running on the correct port:
+  </Accordion>
+</AccordionGroup>

-```bash
-# Check if Ollama is running
-ps aux | grep ollama
+<Note>
+More help: [Troubleshooting](/help/troubleshooting) and [FAQ](/help/faq).
+</Note>

-# Or restart Ollama
-ollama serve
-```
+## Related

-## See Also
-
- [Model Providers](/concepts/model-providers) - Overview of all providers
- [Model Selection](/concepts/models) - How to choose models
- [Configuration](/gateway/configuration) - Full config reference
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Model selection" href="/concepts/models" icon="brain">
+    How to choose and configure models.
+  </Card>
+  <Card title="Ollama Web Search" href="/tools/ollama-search" icon="magnifying-glass">
+    Full setup and behavior details for Ollama-powered web search.
+  </Card>
+  <Card title="Configuration" href="/gateway/configuration" icon="gear">
+    Full config reference.
+  </Card>
+</CardGroup>
--- a/docs/providers/openai.md
+++ b/docs/providers/openai.md
@@ -3,555 +3,542 @@ summary: "Use OpenAI via API keys or Codex subscription in OpenClaw"
 read_when:
  - You want to use OpenAI models in OpenClaw
  - You want Codex subscription auth instead of API keys
+  - You need stricter GPT-5 agent execution behavior
 title: "OpenAI"
 ---

 # OpenAI

-OpenAI provides developer APIs for GPT models. Codex supports **ChatGPT sign-in** for subscription
-access or **API key** sign-in for usage-based access. Codex cloud requires ChatGPT sign-in.
-OpenAI explicitly supports subscription OAuth usage in external tools/workflows like OpenClaw.
+OpenAI provides developer APIs for GPT models. OpenClaw supports two auth routes:

-## Default interaction style
+- **API key** — direct OpenAI Platform access with usage-based billing (`openai/*` models)
+- **Codex subscription** — ChatGPT/Codex sign-in with subscription access (`openai-codex/*` models)

-OpenClaw can add a small OpenAI-specific prompt overlay for both `openai/*` and
-`openai-codex/*` runs. By default, the overlay keeps the assistant warm,
-collaborative, concise, direct, and a little more emotionally expressive
-without replacing the base OpenClaw system prompt. The friendly overlay also
-permits the occasional emoji when it fits naturally, while keeping overall
-output concise.
+OpenAI explicitly supports subscription OAuth usage in external tools and workflows like OpenClaw.

-Config key:
+## Getting started

-`plugins.entries.openai.config.personality`
+Choose your preferred auth method and follow the setup steps.

-Allowed values:
+<Tabs>
+  <Tab title="API key (OpenAI Platform)">
+    **Best for:** direct API access and usage-based billing.

- `"friendly"`: default; enable the OpenAI-specific overlay.
- `"on"`: alias for `"friendly"`.
- `"off"`: disable the overlay and use the base OpenClaw prompt only.
+    <Steps>
+      <Step title="Get your API key">
+        Create or copy an API key from the [OpenAI Platform dashboard](https://platform.openai.com/api-keys).
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice openai-api-key
+        ```

-Scope:
+        Or pass the key directly:

- Applies to `openai/*` models.
- Applies to `openai-codex/*` models.
- Does not affect other providers.
+        ```bash
+        openclaw onboard --openai-api-key "$OPENAI_API_KEY"
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider openai
+        ```
+      </Step>
+    </Steps>

-This behavior is on by default. Keep `"friendly"` explicitly if you want that
-to survive future local config churn:
+    ### Route summary

-```json5
-{
-  plugins: {
-    entries: {
-      openai: {
-        config: {
-          personality: "friendly",
+    | Model ref | Route | Auth |
+    |-----------|-------|------|
+    | `openai/gpt-5.4` | Direct OpenAI Platform API | `OPENAI_API_KEY` |
+    | `openai/gpt-5.4-pro` | Direct OpenAI Platform API | `OPENAI_API_KEY` |
+
+    <Note>
+    ChatGPT/Codex sign-in is routed through `openai-codex/*`, not `openai/*`.
+    </Note>
+
+    ### Config example
+
+    ```json5
+    {
+      env: { OPENAI_API_KEY: "sk-..." },
+      agents: { defaults: { model: { primary: "openai/gpt-5.4" } } },
+    }
+    ```
+
+    <Warning>
+    OpenClaw does **not** expose `openai/gpt-5.3-codex-spark` on the direct API path. Live OpenAI API requests reject that model. Spark is Codex-only.
+    </Warning>
+
+  </Tab>
+
+  <Tab title="Codex subscription">
+    **Best for:** using your ChatGPT/Codex subscription instead of a separate API key. Codex cloud requires ChatGPT sign-in.
+
+    <Steps>
+      <Step title="Run Codex OAuth">
+        ```bash
+        openclaw onboard --auth-choice openai-codex
+        ```
+
+        Or run OAuth directly:
+
+        ```bash
+        openclaw models auth login --provider openai-codex
+        ```
+      </Step>
+      <Step title="Set the default model">
+        ```bash
+        openclaw config set agents.defaults.model.primary openai-codex/gpt-5.4
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider openai-codex
+        ```
+      </Step>
+    </Steps>
+
+    ### Route summary
+
+    | Model ref | Route | Auth |
+    |-----------|-------|------|
+    | `openai-codex/gpt-5.4` | ChatGPT/Codex OAuth | Codex sign-in |
+    | `openai-codex/gpt-5.3-codex-spark` | ChatGPT/Codex OAuth | Codex sign-in (entitlement-dependent) |
+
+    <Note>
+    This route is intentionally separate from `openai/gpt-5.4`. Use `openai/*` with an API key for direct Platform access, and `openai-codex/*` for Codex subscription access.
+    </Note>
+
+    ### Config example
+
+    ```json5
+    {
+      agents: { defaults: { model: { primary: "openai-codex/gpt-5.4" } } },
+    }
+    ```
+
+    <Tip>
+    If onboarding reuses an existing Codex CLI login, those credentials stay managed by Codex CLI. On expiry, OpenClaw re-reads the external Codex source first and writes the refreshed credential back to Codex storage.
+    </Tip>
+
+    ### Context window cap
+
+    OpenClaw treats model metadata and the runtime context cap as separate values.
+
+    For `openai-codex/gpt-5.4`:
+
+    - Native `contextWindow`: `1050000`
+    - Default runtime `contextTokens` cap: `272000`
+
+    The smaller default cap has better latency and quality characteristics in practice. Override it with `contextTokens`:
+
+    ```json5
+    {
+      models: {
+        providers: {
+          "openai-codex": {
+            models: [{ id: "gpt-5.4", contextTokens: 160000 }],
+          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-### Disable the OpenAI prompt overlay
+    <Note>
+    Use `contextWindow` to declare native model metadata. Use `contextTokens` to limit the runtime context budget.
+    </Note>

-If you want the unmodified base OpenClaw prompt, set the overlay to `"off"`:
-
-```json5
-{
-  plugins: {
-    entries: {
-      openai: {
-        config: {
-          personality: "off",
-        },
-      },
-    },
-  },
-}
-```
-
-You can also set it directly with the config CLI:
-
-```bash
-openclaw config set plugins.entries.openai.config.personality off
-```
-
-OpenClaw normalizes this setting case-insensitively at runtime, so values like
-`"Off"` still disable the friendly overlay.
-
-## Option A: OpenAI API key (OpenAI Platform)
-
-**Best for:** direct API access and usage-based billing.
-Get your API key from the OpenAI dashboard.
-
-Route summary:
-
- `openai/gpt-5.4` = direct OpenAI Platform API route
- Requires `OPENAI_API_KEY` (or equivalent OpenAI provider config)
- In OpenClaw, ChatGPT/Codex sign-in is routed through `openai-codex/*`, not `openai/*`
-
-### CLI setup
-
-```bash
-openclaw onboard --auth-choice openai-api-key
-# or non-interactive
-openclaw onboard --openai-api-key "$OPENAI_API_KEY"
-```
-
-### Config snippet
-
-```json5
-{
-  env: { OPENAI_API_KEY: "sk-..." },
-  agents: { defaults: { model: { primary: "openai/gpt-5.4" } } },
-}
-```
-
-OpenAI's current API model docs list `gpt-5.4` and `gpt-5.4-pro` for direct
-OpenAI API usage. OpenClaw forwards both through the `openai/*` Responses path.
-OpenClaw intentionally suppresses the stale `openai/gpt-5.3-codex-spark` row,
-because direct OpenAI API calls reject it in live traffic.
-
-OpenClaw does **not** expose `openai/gpt-5.3-codex-spark` on the direct OpenAI
-API path. `pi-ai` still ships a built-in row for that model, but live OpenAI API
-requests currently reject it. Spark is treated as Codex-only in OpenClaw.
+  </Tab>
+</Tabs>

 ## Image generation

-The bundled `openai` plugin also registers image generation through the shared
-`image_generate` tool.
+The bundled `openai` plugin registers image generation through the `image_generate` tool.

- Default image model: `openai/gpt-image-1`
- Generate: up to 4 images per request
- Edit mode: enabled, up to 5 reference images
- Supports `size`
- Current OpenAI-specific caveat: OpenClaw does not forward `aspectRatio` or
-  `resolution` overrides to the OpenAI Images API today
-
-To use OpenAI as the default image provider:
+| Capability                | Value                              |
+| ------------------------- | ---------------------------------- |
+| Default model             | `openai/gpt-image-1`               |
+| Max images per request    | 4                                  |
+| Edit mode                 | Enabled (up to 5 reference images) |
+| Size overrides            | Supported                          |
+| Aspect ratio / resolution | Not forwarded to OpenAI Images API |

 ```json5
 {
  agents: {
    defaults: {
-      imageGenerationModel: {
-        primary: "openai/gpt-image-1",
-      },
+      imageGenerationModel: { primary: "openai/gpt-image-1" },
    },
  },
 }
 ```

-See [Image Generation](/tools/image-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Image Generation](/tools/image-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

 ## Video generation

-The bundled `openai` plugin also registers video generation through the shared
-`video_generate` tool.
+The bundled `openai` plugin registers video generation through the `video_generate` tool.

- Default video model: `openai/sora-2`
- Modes: text-to-video, image-to-video, and single-video reference/edit flows
- Current limits: 1 image or 1 video reference input
- Current OpenAI-specific caveat: OpenClaw currently only forwards `size`
-  overrides for native OpenAI video generation. Unsupported optional overrides
-  such as `aspectRatio`, `resolution`, `audio`, and `watermark` are ignored
-  and reported back as a tool warning.
-
-To use OpenAI as the default video provider:
+| Capability       | Value                                                                             |
+| ---------------- | --------------------------------------------------------------------------------- |
+| Default model    | `openai/sora-2`                                                                   |
+| Modes            | Text-to-video, image-to-video, single-video edit                                  |
+| Reference inputs | 1 image or 1 video                                                                |
+| Size overrides   | Supported                                                                         |
+| Other overrides  | `aspectRatio`, `resolution`, `audio`, `watermark` are ignored with a tool warning |

 ```json5
 {
  agents: {
    defaults: {
-      videoGenerationModel: {
-        primary: "openai/sora-2",
-      },
+      videoGenerationModel: { primary: "openai/sora-2" },
    },
  },
 }
 ```

-See [Video Generation](/tools/video-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<Note>
+See [Video Generation](/tools/video-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

-## Option B: OpenAI Code (Codex) subscription
+## Personality overlay

-**Best for:** using ChatGPT/Codex subscription access instead of an API key.
-Codex cloud requires ChatGPT sign-in, while the Codex CLI supports ChatGPT or API key sign-in.
+OpenClaw adds a small OpenAI-specific prompt overlay for `openai/*` and `openai-codex/*` runs. The overlay keeps the assistant warm, collaborative, concise, and a little more emotionally expressive without replacing the base system prompt.

-Route summary:
+| Value                  | Effect                             |
+| ---------------------- | ---------------------------------- |
+| `"friendly"` (default) | Enable the OpenAI-specific overlay |
+| `"on"`                 | Alias for `"friendly"`             |
+| `"off"`                | Use base OpenClaw prompt only      |

- `openai-codex/gpt-5.4` = ChatGPT/Codex OAuth route
- Uses ChatGPT/Codex sign-in, not a direct OpenAI Platform API key
- Provider-side limits for `openai-codex/*` can differ from the ChatGPT web/app experience
-
-### CLI setup (Codex OAuth)
-
-```bash
-# Run Codex OAuth in the wizard
-openclaw onboard --auth-choice openai-codex
-
-# Or run OAuth directly
-openclaw models auth login --provider openai-codex
-```
-
-### Config snippet (Codex subscription)
-
-```json5
-{
-  agents: { defaults: { model: { primary: "openai-codex/gpt-5.4" } } },
-}
-```
-
-OpenAI's current Codex docs list `gpt-5.4` as the current Codex model. OpenClaw
-maps that to `openai-codex/gpt-5.4` for ChatGPT/Codex OAuth usage.
-
-This route is intentionally separate from `openai/gpt-5.4`. If you want the
-direct OpenAI Platform API path, use `openai/*` with an API key. If you want
-ChatGPT/Codex sign-in, use `openai-codex/*`.
-
-If onboarding reuses an existing Codex CLI login, those credentials stay
-managed by Codex CLI. On expiry, OpenClaw re-reads the external Codex source
-first and, when the provider can refresh it, writes the refreshed credential
-back to Codex storage instead of taking ownership in a separate OpenClaw-only
-copy.
-
-If your Codex account is entitled to Codex Spark, OpenClaw also supports:
-
- `openai-codex/gpt-5.3-codex-spark`
-
-OpenClaw treats Codex Spark as Codex-only. It does not expose a direct
-`openai/gpt-5.3-codex-spark` API-key path.
-
-OpenClaw also preserves `openai-codex/gpt-5.3-codex-spark` when `pi-ai`
-discovers it. Treat it as entitlement-dependent and experimental: Codex Spark is
-separate from GPT-5.4 `/fast`, and availability depends on the signed-in Codex /
-ChatGPT account.
-
-### Codex context window cap
-
-OpenClaw treats the Codex model metadata and the runtime context cap as separate
-values.
-
-For `openai-codex/gpt-5.4`:
-
- native `contextWindow`: `1050000`
- default runtime `contextTokens` cap: `272000`
-
-That keeps model metadata truthful while preserving the smaller default runtime
-window that has better latency and quality characteristics in practice.
-
-If you want a different effective cap, set `models.providers.<provider>.models[].contextTokens`:
-
-```json5
-{
-  models: {
-    providers: {
-      "openai-codex": {
-        models: [
-          {
-            id: "gpt-5.4",
-            contextTokens: 160000,
-          },
-        ],
+<Tabs>
+  <Tab title="Config">
+    ```json5
+    {
+      plugins: {
+        entries: {
+          openai: { config: { personality: "friendly" } },
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Tab>
+  <Tab title="CLI">
+    ```bash
+    openclaw config set plugins.entries.openai.config.personality off
+    ```
+  </Tab>
+</Tabs>

-Use `contextWindow` only when you are declaring or overriding native model
-metadata. Use `contextTokens` when you want to limit the runtime context budget.
+<Tip>
+Values are case-insensitive at runtime, so `"Off"` and `"off"` both disable the overlay.
+</Tip>
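
 The value handling described above (case-insensitive matching, with `"on"` as an alias for `"friendly"`) can be sketched in shell. The function name `normalize_personality` is hypothetical, and rejecting unrecognized values is an assumption of this sketch; the docs do not say how OpenClaw treats them.

 ```bash
 # Hypothetical helper: normalize a personality setting.
 normalize_personality() {
   # Compare case-insensitively, as the docs describe for runtime handling.
   value=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
   case "$value" in
     friendly|on) echo "friendly" ;;   # "on" is an alias for "friendly"
     off)         echo "off" ;;
     *)
       # Assumption: this sketch rejects anything else rather than guessing.
       echo "unrecognized personality value: $1" >&2
       return 1
       ;;
   esac
 }

 normalize_personality "Off"
 normalize_personality "on"
 ```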

-### Transport default
+## Voice and speech

-OpenClaw uses `pi-ai` for model streaming. For both `openai/*` and
-`openai-codex/*`, default transport is `"auto"` (WebSocket-first, then SSE
-fallback).
+<AccordionGroup>
+  <Accordion title="Speech synthesis (TTS)">
+    The bundled `openai` plugin registers speech synthesis for the `messages.tts` surface.

-In `"auto"` mode, OpenClaw also retries one early, retryable WebSocket failure
-before it falls back to SSE. Forced `"websocket"` mode still surfaces transport
-errors directly instead of hiding them behind fallback.
+    | Setting | Config path | Default |
+    |---------|------------|---------|
+    | Model | `messages.tts.providers.openai.model` | `gpt-4o-mini-tts` |
+    | Voice | `messages.tts.providers.openai.voice` | `coral` |
+    | Speed | `messages.tts.providers.openai.speed` | (unset) |
+    | Instructions | `messages.tts.providers.openai.instructions` | (unset, `gpt-4o-mini-tts` only) |
+    | Format | `messages.tts.providers.openai.responseFormat` | `opus` for voice notes, `mp3` for files |
+    | API key | `messages.tts.providers.openai.apiKey` | Falls back to `OPENAI_API_KEY` |
+    | Base URL | `messages.tts.providers.openai.baseUrl` | `https://api.openai.com/v1` |

-After a connect or early-turn WebSocket failure in `"auto"` mode, OpenClaw marks
-that session's WebSocket path as degraded for about 60 seconds and sends
-subsequent turns over SSE during the cool-down instead of thrashing between
-transports.
+    Available models: `gpt-4o-mini-tts`, `tts-1`, `tts-1-hd`. Available voices: `alloy`, `ash`, `ballad`, `cedar`, `coral`, `echo`, `fable`, `juniper`, `marin`, `onyx`, `nova`, `sage`, `shimmer`, `verse`.

-For native OpenAI-family endpoints (`openai/*`, `openai-codex/*`, and Azure
-OpenAI Responses), OpenClaw also attaches stable session and turn identity state
-to requests so retries, reconnects, and SSE fallback stay aligned to the same
-conversation identity. On native OpenAI-family routes this includes stable
-session/turn request identity headers plus matching transport metadata.
-
-OpenClaw also normalizes OpenAI usage counters across transport variants before
-they reach session/status surfaces. Native OpenAI/Codex Responses traffic may
-report usage as either `input_tokens` / `output_tokens` or
-`prompt_tokens` / `completion_tokens`; OpenClaw treats those as the same input
-and output counters for `/status`, `/usage`, and session logs. When native
-WebSocket traffic omits `total_tokens` (or reports `0`), OpenClaw falls back to
-the normalized input + output total so session/status displays stay populated.
-
-You can set `agents.defaults.models.<provider/model>.params.transport`:
-
- `"sse"`: force SSE
- `"websocket"`: force WebSocket
- `"auto"`: try WebSocket, then fall back to SSE
-
-For `openai/*` (Responses API), OpenClaw also enables WebSocket warm-up by
-default (`openaiWsWarmup: true`) when WebSocket transport is used.
-
-Related OpenAI docs:
-
- [Realtime API with WebSocket](https://platform.openai.com/docs/guides/realtime-websocket)
- [Streaming API responses (SSE)](https://platform.openai.com/docs/guides/streaming-responses)
-
-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "openai-codex/gpt-5.4" },
-      models: {
-        "openai-codex/gpt-5.4": {
-          params: {
-            transport: "auto",
+    ```json5
+    {
+      messages: {
+        tts: {
+          providers: {
+            openai: { model: "gpt-4o-mini-tts", voice: "coral" },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-### OpenAI WebSocket warm-up
+    <Note>
+    Set `OPENAI_TTS_BASE_URL` to override the TTS base URL without affecting the chat API endpoint.
+    </Note>

-OpenAI docs describe warm-up as optional. OpenClaw enables it by default for
-`openai/*` to reduce first-turn latency when using WebSocket transport.
+  </Accordion>

-### Disable warm-up
+  <Accordion title="Realtime transcription">
+    The bundled `openai` plugin registers realtime transcription for the Voice Call plugin.

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            openaiWsWarmup: false,
+    | Setting | Config path | Default |
+    |---------|------------|---------|
+    | Model | `plugins.entries.voice-call.config.streaming.providers.openai.model` | `gpt-4o-transcribe` |
+    | Silence duration | `...openai.silenceDurationMs` | `800` |
+    | VAD threshold | `...openai.vadThreshold` | `0.5` |
+    | API key | `...openai.apiKey` | Falls back to `OPENAI_API_KEY` |
+
+    <Note>
+    Uses a WebSocket connection to `wss://api.openai.com/v1/realtime` with G.711 u-law audio.
+    </Note>
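+
+    A minimal config sketch for the table above. It assumes the abbreviated `...openai.*` paths share the `plugins.entries.voice-call.config.streaming.providers.openai` prefix of the Model row; the values shown are the documented defaults:
+
+    ```json5
+    {
+      plugins: {
+        entries: {
+          "voice-call": {
+            config: {
+              streaming: {
+                providers: {
+                  openai: {
+                    model: "gpt-4o-transcribe",
+                    silenceDurationMs: 800, // assumed to sit next to `model`
+                    vadThreshold: 0.5, // assumed to sit next to `model`
+                  },
+                },
+              },
+            },
+          },
+        },
+      },
+    }
+    ```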
+
+  </Accordion>
+
+  <Accordion title="Realtime voice">
+    The bundled `openai` plugin registers realtime voice for the Voice Call plugin.
+
+    | Setting | Config path | Default |
+    |---------|------------|---------|
+    | Model | `plugins.entries.voice-call.config.realtime.providers.openai.model` | `gpt-realtime` |
+    | Voice | `...openai.voice` | `alloy` |
+    | Temperature | `...openai.temperature` | `0.8` |
+    | VAD threshold | `...openai.vadThreshold` | `0.5` |
+    | Silence duration | `...openai.silenceDurationMs` | `500` |
+    | API key | `...openai.apiKey` | Falls back to `OPENAI_API_KEY` |
+
+    <Note>
+    Supports Azure OpenAI via `azureEndpoint` and `azureDeployment` config keys. Supports bidirectional tool calling. Uses G.711 u-law audio format.
+    </Note>
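+
+    A minimal config sketch for the table above. It assumes the abbreviated `...openai.*` paths share the `plugins.entries.voice-call.config.realtime.providers.openai` prefix of the Model row; the values shown are the documented defaults, and the Azure keys are left out because their exact location is not stated here:
+
+    ```json5
+    {
+      plugins: {
+        entries: {
+          "voice-call": {
+            config: {
+              realtime: {
+                providers: {
+                  openai: {
+                    model: "gpt-realtime",
+                    voice: "alloy", // assumed to sit next to `model`
+                    temperature: 0.8, // assumed to sit next to `model`
+                    vadThreshold: 0.5, // assumed to sit next to `model`
+                    silenceDurationMs: 500, // assumed to sit next to `model`
+                  },
+                },
+              },
+            },
+          },
+        },
+      },
+    }
+    ```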
+
+  </Accordion>
+</AccordionGroup>
+
+## Advanced configuration
+
+<AccordionGroup>
+  <Accordion title="Transport (WebSocket vs SSE)">
+    By default, OpenClaw uses `"auto"` transport (WebSocket first, then SSE fallback) for both `openai/*` and `openai-codex/*`.
+
+    In `"auto"` mode, OpenClaw:
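+    - Retries one early, retryable WebSocket failure before falling back to SSE
+    - After a connect or early-turn WebSocket failure, marks that session's WebSocket path as degraded for about 60 seconds and sends subsequent turns over SSE during the cool-down
+    - Attaches stable session and turn identity headers so retries, reconnects, and SSE fallback stay aligned to the same conversation
+    - Normalizes usage counters (`input_tokens` / `output_tokens` and `prompt_tokens` / `completion_tokens`) across transport variants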
+
+    | Value | Behavior |
+    |-------|----------|
+    | `"auto"` (default) | WebSocket first, SSE fallback |
+    | `"sse"` | Force SSE only |
+    | `"websocket"` | Force WebSocket only (transport errors surface directly, with no fallback) |
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "openai-codex/gpt-5.4": {
+              params: { transport: "auto" },
+            },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```
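+
+    To pin a single transport, set one of the other values from the table. A sketch that forces SSE for one model (the model ref is only an example):
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "openai/gpt-5.4": {
+              params: { transport: "sse" }, // SSE only, per the table above
+            },
+          },
+        },
+      },
+    }
+    ```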

-### Enable warm-up explicitly
+    Related OpenAI docs:
+    - [Realtime API with WebSocket](https://platform.openai.com/docs/guides/realtime-websocket)
+    - [Streaming API responses (SSE)](https://platform.openai.com/docs/guides/streaming-responses)

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            openaiWsWarmup: true,
+  </Accordion>
+
+  <Accordion title="WebSocket warm-up">
+    OpenClaw enables WebSocket warm-up by default for `openai/*` to reduce first-turn latency.
+
+    ```json5
+    // Disable warm-up
+    {
+      agents: {
+        defaults: {
+          models: {
+            "openai/gpt-5.4": {
+              params: { openaiWsWarmup: false },
+            },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-### OpenAI and Codex priority processing
+  </Accordion>

-OpenAI's API exposes priority processing via `service_tier=priority`. In
-OpenClaw, set `agents.defaults.models["<provider>/<model>"].params.serviceTier`
-to pass that field through on native OpenAI/Codex Responses endpoints.
+  <Accordion title="Fast mode">
+    OpenClaw exposes a shared fast-mode toggle for both `openai/*` and `openai-codex/*`:

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            serviceTier: "priority",
-          },
-        },
-        "openai-codex/gpt-5.4": {
-          params: {
-            serviceTier: "priority",
+    - **Chat/UI:** `/fast status|on|off`
+    - **Config:** `agents.defaults.models["<provider>/<model>"].params.fastMode`
+
+    When enabled, OpenClaw maps fast mode to OpenAI priority processing (`service_tier = "priority"`). Existing `service_tier` values are preserved, and fast mode does not rewrite `reasoning` or `text.verbosity`.
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "openai/gpt-5.4": { params: { fastMode: true } },
+            "openai-codex/gpt-5.4": { params: { fastMode: true } },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-Supported values are `auto`, `default`, `flex`, and `priority`.
+    <Note>
+    Session overrides win over config. Clearing the session override in the Sessions UI returns the session to the configured default.
+    </Note>

-OpenClaw forwards `params.serviceTier` to both direct `openai/*` Responses
-requests and `openai-codex/*` Codex Responses requests when those models point
-at the native OpenAI/Codex endpoints.
+  </Accordion>

-Important behavior:
+  <Accordion title="Priority processing (service_tier)">
+    OpenAI's API exposes priority processing via `service_tier`. Set it per model in OpenClaw:

- direct `openai/*` must target `api.openai.com`
- `openai-codex/*` must target `chatgpt.com/backend-api`
- if you route either provider through another base URL or proxy, OpenClaw leaves `service_tier` untouched
-
-### OpenAI fast mode
-
-OpenClaw exposes a shared fast-mode toggle for both `openai/*` and
-`openai-codex/*` sessions:
-
- Chat/UI: `/fast status|on|off`
- Config: `agents.defaults.models["<provider>/<model>"].params.fastMode`
-
-When fast mode is enabled, OpenClaw maps it to OpenAI priority processing:
-
- direct `openai/*` Responses calls to `api.openai.com` send `service_tier = "priority"`
- `openai-codex/*` Responses calls to `chatgpt.com/backend-api` also send `service_tier = "priority"`
- existing payload `service_tier` values are preserved
- fast mode does not rewrite `reasoning` or `text.verbosity`
-
-For GPT 5.4 specifically, the most common setup is:
-
- send `/fast on` in a session using `openai/gpt-5.4` or `openai-codex/gpt-5.4`
- or set `agents.defaults.models["openai/gpt-5.4"].params.fastMode = true`
- if you also use Codex OAuth, set `agents.defaults.models["openai-codex/gpt-5.4"].params.fastMode = true` too
-
-Example:
-
-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            fastMode: true,
-          },
-        },
-        "openai-codex/gpt-5.4": {
-          params: {
-            fastMode: true,
+    ```json5
+    {
+      agents: {
+        defaults: {
+          models: {
+            "openai/gpt-5.4": { params: { serviceTier: "priority" } },
+            "openai-codex/gpt-5.4": { params: { serviceTier: "priority" } },
          },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-Session overrides win over config. Clearing the session override in the Sessions UI
-returns the session to the configured default.
+    Supported values: `auto`, `default`, `flex`, `priority`.

-### Native OpenAI versus OpenAI-compatible routes
+    <Warning>
+    `serviceTier` is only forwarded to native OpenAI endpoints (`api.openai.com`) and native Codex endpoints (`chatgpt.com/backend-api`). If you route either provider through a proxy, OpenClaw leaves `service_tier` untouched.
+    </Warning>

-OpenClaw treats direct OpenAI, Codex, and Azure OpenAI endpoints differently
-from generic OpenAI-compatible `/v1` proxies:
+  </Accordion>

- native `openai/*`, `openai-codex/*`, and Azure OpenAI routes keep
-  `reasoning: { effort: "none" }` intact when you explicitly disable reasoning
- native OpenAI-family routes default tool schemas to strict mode
- hidden OpenClaw attribution headers (`originator`, `version`, and
-  `User-Agent`) are only attached on verified native OpenAI hosts
-  (`api.openai.com`) and native Codex hosts (`chatgpt.com/backend-api`)
- native OpenAI/Codex routes keep OpenAI-only request shaping such as
-  `service_tier`, Responses `store`, OpenAI reasoning-compat payloads, and
-  prompt-cache hints
- proxy-style OpenAI-compatible routes keep the looser compat behavior and do
-  not force strict tool schemas, native-only request shaping, or hidden
-  OpenAI/Codex attribution headers
+  <Accordion title="Server-side compaction (Responses API)">
+    For direct OpenAI Responses models (`openai/*` on `api.openai.com`), OpenClaw auto-enables server-side compaction:

-Azure OpenAI stays in the native-routing bucket for transport and compat
-behavior, but it does not receive the hidden OpenAI/Codex attribution headers.
+    - Forces `store: true` (unless model compat sets `supportsStore: false`)
+    - Injects `context_management: [{ type: "compaction", compact_threshold: ... }]`
+    - Default `compact_threshold`: 70% of `contextWindow` (or `80000` when unavailable)
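+
+    As an illustration, assuming a hypothetical model with `contextWindow: 200000`, the default threshold works out to 0.7 × 200000 = 140000, so the injected request fields would look roughly like this (a sketch of the relevant fields only, not the full request payload):
+
+    ```json5
+    {
+      store: true, // forced unless model compat sets supportsStore: false
+      context_management: [
+        { type: "compaction", compact_threshold: 140000 }, // 70% of the assumed 200000
+      ],
+    }
+    ```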

-This preserves current native OpenAI Responses behavior without forcing older
-OpenAI-compatible shims onto third-party `/v1` backends.
+    <Tabs>
+      <Tab title="Enable explicitly">
+        Useful for compatible endpoints like Azure OpenAI Responses:

-### OpenAI Responses server-side compaction
-
-For direct OpenAI Responses models (`openai/*` using `api: "openai-responses"` with
-`baseUrl` on `api.openai.com`), OpenClaw now auto-enables OpenAI server-side
-compaction payload hints:
-
- Forces `store: true` (unless model compat sets `supportsStore: false`)
- Injects `context_management: [{ type: "compaction", compact_threshold: ... }]`
-
-By default, `compact_threshold` is `70%` of model `contextWindow` (or `80000`
-when unavailable).
-
-### Enable server-side compaction explicitly
-
-Use this when you want to force `context_management` injection on compatible
-Responses models (for example Azure OpenAI Responses):
-
-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "azure-openai-responses/gpt-5.4": {
-          params: {
-            responsesServerCompaction: true,
+        ```json5
+        {
+          agents: {
+            defaults: {
+              models: {
+                "azure-openai-responses/gpt-5.4": {
+                  params: { responsesServerCompaction: true },
+                },
+              },
+            },
          },
+        }
+        ```
+      </Tab>
+      <Tab title="Custom threshold">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              models: {
+                "openai/gpt-5.4": {
+                  params: {
+                    responsesServerCompaction: true,
+                    responsesCompactThreshold: 120000,
+                  },
+                },
+              },
+            },
+          },
+        }
+        ```
+      </Tab>
+      <Tab title="Disable">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              models: {
+                "openai/gpt-5.4": {
+                  params: { responsesServerCompaction: false },
+                },
+              },
+            },
+          },
+        }
+        ```
+      </Tab>
+    </Tabs>
+
+    <Note>
+    `responsesServerCompaction` only controls `context_management` injection. Direct OpenAI Responses models still force `store: true` unless compat sets `supportsStore: false`.
+    </Note>
+
+  </Accordion>
+
+  <Accordion title="Strict-agentic GPT mode">
+    For GPT-5-family runs on `openai/*` and `openai-codex/*`, OpenClaw can use a stricter embedded execution contract:
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          embeddedPi: { executionContract: "strict-agentic" },
        },
      },
-    },
-  },
-}
-```
+    }
+    ```

-### Enable with a custom threshold
+    With `strict-agentic`, OpenClaw:
+    - No longer treats a plan-only turn as successful progress when a tool action is available
+    - Retries the turn with an act-now steer
+    - Auto-enables `update_plan` for substantial work
+    - Surfaces an explicit blocked state if the model keeps planning without acting

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            responsesServerCompaction: true,
-            responsesCompactThreshold: 120000,
-          },
-        },
-      },
-    },
-  },
-}
-```
+    <Note>
+    Scoped to OpenAI and Codex GPT-5-family runs only. Other providers and older model families keep default behavior.
+    </Note>

-### Disable server-side compaction
+  </Accordion>

-```json5
-{
-  agents: {
-    defaults: {
-      models: {
-        "openai/gpt-5.4": {
-          params: {
-            responsesServerCompaction: false,
-          },
-        },
-      },
-    },
-  },
-}
-```
+  <Accordion title="Native vs OpenAI-compatible routes">
+    OpenClaw treats direct OpenAI, Codex, and Azure OpenAI endpoints differently from generic OpenAI-compatible `/v1` proxies:

-`responsesServerCompaction` only controls `context_management` injection.
-Direct OpenAI Responses models still force `store: true` unless compat sets
-`supportsStore: false`.
+    **Native routes** (`openai/*`, `openai-codex/*`, Azure OpenAI):
+    - Keep `reasoning: { effort: "none" }` intact when reasoning is explicitly disabled
+    - Default tool schemas to strict mode
+    - Attach hidden attribution headers on verified native hosts only
+    - Keep OpenAI-only request shaping (`service_tier`, `store`, reasoning-compat, prompt-cache hints)

-## Notes
+    **Proxy/compatible routes:**
+    - Use looser compat behavior
+    - Do not force strict tool schemas or native-only headers

- Model refs always use `provider/model` (see [/concepts/models](/concepts/models)).
- Auth details + reuse rules are in [/concepts/oauth](/concepts/oauth).
+    Azure OpenAI uses native transport and compat behavior but does not receive the hidden attribution headers.
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Image generation" href="/tools/image-generation" icon="image">
+    Shared image tool parameters and provider selection.
+  </Card>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="OAuth and auth" href="/gateway/authentication" icon="key">
+    Auth details and credential reuse rules.
+  </Card>
+</CardGroup>
--- a/docs/providers/opencode-go.md
+++ b/docs/providers/opencode-go.md
@@ -12,21 +12,60 @@ OpenCode Go is the Go catalog within [OpenCode](/providers/opencode).
 It uses the same `OPENCODE_API_KEY` as the Zen catalog, but keeps the runtime
 provider id `opencode-go` so upstream per-model routing stays correct.

+| Property         | Value                           |
+| ---------------- | ------------------------------- |
+| Runtime provider | `opencode-go`                   |
+| Auth             | `OPENCODE_API_KEY`              |
+| Parent setup     | [OpenCode](/providers/opencode) |
+
 ## Supported models

- `opencode-go/kimi-k2.5`
- `opencode-go/glm-5`
- `opencode-go/minimax-m2.5`
+| Model ref                  | Name         |
+| -------------------------- | ------------ |
+| `opencode-go/kimi-k2.5`    | Kimi K2.5    |
+| `opencode-go/glm-5`        | GLM 5        |
+| `opencode-go/minimax-m2.5` | MiniMax M2.5 |

-## CLI setup
+## Getting started

-```bash
-openclaw onboard --auth-choice opencode-go
-# or non-interactive
-openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"
-```
+<Tabs>
+  <Tab title="Interactive">
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice opencode-go
+        ```
+      </Step>
+      <Step title="Set a Go model as default">
+        ```bash
+        openclaw config set agents.defaults.model.primary "opencode-go/kimi-k2.5"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider opencode-go
+        ```
+      </Step>
+    </Steps>
+  </Tab>

-## Config snippet
+  <Tab title="Non-interactive">
+    <Steps>
+      <Step title="Pass the key directly">
+        ```bash
+        openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider opencode-go
+        ```
+      </Step>
+    </Steps>
+  </Tab>
+</Tabs>
+
+## Config example

 ```json5
 {
@@ -35,11 +74,37 @@ openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"
 }
 ```

-## Routing behavior
+## Advanced notes

-OpenClaw handles per-model routing automatically when the model ref uses `opencode-go/...`.
+<AccordionGroup>
+  <Accordion title="Routing behavior">
+    OpenClaw handles per-model routing automatically when the model ref uses
+    `opencode-go/...`. No additional provider config is required.
+  </Accordion>

-## Notes
+  <Accordion title="Runtime ref convention">
+    Runtime refs stay explicit: `opencode/...` for Zen, `opencode-go/...` for Go.
+    This keeps upstream per-model routing correct across both catalogs.
+  </Accordion>

- Use [OpenCode](/providers/opencode) for the shared onboarding and catalog overview.
- Runtime refs stay explicit: `opencode/...` for Zen, `opencode-go/...` for Go.
+  <Accordion title="Shared credentials">
+    The same `OPENCODE_API_KEY` is used by both the Zen and Go catalogs. Entering
+    the key during setup stores credentials for both runtime providers.
+  </Accordion>
+</AccordionGroup>
+
+<Tip>
+See [OpenCode](/providers/opencode) for the shared onboarding overview and the full
+Zen + Go catalog reference.
+</Tip>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="OpenCode (parent)" href="/providers/opencode" icon="server">
+    Shared onboarding, catalog overview, and advanced notes.
+  </Card>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+</CardGroup>
--- a/docs/providers/opencode.md
+++ b/docs/providers/opencode.md
@@ -10,30 +10,78 @@ title: "OpenCode"

 OpenCode exposes two hosted catalogs in OpenClaw:

- `opencode/...` for the **Zen** catalog
- `opencode-go/...` for the **Go** catalog
+| Catalog | Prefix            | Runtime provider |
+| ------- | ----------------- | ---------------- |
+| **Zen** | `opencode/...`    | `opencode`       |
+| **Go**  | `opencode-go/...` | `opencode-go`    |

 Both catalogs use the same OpenCode API key. OpenClaw keeps the runtime provider ids
 split so upstream per-model routing stays correct, but onboarding and docs treat them
 as one OpenCode setup.

-## CLI setup
+## Getting started

-### Zen catalog
+<Tabs>
+  <Tab title="Zen catalog">
+    **Best for:** the curated OpenCode multi-model proxy (Claude, GPT, Gemini).

-```bash
-openclaw onboard --auth-choice opencode-zen
-openclaw onboard --opencode-zen-api-key "$OPENCODE_API_KEY"
-```
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice opencode-zen
+        ```

-### Go catalog
+        Or pass the key directly:

-```bash
-openclaw onboard --auth-choice opencode-go
-openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"
-```
+        ```bash
+        openclaw onboard --opencode-zen-api-key "$OPENCODE_API_KEY"
+        ```
+      </Step>
+      <Step title="Set a Zen model as the default">
+        ```bash
+        openclaw config set agents.defaults.model.primary "opencode/claude-opus-4-6"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider opencode
+        ```
+      </Step>
+    </Steps>

-## Config snippet
+  </Tab>
+
+  <Tab title="Go catalog">
+    **Best for:** the OpenCode-hosted Kimi, GLM, and MiniMax lineup.
+
+    <Steps>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice opencode-go
+        ```
+
+        Or pass the key directly:
+
+        ```bash
+        openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"
+        ```
+      </Step>
+      <Step title="Set a Go model as the default">
+        ```bash
+        openclaw config set agents.defaults.model.primary "opencode-go/kimi-k2.5"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider opencode-go
+        ```
+      </Step>
+    </Steps>
+
+  </Tab>
+</Tabs>
+
+## Config example

 ```json5
 {
@@ -46,23 +94,58 @@ openclaw onboard --opencode-go-api-key "$OPENCODE_API_KEY"

 ### Zen

- Runtime provider: `opencode`
- Example models: `opencode/claude-opus-4-6`, `opencode/gpt-5.4`, `opencode/gemini-3-pro`
- Best when you want the curated OpenCode multi-model proxy
+| Property         | Value                                                                   |
+| ---------------- | ----------------------------------------------------------------------- |
+| Runtime provider | `opencode`                                                              |
+| Example models   | `opencode/claude-opus-4-6`, `opencode/gpt-5.4`, `opencode/gemini-3-pro` |

 ### Go

- Runtime provider: `opencode-go`
- Example models: `opencode-go/kimi-k2.5`, `opencode-go/glm-5`, `opencode-go/minimax-m2.5`
- Best when you want the OpenCode-hosted Kimi/GLM/MiniMax lineup
+| Property         | Value                                                                    |
+| ---------------- | ------------------------------------------------------------------------ |
+| Runtime provider | `opencode-go`                                                            |
+| Example models   | `opencode-go/kimi-k2.5`, `opencode-go/glm-5`, `opencode-go/minimax-m2.5` |

-## Notes
+## Advanced notes

- `OPENCODE_ZEN_API_KEY` is also supported.
- Entering one OpenCode key during setup stores credentials for both runtime providers.
- You sign in to OpenCode, add billing details, and copy your API key.
- Billing and catalog availability are managed from the OpenCode dashboard.
- Gemini-backed OpenCode refs stay on the proxy-Gemini path, so OpenClaw keeps
-  Gemini thought-signature sanitation there without enabling native Gemini
-  replay validation or bootstrap rewrites.
- Non-Gemini OpenCode refs keep the minimal OpenAI-compatible replay policy.
+<AccordionGroup>
+  <Accordion title="API key aliases">
+    `OPENCODE_ZEN_API_KEY` is also supported as an alias for `OPENCODE_API_KEY`.
+  </Accordion>
+
+  <Accordion title="Shared credentials">
+    Entering one OpenCode key during setup stores credentials for both runtime
+    providers. You do not need to onboard each catalog separately.
+  </Accordion>
+
+  <Accordion title="Billing and dashboard">
+    To get an API key, sign in to OpenCode, add billing details, and copy the key.
+    Billing and catalog availability are managed from the OpenCode dashboard.
+  </Accordion>
+
+  <Accordion title="Gemini replay behavior">
+    Gemini-backed OpenCode refs stay on the proxy-Gemini path, so OpenClaw keeps
+    Gemini thought-signature sanitation there without enabling native Gemini
+    replay validation or bootstrap rewrites.
+  </Accordion>
+
+  <Accordion title="Non-Gemini replay behavior">
+    Non-Gemini OpenCode refs keep the minimal OpenAI-compatible replay policy.
+  </Accordion>
+</AccordionGroup>
+
+<Tip>
+Entering one OpenCode key during setup stores credentials for both the Zen and
+Go runtime providers, so you only need to onboard once.
+</Tip>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config reference for agents, models, and providers.
+  </Card>
+</CardGroup>
--- a/docs/providers/openrouter.md
+++ b/docs/providers/openrouter.md
@@ -11,13 +11,28 @@ title: "OpenRouter"
 OpenRouter provides a **unified API** that routes requests to many models behind a single
 endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.

-## CLI setup
+## Getting started

-```bash
-openclaw onboard --auth-choice openrouter-api-key
-```
+<Steps>
+  <Step title="Get your API key">
+    Create an API key at [openrouter.ai/keys](https://openrouter.ai/keys).
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice openrouter-api-key
+    ```
+  </Step>
+  <Step title="(Optional) Switch to a specific model">
+    Onboarding defaults to `openrouter/auto`. Pick a concrete model later:

-## Config snippet
+    ```bash
+    openclaw models set openrouter/<provider>/<model>
+    ```
+
+  </Step>
+</Steps>
+
+## Config example

 ```json5
 {
@@ -30,30 +45,71 @@ openclaw onboard --auth-choice openrouter-api-key
 }
 ```

-## Notes
+## Model references

- Model refs are `openrouter/<provider>/<model>`.
- Onboarding defaults to `openrouter/auto`. Switch to a concrete model later with
-  `openclaw models set openrouter/<provider>/<model>`.
- For more model/provider options, see [/concepts/model-providers](/concepts/model-providers).
- OpenRouter uses a Bearer token with your API key under the hood.
- On real OpenRouter requests (`https://openrouter.ai/api/v1`), OpenClaw also
-  adds OpenRouter's documented app-attribution headers:
-  `HTTP-Referer: https://openclaw.ai`, `X-OpenRouter-Title: OpenClaw`, and
-  `X-OpenRouter-Categories: cli-agent`.
- On verified OpenRouter routes, Anthropic model refs also keep the
-  OpenRouter-specific Anthropic `cache_control` markers that OpenClaw uses for
-  better prompt-cache reuse on system/developer prompt blocks.
- If you repoint the OpenRouter provider at some other proxy/base URL, OpenClaw
-  does not inject those OpenRouter-specific headers or Anthropic cache markers.
- OpenRouter still runs through the proxy-style OpenAI-compatible path, so
-  native OpenAI-only request shaping such as `serviceTier`, Responses `store`,
-  OpenAI reasoning-compat payloads, and prompt-cache hints is not forwarded.
- Gemini-backed OpenRouter refs stay on the proxy-Gemini path: OpenClaw keeps
-  Gemini thought-signature sanitation there, but does not enable native Gemini
-  replay validation or bootstrap rewrites.
- On supported non-`auto` routes, OpenClaw maps the selected thinking level to
-  OpenRouter proxy reasoning payloads. Unsupported model hints and
-  `openrouter/auto` skip that reasoning injection.
- If you pass OpenRouter provider routing under model params, OpenClaw forwards
-  it as OpenRouter routing metadata before the shared stream wrappers run.
+<Note>
+Model refs follow the pattern `openrouter/<provider>/<model>`. For the full list of
+available providers and models, see [/concepts/model-providers](/concepts/model-providers).
+</Note>
+
+## Authentication and headers
+
+Under the hood, OpenRouter requests authenticate with your API key as a Bearer token.
+
+On real OpenRouter requests (`https://openrouter.ai/api/v1`), OpenClaw also adds
+OpenRouter's documented app-attribution headers:
+
+| Header                    | Value                 |
+| ------------------------- | --------------------- |
+| `HTTP-Referer`            | `https://openclaw.ai` |
+| `X-OpenRouter-Title`      | `OpenClaw`            |
+| `X-OpenRouter-Categories` | `cli-agent`           |
+
+<Warning>
+If you repoint the OpenRouter provider at some other proxy or base URL, OpenClaw
+does **not** inject those OpenRouter-specific headers or Anthropic cache markers.
+</Warning>
+
+## Advanced notes
+
+<AccordionGroup>
+  <Accordion title="Anthropic cache markers">
+    On verified OpenRouter routes, Anthropic model refs keep the
+    OpenRouter-specific Anthropic `cache_control` markers that OpenClaw uses for
+    better prompt-cache reuse on system/developer prompt blocks.
+  </Accordion>
+
+  <Accordion title="Thinking / reasoning injection">
+    On supported non-`auto` routes, OpenClaw maps the selected thinking level to
+    OpenRouter proxy reasoning payloads. Unsupported model hints and
+    `openrouter/auto` skip that reasoning injection.
+  </Accordion>
+
+  <Accordion title="OpenAI-only request shaping">
+    OpenRouter still runs through the proxy-style OpenAI-compatible path, so
+    native OpenAI-only request shaping such as `serviceTier`, Responses `store`,
+    OpenAI reasoning-compat payloads, and prompt-cache hints is not forwarded.
+  </Accordion>
+
+  <Accordion title="Gemini-backed routes">
+    Gemini-backed OpenRouter refs stay on the proxy-Gemini path: OpenClaw keeps
+    Gemini thought-signature sanitation there, but does not enable native Gemini
+    replay validation or bootstrap rewrites.
+  </Accordion>
+
+  <Accordion title="Provider routing metadata">
+    If you pass OpenRouter provider routing under model params, OpenClaw forwards
+    it as OpenRouter routing metadata before the shared stream wrappers run.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config reference for agents, models, and providers.
+  </Card>
+</CardGroup>
--- a/docs/providers/perplexity-provider.md
+++ b/docs/providers/perplexity-provider.md
@@ -1,5 +1,5 @@
 ---
-title: "Perplexity (Provider)"
+title: "Perplexity"
 summary: "Perplexity web search provider setup (API key, search modes, filtering)"
 read_when:
  - You want to configure Perplexity as a web search provider
@@ -16,30 +16,52 @@ This page covers the Perplexity **provider** setup. For the Perplexity
 **tool** (how the agent uses it), see [Perplexity tool](/tools/perplexity-search).
 </Note>

- Type: web search provider (not a model provider)
- Auth: `PERPLEXITY_API_KEY` (direct) or `OPENROUTER_API_KEY` (via OpenRouter)
- Config path: `plugins.entries.perplexity.config.webSearch.apiKey`
+| Property    | Value                                                                  |
+| ----------- | ---------------------------------------------------------------------- |
+| Type        | Web search provider (not a model provider)                             |
+| Auth        | `PERPLEXITY_API_KEY` (direct) or `OPENROUTER_API_KEY` (via OpenRouter) |
+| Config path | `plugins.entries.perplexity.config.webSearch.apiKey`                   |

-## Quick start
+## Getting started

-1. Set the API key:
+<Steps>
+  <Step title="Set the API key">
+    Run the interactive web-search configuration flow:

-```bash
-openclaw configure --section web
-```
+    ```bash
+    openclaw configure --section web
+    ```

-Or set it directly:
+    Or set the key directly:

-```bash
-openclaw config set plugins.entries.perplexity.config.webSearch.apiKey "pplx-xxxxxxxxxxxx"
-```
+    ```bash
+    openclaw config set plugins.entries.perplexity.config.webSearch.apiKey "pplx-xxxxxxxxxxxx"
+    ```

-2. The agent will automatically use Perplexity for web searches when configured.
+  </Step>
+  <Step title="Start searching">
+    The agent automatically uses Perplexity for web searches once the key is
+    configured. No additional steps are required.
+  </Step>
+</Steps>
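+
+As a rough illustration, the same key could also be written straight into the
+config file. This sketch assumes the dotted path passed to `openclaw config set`
+above maps one-to-one onto nested keys; the key value is a placeholder.
+
+```json5
+{
+  plugins: {
+    entries: {
+      perplexity: {
+        config: {
+          // Assumption: mirrors plugins.entries.perplexity.config.webSearch.apiKey
+          webSearch: { apiKey: "pplx-xxxxxxxxxxxx" },
+        },
+      },
+    },
+  },
+}
+```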

 ## Search modes

 The plugin auto-selects the transport based on API key prefix:

+<Tabs>
+  <Tab title="Native Perplexity API (pplx-)">
+    When your key starts with `pplx-`, OpenClaw uses the native Perplexity Search
+    API. This transport returns structured results and supports domain, language,
+    and date filters (see filtering options below).
+  </Tab>
+  <Tab title="OpenRouter / Sonar (sk-or-)">
+    When your key starts with `sk-or-`, OpenClaw routes through OpenRouter using
+    the Perplexity Sonar model. This transport returns AI-synthesized answers with
+    citations.
+  </Tab>
+</Tabs>
+
 | Key prefix | Transport                    | Features                                         |
 | ---------- | ---------------------------- | ------------------------------------------------ |
 | `pplx-`    | Native Perplexity Search API | Structured results, domain/language/date filters |
@@ -47,16 +69,58 @@ The plugin auto-selects the transport based on API key prefix:

 ## Native API filtering

-When using the native Perplexity API (`pplx-` key), searches support:
+<Note>
+Filtering options are only available when using the native Perplexity API
+(`pplx-` key). OpenRouter/Sonar searches do not support these parameters.
+</Note>

- **Country**: 2-letter country code
- **Language**: ISO 639-1 language code
- **Date range**: day, week, month, year
- **Domain filters**: allowlist/denylist (max 20 domains)
- **Content budget**: `max_tokens`, `max_tokens_per_page`
+When using the native Perplexity API, searches support the following filters:

-## Environment note
+| Filter         | Description                            | Example                             |
+| -------------- | -------------------------------------- | ----------------------------------- |
+| Country        | 2-letter country code                  | `us`, `de`, `jp`                    |
+| Language       | ISO 639-1 language code                | `en`, `fr`, `zh`                    |
+| Date range     | Recency window                         | `day`, `week`, `month`, `year`      |
+| Domain filters | Allowlist or denylist (max 20 domains) | `example.com`                       |
+| Content budget | Token limits per response / per page   | `max_tokens`, `max_tokens_per_page` |

-If the Gateway runs as a daemon (launchd/systemd), make sure
-`PERPLEXITY_API_KEY` is available to that process (for example, in
-`~/.openclaw/.env` or via `env.shellEnv`).
+## Advanced notes
+
+<AccordionGroup>
+  <Accordion title="Environment variable for daemon processes">
+    If the OpenClaw Gateway runs as a daemon (launchd/systemd), make sure
+    `PERPLEXITY_API_KEY` is available to that process.
+
+    <Warning>
+    A key set only in `~/.profile` will not be visible to a launchd/systemd
+    daemon unless that environment is explicitly imported. Set the key in
+    `~/.openclaw/.env` or via `env.shellEnv` to ensure the gateway process can
+    read it.
+    </Warning>
+
+  </Accordion>
+
+  <Accordion title="OpenRouter proxy setup">
+    If you prefer to route Perplexity searches through OpenRouter, set an
+    `OPENROUTER_API_KEY` (prefix `sk-or-`) instead of a native Perplexity key.
+    OpenClaw will detect the prefix and switch to the Sonar transport
+    automatically.
+
+    <Tip>
+    The OpenRouter transport is useful if you already have an OpenRouter account
+    and want consolidated billing across multiple providers.
+    </Tip>
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Perplexity search tool" href="/tools/perplexity-search" icon="magnifying-glass">
+    How the agent invokes Perplexity searches and interprets results.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full configuration reference including plugin entries.
+  </Card>
+</CardGroup>
--- a/docs/providers/qianfan.md
+++ b/docs/providers/qianfan.md
@@ -6,31 +6,51 @@ read_when:
 title: "Qianfan"
 ---

-# Qianfan Provider Guide
+# Qianfan

-Qianfan is Baidu's MaaS platform, provides a **unified API** that routes requests to many models behind a single
+Qianfan is Baidu's MaaS platform, providing a **unified API** that routes requests to many models behind a single
 endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.

-## Prerequisites
+| Property | Value                             |
+| -------- | --------------------------------- |
+| Provider | `qianfan`                         |
+| Auth     | `QIANFAN_API_KEY`                 |
+| API      | OpenAI-compatible                 |
+| Base URL | `https://qianfan.baidubce.com/v2` |

-1. A Baidu Cloud account with Qianfan API access
-2. An API key from the Qianfan console
-3. OpenClaw installed on your system
+## Getting started

-## Getting Your API Key
+<Steps>
+  <Step title="Create a Baidu Cloud account">
+    Sign up or log in at the [Qianfan Console](https://console.bce.baidu.com/qianfan/ais/console/apiKey) and ensure you have Qianfan API access enabled.
+  </Step>
+  <Step title="Generate an API key">
+    Create a new application or select an existing one, then generate an API key. The key format is `bce-v3/ALTAK-...`.
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice qianfan-api-key
+    ```
+  </Step>
+  <Step title="Verify the model is available">
+    ```bash
+    openclaw models list --provider qianfan
+    ```
+  </Step>
+</Steps>

-1. Visit the [Qianfan Console](https://console.bce.baidu.com/qianfan/ais/console/apiKey)
-2. Create a new application or select an existing one
-3. Generate an API key (format: `bce-v3/ALTAK-...`)
-4. Copy the API key for use with OpenClaw
+## Available models

-## CLI setup
+| Model ref                            | Input       | Context | Max output | Reasoning | Notes         |
+| ------------------------------------ | ----------- | ------- | ---------- | --------- | ------------- |
+| `qianfan/deepseek-v3.2`              | text        | 98,304  | 32,768     | Yes       | Default model |
+| `qianfan/ernie-5.0-thinking-preview` | text, image | 119,000 | 64,000     | Yes       | Multimodal    |

-```bash
-openclaw onboard --auth-choice qianfan-api-key
-```
+<Tip>
+The default bundled model ref is `qianfan/deepseek-v3.2`. Override `models.providers.qianfan` only when you need a custom base URL or model metadata.
+</Tip>

-## Config snippet
+## Config example

 ```json5
 {
@@ -74,17 +94,40 @@ openclaw onboard --auth-choice qianfan-api-key
 }
 ```

-## Notes
+<AccordionGroup>
+  <Accordion title="Transport and compatibility">
+    Qianfan runs through the OpenAI-compatible transport path, not native OpenAI request shaping. This means standard OpenAI SDK features work, but provider-specific parameters may not be forwarded.
+  </Accordion>

- Default bundled model ref: `qianfan/deepseek-v3.2`
- Default base URL: `https://qianfan.baidubce.com/v2`
- Bundled catalog currently includes `deepseek-v3.2` and `ernie-5.0-thinking-preview`
- Add or override `models.providers.qianfan` only when you need custom base URL or model metadata
- Qianfan runs through the OpenAI-compatible transport path, not native OpenAI request shaping
+  <Accordion title="Catalog and overrides">
+    The bundled catalog currently includes `deepseek-v3.2` and `ernie-5.0-thinking-preview`. Add or override `models.providers.qianfan` only when you need a custom base URL or model metadata.

-## Related Documentation
+    <Note>
+    Model refs use the `qianfan/` prefix (for example `qianfan/deepseek-v3.2`).
+    </Note>

- [OpenClaw Configuration](/gateway/configuration)
- [Model Providers](/concepts/model-providers)
- [Agent Setup](/concepts/agent)
- [Qianfan API Documentation](https://cloud.baidu.com/doc/qianfan-api/s/3m7of64lb)
+  </Accordion>
+
+  <Accordion title="Troubleshooting">
+    - Ensure your API key starts with `bce-v3/ALTAK-` and has Qianfan API access enabled in the Baidu Cloud console.
+    - If models are not listed, confirm your account has the Qianfan service activated.
+    - The default base URL is `https://qianfan.baidubce.com/v2`. Only change it if you use a custom endpoint or proxy.
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration" icon="gear">
+    Full OpenClaw configuration reference.
+  </Card>
+  <Card title="Agent setup" href="/concepts/agent" icon="robot">
+    Configuring agent defaults and model assignments.
+  </Card>
+  <Card title="Qianfan API docs" href="https://cloud.baidu.com/doc/qianfan-api/s/3m7of64lb" icon="arrow-up-right-from-square">
+    Official Qianfan API documentation.
+  </Card>
+</CardGroup>
--- a/docs/providers/qwen.md
+++ b/docs/providers/qwen.md
@@ -17,8 +17,6 @@ background.

 </Warning>

-## Recommended: Qwen Cloud
-
 OpenClaw now treats Qwen as a first-class bundled provider with canonical id
 `qwen`. The bundled provider targets the Qwen Cloud / Alibaba DashScope and
 Coding Plan endpoints and keeps legacy `modelstudio` ids working as a
@@ -29,38 +27,108 @@ compatibility alias.
 - Also accepted for compatibility: `MODELSTUDIO_API_KEY`, `DASHSCOPE_API_KEY`
 - API style: OpenAI-compatible

+<Tip>
 If you want `qwen3.6-plus`, prefer the **Standard (pay-as-you-go)** endpoint.
 Coding Plan support can lag behind the public catalog.
+</Tip>

-```bash
-# Global Coding Plan endpoint
-openclaw onboard --auth-choice qwen-api-key
+## Getting started

-# China Coding Plan endpoint
-openclaw onboard --auth-choice qwen-api-key-cn
+Choose your plan type and follow the setup steps.

-# Global Standard (pay-as-you-go) endpoint
-openclaw onboard --auth-choice qwen-standard-api-key
+<Tabs>
+  <Tab title="Coding Plan (subscription)">
+    **Best for:** subscription-based access through the Qwen Coding Plan.

-# China Standard (pay-as-you-go) endpoint
-openclaw onboard --auth-choice qwen-standard-api-key-cn
-```
+    <Steps>
+      <Step title="Get your API key">
+        Create or copy an API key from [home.qwencloud.com/api-keys](https://home.qwencloud.com/api-keys).
+      </Step>
+      <Step title="Run onboarding">
+        For the **Global** endpoint:

-Legacy `modelstudio-*` auth-choice ids and `modelstudio/...` model refs still
-work as compatibility aliases, but new setup flows should prefer the canonical
-`qwen-*` auth-choice ids and `qwen/...` model refs.
+        ```bash
+        openclaw onboard --auth-choice qwen-api-key
+        ```

-After onboarding, set a default model:
+        For the **China** endpoint:

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "qwen/qwen3.5-plus" },
-    },
-  },
-}
-```
+        ```bash
+        openclaw onboard --auth-choice qwen-api-key-cn
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "qwen/qwen3.5-plus" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider qwen
+        ```
+      </Step>
+    </Steps>
+
+    <Note>
+    Legacy `modelstudio-*` auth-choice ids and `modelstudio/...` model refs still
+    work as compatibility aliases, but new setup flows should prefer the canonical
+    `qwen-*` auth-choice ids and `qwen/...` model refs.
+    </Note>
+
+  </Tab>
+
+  <Tab title="Standard (pay-as-you-go)">
+    **Best for:** pay-as-you-go access through the Standard Model Studio endpoint, including models like `qwen3.6-plus` that may not be available on the Coding Plan.
+
+    <Steps>
+      <Step title="Get your API key">
+        Create or copy an API key from [home.qwencloud.com/api-keys](https://home.qwencloud.com/api-keys).
+      </Step>
+      <Step title="Run onboarding">
+        For the **Global** endpoint:
+
+        ```bash
+        openclaw onboard --auth-choice qwen-standard-api-key
+        ```
+
+        For the **China** endpoint:
+
+        ```bash
+        openclaw onboard --auth-choice qwen-standard-api-key-cn
+        ```
+      </Step>
+      <Step title="Set a default model">
+        ```json5
+        {
+          agents: {
+            defaults: {
+              model: { primary: "qwen/qwen3.5-plus" },
+            },
+          },
+        }
+        ```
+      </Step>
+      <Step title="Verify the model is available">
+        ```bash
+        openclaw models list --provider qwen
+        ```
+      </Step>
+    </Steps>
+
+    <Note>
+    Legacy `modelstudio-*` auth-choice ids and `modelstudio/...` model refs still
+    work as compatibility aliases, but new setup flows should prefer the canonical
+    `qwen-*` auth-choice ids and `qwen/...` model refs.
+    </Note>
+
+  </Tab>
+</Tabs>

 ## Plan types and endpoints

@@ -75,16 +143,10 @@ The provider auto-selects the endpoint based on your auth choice. Canonical
 choices use the `qwen-*` family; `modelstudio-*` remains compatibility-only.
 You can override with a custom `baseUrl` in config.
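+
+As an illustration only, a `baseUrl` override might look like the sketch below.
+It assumes the override lives at `models.providers.qwen.baseUrl` and uses the
+Global Standard host; any other fields a provider entry requires are omitted.
+
+```json5
+{
+  models: {
+    providers: {
+      qwen: {
+        // Assumption: Global Standard (pay-as-you-go) DashScope host
+        baseUrl: "https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
+      },
+    },
+  },
+}
+```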

-Native Model Studio endpoints advertise streaming usage compatibility on the
-shared `openai-completions` transport. OpenClaw keys that off endpoint
-capabilities now, so DashScope-compatible custom provider ids targeting the
-same native hosts inherit the same streaming-usage behavior instead of
-requiring the built-in `qwen` provider id specifically.
-
-## Get your API key
-
- **Manage keys**: [home.qwencloud.com/api-keys](https://home.qwencloud.com/api-keys)
- **Docs**: [docs.qwencloud.com](https://docs.qwencloud.com/developer-guides/getting-started/introduction)
+<Tip>
+**Manage keys:** [home.qwencloud.com/api-keys](https://home.qwencloud.com/api-keys) |
+**Docs:** [docs.qwencloud.com](https://docs.qwencloud.com/developer-guides/getting-started/introduction)
+</Tip>

 ## Built-in catalog

@@ -104,71 +166,20 @@ the Standard endpoint.
 | `qwen/glm-4.7`              | text        | 202,752   | GLM                                                |
 | `qwen/kimi-k2.5`            | text, image | 262,144   | Moonshot AI via Alibaba                            |

+<Note>
 Availability can still vary by endpoint and billing plan even when a model is
 present in the bundled catalog.
-
-Native-streaming usage compatibility applies to both the Coding Plan hosts and
-the Standard DashScope-compatible hosts:
-
- `https://coding.dashscope.aliyuncs.com/v1`
- `https://coding-intl.dashscope.aliyuncs.com/v1`
- `https://dashscope.aliyuncs.com/compatible-mode/v1`
- `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`
-
-## Qwen 3.6 Plus availability
-
-`qwen3.6-plus` is available on the Standard (pay-as-you-go) Model Studio
-endpoints:
-
- China: `dashscope.aliyuncs.com/compatible-mode/v1`
- Global: `dashscope-intl.aliyuncs.com/compatible-mode/v1`
-
-If the Coding Plan endpoints return an "unsupported model" error for
-`qwen3.6-plus`, switch to Standard (pay-as-you-go) instead of the Coding Plan
-endpoint/key pair.
-
-## Capability plan
-
-The `qwen` extension is being positioned as the vendor home for the full Qwen
-Cloud surface, not just coding/text models.
-
- Text/chat models: bundled now
- Tool calling, structured output, thinking: inherited from the OpenAI-compatible transport
- Image generation: planned at the provider-plugin layer
- Image/video understanding: bundled now on the Standard endpoint
- Speech/audio: planned at the provider-plugin layer
- Memory embeddings/reranking: planned through the embedding adapter surface
- Video generation: bundled now through the shared video-generation capability
+</Note>

 ## Multimodal add-ons

-The `qwen` extension now also exposes:
+The `qwen` extension also exposes multimodal capabilities on the **Standard**
+DashScope endpoints (not the Coding Plan endpoints):

- Video understanding via `qwen-vl-max-latest`
- Wan video generation via:
-  - `wan2.6-t2v` (default)
-  - `wan2.6-i2v`
-  - `wan2.6-r2v`
-  - `wan2.6-r2v-flash`
-  - `wan2.7-r2v`
+- **Video understanding** via `qwen-vl-max-latest`
+- **Wan video generation** via `wan2.6-t2v` (default), `wan2.6-i2v`, `wan2.6-r2v`, `wan2.6-r2v-flash`, `wan2.7-r2v`

-These multimodal surfaces use the **Standard** DashScope endpoints, not the
-Coding Plan endpoints.
-
- Global/Intl Standard base URL: `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`
- China Standard base URL: `https://dashscope.aliyuncs.com/compatible-mode/v1`
-
-For video generation, OpenClaw maps the configured Qwen region to the matching
-DashScope AIGC host before submitting the job:
-
- Global/Intl: `https://dashscope-intl.aliyuncs.com`
- China: `https://dashscope.aliyuncs.com`
-
-That means a normal `models.providers.qwen.baseUrl` pointing at either the
-Coding Plan or Standard Qwen hosts still keeps video generation on the correct
-regional DashScope video endpoint.
-
-For video generation, set a default model explicitly:
+To use Qwen as the default video provider:

 ```json5
 {
@@ -180,22 +191,125 @@ For video generation, set a default model explicitly:
 }
 ```

-Current bundled Qwen video-generation limits:
+<Note>
+See [Video Generation](/tools/video-generation) for shared tool parameters, provider selection, and failover behavior.
+</Note>

- Up to **1** output video per request
- Up to **1** input image
- Up to **4** input videos
- Up to **10 seconds** duration
- Supports `size`, `aspectRatio`, `resolution`, `audio`, and `watermark`
- Reference image/video mode currently requires **remote http(s) URLs**. Local
-  file paths are rejected up front because the DashScope video endpoint does not
-  accept uploaded local buffers for those references.
+## Advanced

-See [Video Generation](/tools/video-generation) for the shared tool
-parameters, provider selection, and failover behavior.
+<AccordionGroup>
+  <Accordion title="Image and video understanding">
+    The bundled Qwen plugin registers media understanding for images and video
+    on the **Standard** DashScope endpoints (not the Coding Plan endpoints).

-## Environment note
+    | Property        | Value                |
+    | --------------- | -------------------- |
+    | Model           | `qwen-vl-max-latest` |
+    | Supported input | Images, video        |

-If the Gateway runs as a daemon (launchd/systemd), make sure `QWEN_API_KEY` is
-available to that process (for example, in `~/.openclaw/.env` or via
-`env.shellEnv`).
+    Media understanding is auto-resolved from the configured Qwen auth, so no
+    additional config is needed. Ensure you are using a Standard (pay-as-you-go)
+    endpoint for media understanding support.
+
+  </Accordion>
+
+  <Accordion title="Qwen 3.6 Plus availability">
+    `qwen3.6-plus` is available on the Standard (pay-as-you-go) Model Studio
+    endpoints:
+
+    - China: `dashscope.aliyuncs.com/compatible-mode/v1`
+    - Global: `dashscope-intl.aliyuncs.com/compatible-mode/v1`
+
+    If the Coding Plan endpoints return an "unsupported model" error for
+    `qwen3.6-plus`, switch to Standard (pay-as-you-go) instead of the Coding Plan
+    endpoint/key pair.
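+
+    After switching to a Standard key, a default-model entry might look like the
+    sketch below. The `qwen/qwen3.6-plus` ref is an assumption based on the
+    `qwen/...` prefix convention; confirm it with
+    `openclaw models list --provider qwen` before relying on it.
+
+    ```json5
+    {
+      agents: {
+        defaults: {
+          // Assumption: model ref follows the qwen/<model-id> convention
+          model: { primary: "qwen/qwen3.6-plus" },
+        },
+      },
+    }
+    ```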
+
+  </Accordion>
+
+  <Accordion title="Capability plan">
+    The `qwen` extension is being positioned as the vendor home for the full Qwen
+    Cloud surface, not just coding/text models.
+
+    - **Text/chat models:** bundled now
+    - **Tool calling, structured output, thinking:** inherited from the OpenAI-compatible transport
+    - **Image generation:** planned at the provider-plugin layer
+    - **Image/video understanding:** bundled now on the Standard endpoint
+    - **Speech/audio:** planned at the provider-plugin layer
+    - **Memory embeddings/reranking:** planned through the embedding adapter surface
+    - **Video generation:** bundled now through the shared video-generation capability
+
+  </Accordion>
+
+  <Accordion title="Video generation details">
+    For video generation, OpenClaw maps the configured Qwen region to the matching
+    DashScope AIGC host before submitting the job:
+
+    - Global/Intl: `https://dashscope-intl.aliyuncs.com`
+    - China: `https://dashscope.aliyuncs.com`
+
+    That means a normal `models.providers.qwen.baseUrl` pointing at either the
+    Coding Plan or Standard Qwen hosts still keeps video generation on the correct
+    regional DashScope video endpoint.
+
+    Current bundled Qwen video-generation limits:
+
+    - Up to **1** output video per request
+    - Up to **1** input image
+    - Up to **4** input videos
+    - Up to **10 seconds** duration
+    - Supports `size`, `aspectRatio`, `resolution`, `audio`, and `watermark`
+    - Reference image/video mode currently requires **remote http(s) URLs**. Local
+      file paths are rejected up front because the DashScope video endpoint does not
+      accept uploaded local buffers for those references.
+
+  </Accordion>
+
+  <Accordion title="Streaming usage compatibility">
+    Native Model Studio endpoints advertise streaming usage compatibility on the
+    shared `openai-completions` transport. OpenClaw now keys that behavior off
+    endpoint capabilities, so DashScope-compatible custom provider ids targeting
+    the same native hosts inherit the same streaming-usage behavior instead of
+    requiring the built-in `qwen` provider id specifically.
+
+    Native-streaming usage compatibility applies to both the Coding Plan hosts and
+    the Standard DashScope-compatible hosts:
+
+    - `https://coding.dashscope.aliyuncs.com/v1`
+    - `https://coding-intl.dashscope.aliyuncs.com/v1`
+    - `https://dashscope.aliyuncs.com/compatible-mode/v1`
+    - `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`
+
+  </Accordion>
+
+  <Accordion title="Multimodal endpoint regions">
+    Multimodal surfaces (video understanding and Wan video generation) use the
+    **Standard** DashScope endpoints, not the Coding Plan endpoints:
+
+    - Global/Intl Standard base URL: `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`
+    - China Standard base URL: `https://dashscope.aliyuncs.com/compatible-mode/v1`
+
+  </Accordion>
+
+  <Accordion title="Environment and daemon setup">
+    If the Gateway runs as a daemon (launchd/systemd), make sure `QWEN_API_KEY` is
+    available to that process (for example, in `~/.openclaw/.env` or via
+    `env.shellEnv`).
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model selection" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared video tool parameters and provider selection.
+  </Card>
+  <Card title="Alibaba (ModelStudio)" href="/providers/alibaba" icon="cloud">
+    Legacy ModelStudio provider and migration notes.
+  </Card>
+  <Card title="Troubleshooting" href="/help/troubleshooting" icon="wrench">
+    General troubleshooting and FAQ.
+  </Card>
+</CardGroup>
--- a/docs/providers/runway.md
+++ b/docs/providers/runway.md
@@ -11,25 +11,29 @@ read_when:

 OpenClaw ships a bundled `runway` provider for hosted video generation.

- Provider id: `runway`
- Auth: `RUNWAYML_API_SECRET` (canonical) or `RUNWAY_API_KEY`
- API: Runway task-based video generation (`GET /v1/tasks/{id}` polling)
+| Property    | Value                                                             |
+| ----------- | ----------------------------------------------------------------- |
+| Provider id | `runway`                                                          |
+| Auth        | `RUNWAYML_API_SECRET` (canonical) or `RUNWAY_API_KEY`             |
+| API         | Runway task-based video generation (`GET /v1/tasks/{id}` polling) |

-## Quick start
+## Getting started

-1. Set the API key:
-
-```bash
-openclaw onboard --auth-choice runway-api-key
-```
-
-2. Set Runway as the default video provider:
-
-```bash
-openclaw config set agents.defaults.videoGenerationModel.primary "runway/gen4.5"
-```
-
-3. Ask the agent to generate a video. Runway will be used automatically.
+<Steps>
+  <Step title="Set the API key">
+    ```bash
+    openclaw onboard --auth-choice runway-api-key
+    ```
+  </Step>
+  <Step title="Set Runway as the default video provider">
+    ```bash
+    openclaw config set agents.defaults.videoGenerationModel.primary "runway/gen4.5"
+    ```
+  </Step>
+  <Step title="Generate a video">
+    Ask the agent to generate a video. Runway will be used automatically.
+  </Step>
+</Steps>

 ## Supported modes

@@ -39,9 +43,14 @@ openclaw config set agents.defaults.videoGenerationModel.primary "runway/gen4.5"
 | Image-to-video | `gen4.5`           | 1 local or remote image |
 | Video-to-video | `gen4_aleph`       | 1 local or remote video |

- Local image and video references are supported via data URIs.
- Video-to-video currently requires `runway/gen4_aleph` specifically.
- Text-only runs currently expose `16:9` and `9:16` aspect ratios.
+<Note>
+Local image and video references are supported via data URIs. Text-only runs
+currently expose `16:9` and `9:16` aspect ratios.
+</Note>
+
+<Warning>
+Video-to-video currently requires `runway/gen4_aleph` specifically.
+</Warning>
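+
+For video-to-video work, the default could be pointed at that model instead. A
+minimal sketch, assuming the same `agents.defaults.videoGenerationModel.primary`
+key used in the getting-started step:
+
+```json5
+{
+  agents: {
+    defaults: {
+      // Assumption: same key as the `openclaw config set` step above
+      videoGenerationModel: { primary: "runway/gen4_aleph" },
+    },
+  },
+}
+```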

 ## Configuration

@@ -57,7 +66,28 @@ openclaw config set agents.defaults.videoGenerationModel.primary "runway/gen4.5"
 }
 ```

+## Advanced notes
+
+<AccordionGroup>
+  <Accordion title="Environment variable aliases">
+    OpenClaw recognizes both `RUNWAYML_API_SECRET` (canonical) and `RUNWAY_API_KEY`.
+    Either variable will authenticate the Runway provider.
+  </Accordion>
+
+  <Accordion title="Task polling">
+    Runway uses a task-based API. After submitting a generation request, OpenClaw
+    polls `GET /v1/tasks/{id}` until the video is ready. No additional
+    configuration is needed for the polling behavior.
+  </Accordion>
+</AccordionGroup>
+
 ## Related

- [Video Generation](/tools/video-generation) -- shared tool parameters, provider selection, and async behavior
- [Configuration Reference](/gateway/configuration-reference#agent-defaults)
+<CardGroup cols={2}>
+  <Card title="Video generation" href="/tools/video-generation" icon="video">
+    Shared tool parameters, provider selection, and async behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference#agent-defaults" icon="gear">
+    Agent default settings including video generation model.
+  </Card>
+</CardGroup>
--- a/docs/providers/sglang.md
+++ b/docs/providers/sglang.md
@@ -15,36 +15,44 @@ OpenClaw can also **auto-discover** available models from SGLang when you opt
 in with `SGLANG_API_KEY` (any value works if your server does not enforce auth)
 and you do not define an explicit `models.providers.sglang` entry.

-## Quick start
+## Getting started

-1. Start SGLang with an OpenAI-compatible server.
+<Steps>
+  <Step title="Start SGLang">
+    Launch SGLang with an OpenAI-compatible server. Your base URL should expose
+    `/v1` endpoints (for example `/v1/models`, `/v1/chat/completions`). SGLang
+    commonly runs on:

-Your base URL should expose `/v1` endpoints (for example `/v1/models`,
-`/v1/chat/completions`). SGLang commonly runs on:
+    - `http://127.0.0.1:30000/v1`

- `http://127.0.0.1:30000/v1`
+  </Step>
+  <Step title="Set an API key">
+    Any value works if no auth is configured on your server:

-2. Opt in (any value works if no auth is configured):
+    ```bash
+    export SGLANG_API_KEY="sglang-local"
+    ```

-```bash
-export SGLANG_API_KEY="sglang-local"
-```
+  </Step>
+  <Step title="Run onboarding or set a model directly">
+    ```bash
+    openclaw onboard
+    ```

-3. Run onboarding and choose `SGLang`, or set a model directly:
+    Or configure the model manually:

-```bash
-openclaw onboard
-```
+    ```json5
+    {
+      agents: {
+        defaults: {
+          model: { primary: "sglang/your-model-id" },
+        },
+      },
+    }
+    ```

-```json5
-{
-  agents: {
-    defaults: {
-      model: { primary: "sglang/your-model-id" },
-    },
-  },
-}
-```
+  </Step>
+</Steps>

 ## Model discovery (implicit provider)

@@ -55,8 +63,10 @@ define `models.providers.sglang`, OpenClaw will query:

 and convert the returned IDs into model entries.

+<Note>
 If you set `models.providers.sglang` explicitly, auto-discovery is skipped and
 you must define models manually.
+</Note>

 ## Explicit configuration (manual models)

@@ -91,25 +101,52 @@ Use explicit config when:
 }
 ```

-## Troubleshooting
+## Advanced configuration

- Check the server is reachable:
+<AccordionGroup>
+  <Accordion title="Proxy-style behavior">
+    SGLang is treated as a proxy-style OpenAI-compatible `/v1` backend, not a
+    native OpenAI endpoint.

-```bash
-curl http://127.0.0.1:30000/v1/models
-```
+    | Behavior                                                           | SGLang                                  |
+    | ------------------------------------------------------------------ | --------------------------------------- |
+    | OpenAI-only request shaping                                        | Not applied                             |
+    | `service_tier`, Responses `store`, prompt-cache hints              | Not sent                                |
+    | Reasoning-compat payload shaping                                   | Not applied                             |
+    | Hidden attribution headers (`originator`, `version`, `User-Agent`) | Not injected on custom SGLang base URLs |

- If requests fail with auth errors, set a real `SGLANG_API_KEY` that matches
-  your server configuration, or configure the provider explicitly under
-  `models.providers.sglang`.
+  </Accordion>

-## Proxy-style behavior
+  <Accordion title="Troubleshooting">
+    **Server not reachable**

-SGLang is treated as a proxy-style OpenAI-compatible `/v1` backend, not a
-native OpenAI endpoint.
+    Verify the server is running and responding:

- native OpenAI-only request shaping does not apply here
- no `service_tier`, no Responses `store`, no prompt-cache hints, and no
-  OpenAI reasoning-compat payload shaping
- hidden OpenClaw attribution headers (`originator`, `version`, `User-Agent`)
-  are not injected on custom SGLang base URLs
+    ```bash
+    curl http://127.0.0.1:30000/v1/models
+    ```
+
+    **Auth errors**
+
+    If requests fail with auth errors, set a real `SGLANG_API_KEY` that matches
+    your server configuration, or configure the provider explicitly under
+    `models.providers.sglang`.
+
+    <Tip>
+    If you run SGLang without authentication, any non-empty value for
+    `SGLANG_API_KEY` is sufficient to opt in to model discovery.
+    </Tip>
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Choosing providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema including provider entries.
+  </Card>
+</CardGroup>
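The discovery rules described in this file (an explicit `models.providers.sglang` entry skips auto-discovery, and any non-empty `SGLANG_API_KEY` opts in) can be summarized as a small decision function. This is a minimal sketch of the documented behavior, not OpenClaw's implementation; the function name `sglang_discovery_mode`, its arguments, and the three labels it prints are hypothetical, and the `disabled` branch is an inference from the "opt in" wording rather than something the docs state outright.

```bash
# Sketch only: models the documented discovery rules, not OpenClaw's code.
#
# sglang_discovery_mode <api-key> <explicit-provider: yes|no>
# Prints one of three hypothetical labels:
#   manual   - models.providers.sglang is set, so auto-discovery is skipped
#   discover - no explicit provider, and a non-empty key opts in to discovery
#   disabled - no explicit provider and no key (assumed: nothing opts in)
sglang_discovery_mode() {
  if [ "$2" = "yes" ]; then
    echo "manual"
  elif [ -n "$1" ]; then
    echo "discover"
  else
    echo "disabled"
  fi
}

# Report the mode for the current environment, assuming no explicit provider.
sglang_discovery_mode "${SGLANG_API_KEY:-}" "no"
```

When the result is `discover`, the `curl http://127.0.0.1:30000/v1/models` check from the Troubleshooting accordion is the quickest way to confirm the server actually returns model IDs.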
--- a/docs/providers/stepfun.md
+++ b/docs/providers/stepfun.md
@@ -13,49 +13,18 @@ OpenClaw includes a bundled StepFun provider plugin with two provider ids:
 - `stepfun` for the standard endpoint
 - `stepfun-plan` for the Step Plan endpoint

-The built-in catalogs currently differ by surface:
-
- Standard: `step-3.5-flash`
- Step Plan: `step-3.5-flash`, `step-3.5-flash-2603`
+<Warning>
+Standard and Step Plan are **separate providers** with different endpoints and model ref prefixes (`stepfun/...` vs `stepfun-plan/...`). Use a China key with the `.com` endpoints and a global key with the `.ai` endpoints.
+</Warning>

 ## Region and endpoint overview

- China standard endpoint: `https://api.stepfun.com/v1`
- Global standard endpoint: `https://api.stepfun.ai/v1`
- China Step Plan endpoint: `https://api.stepfun.com/step_plan/v1`
- Global Step Plan endpoint: `https://api.stepfun.ai/step_plan/v1`
- Auth env var: `STEPFUN_API_KEY`
+| Endpoint  | China (`.com`)                         | Global (`.ai`)                        |
+| --------- | -------------------------------------- | ------------------------------------- |
+| Standard  | `https://api.stepfun.com/v1`           | `https://api.stepfun.ai/v1`           |
+| Step Plan | `https://api.stepfun.com/step_plan/v1` | `https://api.stepfun.ai/step_plan/v1` |

-Use a China key with the `.com` endpoints and a global key with the `.ai`
-endpoints.
-
-## CLI setup
-
-Interactive setup:
-
-```bash
-openclaw onboard
-```
-
-Choose one of these auth choices:
-
- `stepfun-standard-api-key-cn`
- `stepfun-standard-api-key-intl`
- `stepfun-plan-api-key-cn`
- `stepfun-plan-api-key-intl`
-
-Non-interactive examples:
-
-```bash
-openclaw onboard --auth-choice stepfun-standard-api-key-intl --stepfun-api-key "$STEPFUN_API_KEY"
-openclaw onboard --auth-choice stepfun-plan-api-key-intl --stepfun-api-key "$STEPFUN_API_KEY"
-```
-
-## Model refs
-
- Standard default model: `stepfun/step-3.5-flash`
- Step Plan default model: `stepfun-plan/step-3.5-flash`
- Step Plan alternate model: `stepfun-plan/step-3.5-flash-2603`
+Auth env var: `STEPFUN_API_KEY`

 ## Built-in catalogs

@@ -72,81 +41,190 @@ Step Plan (`stepfun-plan`):
 | `stepfun-plan/step-3.5-flash`      | 262,144 | 65,536     | Default Step Plan model    |
 | `stepfun-plan/step-3.5-flash-2603` | 262,144 | 65,536     | Additional Step Plan model |

-## Config snippets
+## Getting started

-Standard provider:
+Choose your provider surface and follow the setup steps.

-```json5
-{
-  env: { STEPFUN_API_KEY: "your-key" },
-  agents: { defaults: { model: { primary: "stepfun/step-3.5-flash" } } },
-  models: {
-    mode: "merge",
-    providers: {
-      stepfun: {
-        baseUrl: "https://api.stepfun.ai/v1",
-        api: "openai-completions",
-        apiKey: "${STEPFUN_API_KEY}",
-        models: [
-          {
-            id: "step-3.5-flash",
-            name: "Step 3.5 Flash",
-            reasoning: true,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 65536,
+<Tabs>
+  <Tab title="Standard">
+    **Best for:** general-purpose use via the standard StepFun endpoint.
+
+    <Steps>
+      <Step title="Choose your endpoint region">
+        | Auth choice                     | Endpoint                     | Region        |
+        | ------------------------------- | ---------------------------- | ------------- |
+        | `stepfun-standard-api-key-intl` | `https://api.stepfun.ai/v1`  | International |
+        | `stepfun-standard-api-key-cn`   | `https://api.stepfun.com/v1` | China         |
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice stepfun-standard-api-key-intl
+        ```
+
+        Or for the China endpoint:
+
+        ```bash
+        openclaw onboard --auth-choice stepfun-standard-api-key-cn
+        ```
+      </Step>
+      <Step title="Non-interactive alternative">
+        ```bash
+        openclaw onboard --auth-choice stepfun-standard-api-key-intl \
+          --stepfun-api-key "$STEPFUN_API_KEY"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider stepfun
+        ```
+      </Step>
+    </Steps>
+
+    ### Model refs
+
+    - Default model: `stepfun/step-3.5-flash`
+
+  </Tab>
+
+  <Tab title="Step Plan">
+    **Best for:** use via the Step Plan reasoning endpoint.
+
+    <Steps>
+      <Step title="Choose your endpoint region">
+        | Auth choice                 | Endpoint                               | Region        |
+        | --------------------------- | -------------------------------------- | ------------- |
+        | `stepfun-plan-api-key-intl` | `https://api.stepfun.ai/step_plan/v1`  | International |
+        | `stepfun-plan-api-key-cn`   | `https://api.stepfun.com/step_plan/v1` | China         |
+      </Step>
+      <Step title="Run onboarding">
+        ```bash
+        openclaw onboard --auth-choice stepfun-plan-api-key-intl
+        ```
+
+        Or for the China endpoint:
+
+        ```bash
+        openclaw onboard --auth-choice stepfun-plan-api-key-cn
+        ```
+      </Step>
+      <Step title="Non-interactive alternative">
+        ```bash
+        openclaw onboard --auth-choice stepfun-plan-api-key-intl \
+          --stepfun-api-key "$STEPFUN_API_KEY"
+        ```
+      </Step>
+      <Step title="Verify models are available">
+        ```bash
+        openclaw models list --provider stepfun-plan
+        ```
+      </Step>
+    </Steps>
+
+    ### Model refs
+
+    - Default model: `stepfun-plan/step-3.5-flash`
+    - Alternate model: `stepfun-plan/step-3.5-flash-2603`
+
+  </Tab>
+</Tabs>
+
+## Advanced
+
+<AccordionGroup>
+  <Accordion title="Full config: Standard provider">
+    ```json5
+    {
+      env: { STEPFUN_API_KEY: "your-key" },
+      agents: { defaults: { model: { primary: "stepfun/step-3.5-flash" } } },
+      models: {
+        mode: "merge",
+        providers: {
+          stepfun: {
+            baseUrl: "https://api.stepfun.ai/v1",
+            api: "openai-completions",
+            apiKey: "${STEPFUN_API_KEY}",
+            models: [
+              {
+                id: "step-3.5-flash",
+                name: "Step 3.5 Flash",
+                reasoning: true,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 65536,
+              },
+            ],
          },
-        ],
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>

-Step Plan provider:
-
-```json5
-{
-  env: { STEPFUN_API_KEY: "your-key" },
-  agents: { defaults: { model: { primary: "stepfun-plan/step-3.5-flash" } } },
-  models: {
-    mode: "merge",
-    providers: {
-      "stepfun-plan": {
-        baseUrl: "https://api.stepfun.ai/step_plan/v1",
-        api: "openai-completions",
-        apiKey: "${STEPFUN_API_KEY}",
-        models: [
-          {
-            id: "step-3.5-flash",
-            name: "Step 3.5 Flash",
-            reasoning: true,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 65536,
+  <Accordion title="Full config: Step Plan provider">
+    ```json5
+    {
+      env: { STEPFUN_API_KEY: "your-key" },
+      agents: { defaults: { model: { primary: "stepfun-plan/step-3.5-flash" } } },
+      models: {
+        mode: "merge",
+        providers: {
+          "stepfun-plan": {
+            baseUrl: "https://api.stepfun.ai/step_plan/v1",
+            api: "openai-completions",
+            apiKey: "${STEPFUN_API_KEY}",
+            models: [
+              {
+                id: "step-3.5-flash",
+                name: "Step 3.5 Flash",
+                reasoning: true,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 65536,
+              },
+              {
+                id: "step-3.5-flash-2603",
+                name: "Step 3.5 Flash 2603",
+                reasoning: true,
+                input: ["text"],
+                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
+                contextWindow: 262144,
+                maxTokens: 65536,
+              },
+            ],
          },
-          {
-            id: "step-3.5-flash-2603",
-            name: "Step 3.5 Flash 2603",
-            reasoning: true,
-            input: ["text"],
-            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
-            contextWindow: 262144,
-            maxTokens: 65536,
-          },
-        ],
+        },
      },
-    },
-  },
-}
-```
+    }
+    ```
+  </Accordion>

-## Notes
+  <Accordion title="Notes">
+    - The provider is bundled with OpenClaw, so there is no separate plugin install step.
+    - `step-3.5-flash-2603` is currently exposed only on `stepfun-plan`.
+    - A single auth flow writes region-matched profiles for both `stepfun` and `stepfun-plan`, so both surfaces can be discovered together.
+    - Use `openclaw models list` and `openclaw models set <provider/model>` to inspect or switch models.
+  </Accordion>
+</AccordionGroup>

- The provider is bundled with OpenClaw, so there is no separate plugin install step.
- `step-3.5-flash-2603` is currently exposed only on `stepfun-plan`.
- A single auth flow writes region-matched profiles for both `stepfun` and `stepfun-plan`, so both surfaces can be discovered together.
- Use `openclaw models list` and `openclaw models set <provider/model>` to inspect or switch models.
- For the broader provider overview, see [Model providers](/concepts/model-providers).
+<Note>
+For the broader provider overview, see [Model providers](/concepts/model-providers).
+</Note>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Overview of all providers, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema for providers, models, and plugins.
+  </Card>
+  <Card title="Model selection" href="/concepts/models" icon="brain">
+    How to choose and configure models.
+  </Card>
+  <Card title="StepFun Platform" href="https://platform.stepfun.com" icon="globe">
+    StepFun API key management and documentation.
+  </Card>
+</CardGroup>
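The two "Choose your endpoint region" tables in this file reduce to a lookup from provider surface and region to an auth choice and base URL. The sketch below encodes exactly those four rows; the helper name `stepfun_target` and its `standard|plan` and `intl|cn` arguments are hypothetical conveniences, not part of the OpenClaw CLI.

```bash
# Sketch only: a lookup over the four rows of the StepFun endpoint tables.
#
# stepfun_target <surface: standard|plan> <region: intl|cn>
# Prints "<auth-choice> <base-url>", or fails for an unknown combination.
stepfun_target() {
  case "$1:$2" in
    standard:intl) echo "stepfun-standard-api-key-intl https://api.stepfun.ai/v1" ;;
    standard:cn)   echo "stepfun-standard-api-key-cn https://api.stepfun.com/v1" ;;
    plan:intl)     echo "stepfun-plan-api-key-intl https://api.stepfun.ai/step_plan/v1" ;;
    plan:cn)       echo "stepfun-plan-api-key-cn https://api.stepfun.com/step_plan/v1" ;;
    *) echo "unknown surface/region: $1/$2" >&2; return 1 ;;
  esac
}

# Example: print (not run) the non-interactive onboarding command for the
# international Step Plan endpoint.
target="$(stepfun_target plan intl)"
auth_choice="${target%% *}"   # text before the first space
base_url="${target#* }"       # text after the first space
echo "endpoint: $base_url"
echo "openclaw onboard --auth-choice $auth_choice --stepfun-api-key \"\$STEPFUN_API_KEY\""
```

Keeping the region in the lookup key makes the `.com`/`.ai` pairing from the Warning explicit: a China key only ever maps to a `.com` base URL.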
--- a/docs/providers/synthetic.md
+++ b/docs/providers/synthetic.md
@@ -8,23 +8,42 @@ title: "Synthetic"

 # Synthetic

-Synthetic exposes Anthropic-compatible endpoints. OpenClaw registers it as the
-`synthetic` provider and uses the Anthropic Messages API.
+[Synthetic](https://synthetic.new) exposes Anthropic-compatible endpoints.
+OpenClaw registers it as the `synthetic` provider and uses the Anthropic
+Messages API.

-## Quick setup
+| Property | Value                                 |
+| -------- | ------------------------------------- |
+| Provider | `synthetic`                           |
+| Auth     | `SYNTHETIC_API_KEY`                   |
+| API      | Anthropic Messages                    |
+| Base URL | `https://api.synthetic.new/anthropic` |

-1. Set `SYNTHETIC_API_KEY` (or run the wizard below).
-2. Run onboarding:
+## Getting started

-```bash
-openclaw onboard --auth-choice synthetic-api-key
-```
+<Steps>
+  <Step title="Get an API key">
+    Obtain a `SYNTHETIC_API_KEY` from your Synthetic account, or let the
+    onboarding wizard prompt you for one.
+  </Step>
+  <Step title="Run onboarding">
+    ```bash
+    openclaw onboard --auth-choice synthetic-api-key
+    ```
+  </Step>
+  <Step title="Verify the default model">
+    After onboarding, the default model is set to:
+    ```
+    synthetic/hf:MiniMaxAI/MiniMax-M2.5
+    ```
+  </Step>
+</Steps>

-The default model is set to:
-
-```
-synthetic/hf:MiniMaxAI/MiniMax-M2.5
-```
+<Warning>
+OpenClaw's Anthropic client appends `/v1` to the base URL automatically, so use
+`https://api.synthetic.new/anthropic` (not `/anthropic/v1`). If Synthetic
+changes its base URL, override `models.providers.synthetic.baseUrl`.
+</Warning>

 ## Config example

@@ -61,41 +80,77 @@ synthetic/hf:MiniMaxAI/MiniMax-M2.5
 }
 ```

-Note: OpenClaw's Anthropic client appends `/v1` to the base URL, so use
-`https://api.synthetic.new/anthropic` (not `/anthropic/v1`). If Synthetic changes
-its base URL, override `models.providers.synthetic.baseUrl`.
-
 ## Model catalog

-All models below use cost `0` (input/output/cache).
+All Synthetic models use cost `0` (input/output/cache).

 | Model ID                                               | Context window | Max tokens | Reasoning | Input        |
 | ------------------------------------------------------ | -------------- | ---------- | --------- | ------------ |
-| `hf:MiniMaxAI/MiniMax-M2.5`                            | 192000         | 65536      | false     | text         |
-| `hf:moonshotai/Kimi-K2-Thinking`                       | 256000         | 8192       | true      | text         |
-| `hf:zai-org/GLM-4.7`                                   | 198000         | 128000     | false     | text         |
-| `hf:deepseek-ai/DeepSeek-R1-0528`                      | 128000         | 8192       | false     | text         |
-| `hf:deepseek-ai/DeepSeek-V3-0324`                      | 128000         | 8192       | false     | text         |
-| `hf:deepseek-ai/DeepSeek-V3.1`                         | 128000         | 8192       | false     | text         |
-| `hf:deepseek-ai/DeepSeek-V3.1-Terminus`                | 128000         | 8192       | false     | text         |
-| `hf:deepseek-ai/DeepSeek-V3.2`                         | 159000         | 8192       | false     | text         |
-| `hf:meta-llama/Llama-3.3-70B-Instruct`                 | 128000         | 8192       | false     | text         |
-| `hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8` | 524000         | 8192       | false     | text         |
-| `hf:moonshotai/Kimi-K2-Instruct-0905`                  | 256000         | 8192       | false     | text         |
-| `hf:moonshotai/Kimi-K2.5`                              | 256000         | 8192       | true      | text + image |
-| `hf:openai/gpt-oss-120b`                               | 128000         | 8192       | false     | text         |
-| `hf:Qwen/Qwen3-235B-A22B-Instruct-2507`                | 256000         | 8192       | false     | text         |
-| `hf:Qwen/Qwen3-Coder-480B-A35B-Instruct`               | 256000         | 8192       | false     | text         |
-| `hf:Qwen/Qwen3-VL-235B-A22B-Instruct`                  | 250000         | 8192       | false     | text + image |
-| `hf:zai-org/GLM-4.5`                                   | 128000         | 128000     | false     | text         |
-| `hf:zai-org/GLM-4.6`                                   | 198000         | 128000     | false     | text         |
-| `hf:zai-org/GLM-5`                                     | 256000         | 128000     | true      | text + image |
-| `hf:deepseek-ai/DeepSeek-V3`                           | 128000         | 8192       | false     | text         |
-| `hf:Qwen/Qwen3-235B-A22B-Thinking-2507`                | 256000         | 8192       | true      | text         |
+| `hf:MiniMaxAI/MiniMax-M2.5`                            | 192,000        | 65,536     | no        | text         |
+| `hf:moonshotai/Kimi-K2-Thinking`                       | 256,000        | 8,192      | yes       | text         |
+| `hf:zai-org/GLM-4.7`                                   | 198,000        | 128,000    | no        | text         |
+| `hf:deepseek-ai/DeepSeek-R1-0528`                      | 128,000        | 8,192      | no        | text         |
+| `hf:deepseek-ai/DeepSeek-V3-0324`                      | 128,000        | 8,192      | no        | text         |
+| `hf:deepseek-ai/DeepSeek-V3.1`                         | 128,000        | 8,192      | no        | text         |
+| `hf:deepseek-ai/DeepSeek-V3.1-Terminus`                | 128,000        | 8,192      | no        | text         |
+| `hf:deepseek-ai/DeepSeek-V3.2`                         | 159,000        | 8,192      | no        | text         |
+| `hf:meta-llama/Llama-3.3-70B-Instruct`                 | 128,000        | 8,192      | no        | text         |
+| `hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8` | 524,000        | 8,192      | no        | text         |
+| `hf:moonshotai/Kimi-K2-Instruct-0905`                  | 256,000        | 8,192      | no        | text         |
+| `hf:moonshotai/Kimi-K2.5`                              | 256,000        | 8,192      | yes       | text + image |
+| `hf:openai/gpt-oss-120b`                               | 128,000        | 8,192      | no        | text         |
+| `hf:Qwen/Qwen3-235B-A22B-Instruct-2507`                | 256,000        | 8,192      | no        | text         |
+| `hf:Qwen/Qwen3-Coder-480B-A35B-Instruct`               | 256,000        | 8,192      | no        | text         |
+| `hf:Qwen/Qwen3-VL-235B-A22B-Instruct`                  | 250,000        | 8,192      | no        | text + image |
+| `hf:zai-org/GLM-4.5`                                   | 128,000        | 128,000    | no        | text         |
+| `hf:zai-org/GLM-4.6`                                   | 198,000        | 128,000    | no        | text         |
+| `hf:zai-org/GLM-5`                                     | 256,000        | 128,000    | yes       | text + image |
+| `hf:deepseek-ai/DeepSeek-V3`                           | 128,000        | 8,192      | no        | text         |
+| `hf:Qwen/Qwen3-235B-A22B-Thinking-2507`                | 256,000        | 8,192      | yes       | text         |

-## Notes
+<Tip>
+Model refs use the form `synthetic/<modelId>`. Use
+`openclaw models list --provider synthetic` to see all models available on your
+account.
+</Tip>

- Model refs use `synthetic/<modelId>`.
- If you enable a model allowlist (`agents.defaults.models`), add every model you
-  plan to use.
- See [Model providers](/concepts/model-providers) for provider rules.
+<AccordionGroup>
+  <Accordion title="Model allowlist">
+    If you enable a model allowlist (`agents.defaults.models`), add every
+    Synthetic model you plan to use. Models not in the allowlist will be hidden
+    from the agent.
+  </Accordion>
+
+  <Accordion title="Base URL override">
+    If Synthetic changes its API endpoint, override the base URL in your config:
+
+    ```json5
+    {
+      models: {
+        providers: {
+          synthetic: {
+            baseUrl: "https://new-api.synthetic.new/anthropic",
+          },
+        },
+      },
+    }
+    ```
+
+    OpenClaw's Anthropic client appends `/v1` automatically, so leave `/v1` off
+    the override.
+
+  </Accordion>
+</AccordionGroup>
+
+## Related
+
+<CardGroup cols={2}>
+  <Card title="Model providers" href="/concepts/model-providers" icon="layers">
+    Provider rules, model refs, and failover behavior.
+  </Card>
+  <Card title="Configuration reference" href="/gateway/configuration-reference" icon="gear">
+    Full config schema including provider settings.
+  </Card>
+  <Card title="Synthetic" href="https://synthetic.new" icon="arrow-up-right-from-square">
+    Synthetic dashboard and API docs.
+  </Card>
+</CardGroup>
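Two details in this file are easy to get wrong by hand: Synthetic model IDs contain slashes of their own (for example `hf:MiniMaxAI/MiniMax-M2.5`), and the base URL must not already end in `/v1`. The sketch below illustrates both under stated assumptions: `split_model_ref` and `normalize_synthetic_base_url` are hypothetical helpers, and splitting on the first slash is inferred from the documented `synthetic/<modelId>` form rather than taken from OpenClaw's parser.

```bash
# Sketch only: hypothetical helpers, not OpenClaw functions.

# split_model_ref <provider/modelId>
# Prints the provider on line 1 and the model ID on line 2. Splits on the
# FIRST slash only, so slashes inside the model ID are preserved.
split_model_ref() {
  case "$1" in
    */*) printf '%s\n%s\n' "${1%%/*}" "${1#*/}" ;;
    *) echo "not a provider/model ref: $1" >&2; return 1 ;;
  esac
}

# normalize_synthetic_base_url <url>
# Drops one trailing "/" and then a trailing "/v1", so the "/v1" that the
# Anthropic client appends is not doubled.
normalize_synthetic_base_url() {
  url="${1%/}"
  echo "${url%/v1}"
}

split_model_ref "synthetic/hf:MiniMaxAI/MiniMax-M2.5"
normalize_synthetic_base_url "https://api.synthetic.new/anthropic/v1"
```

Running the normalizer over a value before writing it to `models.providers.synthetic.baseUrl` avoids the `/anthropic/v1` mistake called out in the Warning.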