ci: add process exec codeql security shard

2026-06-20 22:12:53 +08:00 · 2026-06-13 20:38:21 +08:00
1100 changed files with 26299 additions and 96431 deletions
--- a/.agents/skills/crabbox/SKILL.md
+++ b/.agents/skills/crabbox/SKILL.md
@@ -54,13 +54,6 @@ pnpm crabbox:run -- --help | sed -n '1,120p'
 - For broad OpenClaw maintainer `pnpm` gates, prefer the repo wrapper with
  `--provider blacksmith-testbox` or the repo Testbox helpers when the standing
  Testbox policy applies.
 - Cold Testbox acquisition and hydration often take tens of seconds. When broad
  remote proof is likely, immediately start
  `node scripts/crabbox-wrapper.mjs warmup --provider blacksmith-testbox --keep --timing-json`
  in a background command session while inspecting, editing, and running
  focused local tests. Poll later, reuse the returned `tbx_...` with
  `--provider blacksmith-testbox --id <tbx_id>`, and stop it before handoff.
  Do not warm speculatively when remote proof is unlikely.
 - Always report the actual provider and id. `cbx_...` means AWS Crabbox;
  `tbx_...` means Blacksmith Testbox through Crabbox. If the output only says
  `blacksmith testbox list`, use `blacksmith testbox list --all` before
--- a/.agents/skills/release-openclaw-maintainer/SKILL.md
+++ b/.agents/skills/release-openclaw-maintainer/SKILL.md
@@ -150,21 +150,9 @@ Use this skill for release and publish-time workflow. Load `$release-private` if
 - Stable Windows Hub release closeout requires the signed
  `OpenClawCompanion-Setup-x64.exe`, `OpenClawCompanion-Setup-arm64.exe`, and
  `OpenClawCompanion-SHA256SUMS.txt` assets on the canonical
-  `openclaw/openclaw` GitHub Release. Pass the exact signed
+  `openclaw/openclaw` GitHub Release. Use the public `Windows Node Release`
-  `openclaw/openclaw-windows-node` release tag as `windows_node_tag` to
+  workflow after the matching `openclaw/openclaw-windows-node` release exists;
-  `OpenClaw Release Publish`, together with the candidate-approved
+  it verifies Authenticode signatures on Windows before uploading assets.
  `windows_node_installer_digests` map; it prevalidates the published source
  release and required installers against that map before any publish child,
  dispatches the public `Windows Node Release` workflow while the OpenClaw
  release is still a draft, carries those pinned source asset digests
  unchanged, verifies the expected OpenClaw Foundation Authenticode signer on
  Windows, re-downloads and checksum-verifies the promoted asset contract, and
  blocks publication until the canonical asset contract is present. Use direct
  `Windows Node Release` dispatch only for recovery, always with an exact tag,
  never `latest`, and the explicit `expected_installer_digests` JSON map from
  the approved source release. Recovery rejects unexpected
  `OpenClawCompanion-*` target asset names, then replaces the expected contract
  assets with the pinned source bytes.
 - Website Windows Hub download links should target exact canonical
  `openclaw/openclaw/releases/download/vYYYY.M.PATCH/...` assets for the current
  stable release, or `releases/latest/download/...` only after verifying the
@@ -687,23 +675,19 @@ node --import tsx scripts/openclaw-npm-postpublish-verify.ts <published-version>
    where npm did not publish the beta version, delete/recreate the same beta
    tag and any accidental draft/incomplete prerelease at the fixed commit
    instead of skipping a prerelease number.
-22. Start `.github/workflows/openclaw-release-publish.yml` from the same branch with
+22. Start `.github/workflows/openclaw-npm-release.yml` from the same branch with
    the same tag for the real publish, choose `npm_dist_tag` (`beta` default,
    `latest` only when you intentionally want direct stable publish), keep it
    the same as the preflight run, and pass the successful npm
-    `preflight_run_id` plus the successful `full_release_validation_run_id`.
+    `preflight_run_id`.
    For stable publish, also pass the exact non-prerelease
    `openclaw/openclaw-windows-node` tag as `windows_node_tag` and its
    candidate-approved installer digest map as `windows_node_installer_digests`.
 23. Wait for `npm-release` approval from `@openclaw/openclaw-release-managers`.
 24. Wait for the real publish workflow to run postpublish verification,
    create or update the GitHub release as a draft, upload dependency evidence,
    promote and verify the required Windows Hub assets for stable releases,
    append release verification proof, and only then undraft/publish it. If a
-    waited plugin publish or Windows Hub promotion fails after OpenClaw npm
+    waited plugin publish fails after OpenClaw npm succeeds, the workflow keeps
-    succeeds, the workflow keeps the release draft with OpenClaw npm evidence
+    the release draft with OpenClaw npm evidence and exits red; do not undraft
-    and exits red; do not undraft until the gap is repaired. The standalone
+    until the plugin publish gap is repaired. The standalone verifier command
-    verifier command remains the recovery probe:
+    remains the recovery probe:
    `node --import tsx scripts/openclaw-npm-postpublish-verify.ts <published-version>`.
 25. Run the post-published beta verification roster. First scan current `main`
    for critical fixes that landed after the release branch cut; backport only
--- a/.github/codeql/codeql-process-exec-boundary-critical-security.yml
+++ b/.github/codeql/codeql-process-exec-boundary-critical-security.yml
@@ -0,0 +1,61 @@
 name: openclaw-codeql-process-exec-boundary-critical-security
 disable-default-queries: true
 queries:
  - uses: security-extended
 query-filters:
  - include:
      precision:
        - high
        - very-high
      tags contain: security
      security-severity: /([7-9]|10)\.(\d)+/
 paths:
  - src/process
  - src/tui/tui-local-shell.ts
  - src/tui/tui.ts
  - src/plugin-sdk/windows-spawn.ts
  - packages/agent-core/src/harness/env
  - packages/memory-host-sdk/src/host
  - extensions/acpx/src
  - extensions/bonjour/src/advertiser.ts
  - extensions/browser/src/browser/chrome-mcp.ts
  - extensions/browser/src/browser/chrome.executables.ts
  - extensions/browser/src/browser/chrome.ts
  - extensions/codex/src/app-server/sandbox-exec-server
  - extensions/codex/src/app-server/transport-stdio.ts
  - extensions/codex/src/node-cli-sessions.ts
  - extensions/codex-supervisor/src/json-rpc-client.ts
  - extensions/file-transfer/src
  - extensions/google-meet/src
  - extensions/imessage/src
  - extensions/memory-core/src/memory/qmd-manager.ts
  - extensions/memory-wiki/src/obsidian.ts
  - extensions/microsoft-foundry/cli.ts
  - extensions/ollama/src/wsl2-crash-loop-check.ts
  - extensions/qa-lab/src
  - extensions/signal/src/daemon.ts
  - extensions/tts-local-cli/speech-provider.ts
  - extensions/voice-call/src
  - scripts
 paths-ignore:
  - "**/node_modules"
  - "**/coverage"
  - "**/*.generated.ts"
  - "**/*.bundle.js"
  - "**/*-runtime.js"
  - "**/*.test.ts"
  - "**/*.test.tsx"
  - "**/*.spec.ts"
  - "**/*.spec.tsx"
  - "**/*.e2e.test.ts"
  - "**/*.e2e.test.tsx"
  - "**/*test-support*"
  - "**/*test-helper*"
  - "**/*mock*"
  - "**/*fixture*"
  - "**/*bench*"
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -1358,8 +1358,6 @@ jobs:
          - check_name: check-additional-boundaries-bcd
            group: boundaries
            boundary_shard: 2/4,3/4,4/4
          - check_name: check-session-accessor-boundary
            group: session-accessor-boundary
          - check_name: check-additional-extension-channels
            group: extension-channels
          - check_name: check-additional-extension-bundled
@@ -1506,15 +1504,6 @@ jobs:
            boundaries)
              node scripts/run-additional-boundary-checks.mjs
              ;;
            session-accessor-boundary)
              if [ ! -f scripts/check-session-accessor-boundary.mjs ]; then
                echo "[skip] session accessor boundary check is not present in this checkout"
              elif ! node -e 'const pkg = require("./package.json"); process.exit(pkg.scripts?.["lint:tmp:session-accessor-boundary"] ? 0 : 1);'; then
                echo "[skip] session accessor boundary script is not present in package.json"
              else
                run_check "lint:tmp:session-accessor-boundary" pnpm run lint:tmp:session-accessor-boundary
              fi
              ;;
            extension-channels)
              run_check "lint:extensions:channels" pnpm run lint:extensions:channels
              ;;
--- a/.github/workflows/codeql.yml
+++ b/.github/workflows/codeql.yml
@@ -17,7 +17,28 @@ on:
      - ".github/actions/**"
      - ".github/codeql/**"
      - ".github/workflows/**"
      - "extensions/acpx/src/**"
      - "extensions/bonjour/src/advertiser.ts"
      - "extensions/browser/src/browser/chrome-mcp.ts"
      - "extensions/browser/src/browser/chrome.executables.ts"
      - "extensions/browser/src/browser/chrome.ts"
      - "extensions/codex/src/app-server/sandbox-exec-server/**"
      - "extensions/codex/src/app-server/transport-stdio.ts"
      - "extensions/codex/src/node-cli-sessions.ts"
      - "extensions/codex-supervisor/src/json-rpc-client.ts"
      - "extensions/file-transfer/src/**"
      - "extensions/google-meet/src/**"
      - "extensions/imessage/src/**"
      - "extensions/memory-core/src/memory/qmd-manager.ts"
      - "extensions/memory-wiki/src/obsidian.ts"
      - "extensions/microsoft-foundry/cli.ts"
      - "extensions/ollama/src/wsl2-crash-loop-check.ts"
      - "extensions/qa-lab/src/**"
      - "extensions/signal/src/daemon.ts"
      - "extensions/tts-local-cli/speech-provider.ts"
      - "extensions/voice-call/src/**"
      - "packages/**"
      - "scripts/**"
      - "src/**"
  push:
    branches:
@@ -26,7 +47,28 @@ on:
      - ".github/actions/**"
      - ".github/codeql/**"
      - ".github/workflows/**"
      - "extensions/acpx/src/**"
      - "extensions/bonjour/src/advertiser.ts"
      - "extensions/browser/src/browser/chrome-mcp.ts"
      - "extensions/browser/src/browser/chrome.executables.ts"
      - "extensions/browser/src/browser/chrome.ts"
      - "extensions/codex/src/app-server/sandbox-exec-server/**"
      - "extensions/codex/src/app-server/transport-stdio.ts"
      - "extensions/codex/src/node-cli-sessions.ts"
      - "extensions/codex-supervisor/src/json-rpc-client.ts"
      - "extensions/file-transfer/src/**"
      - "extensions/google-meet/src/**"
      - "extensions/imessage/src/**"
      - "extensions/memory-core/src/memory/qmd-manager.ts"
      - "extensions/memory-wiki/src/obsidian.ts"
      - "extensions/microsoft-foundry/cli.ts"
      - "extensions/ollama/src/wsl2-crash-loop-check.ts"
      - "extensions/qa-lab/src/**"
      - "extensions/signal/src/daemon.ts"
      - "extensions/tts-local-cli/speech-provider.ts"
      - "extensions/voice-call/src/**"
      - "packages/**"
      - "scripts/**"
      - "src/**"
  schedule:
    - cron: "0 6 * * *"
@@ -73,6 +115,11 @@ jobs:
            runs_on: blacksmith-4vcpu-ubuntu-2404
            timeout_minutes: 25
            config_file: ./.github/codeql/codeql-mcp-process-tool-boundary-critical-security.yml
          - language: javascript-typescript
            category: process-exec-boundary
            runs_on: blacksmith-4vcpu-ubuntu-2404
            timeout_minutes: 25
            config_file: ./.github/codeql/codeql-process-exec-boundary-critical-security.yml
          - language: javascript-typescript
            category: plugin-trust-boundary
            runs_on: blacksmith-4vcpu-ubuntu-2404
--- a/.github/workflows/full-release-validation.yml
+++ b/.github/workflows/full-release-validation.yml
@@ -783,7 +783,7 @@ jobs:
          fi
          args=(
-            -f ref="$TARGET_REF"
+            -f ref="$TARGET_SHA"
            -f expected_sha="$TARGET_SHA"
            -f provider="$PROVIDER"
            -f mode="$MODE"
--- a/.github/workflows/mantis-telegram-live.yml
+++ b/.github/workflows/mantis-telegram-live.yml
@@ -379,6 +379,7 @@ jobs:
          OPENCLAW_QA_CONVEX_SECRET_CI: ${{ secrets.OPENCLAW_QA_CONVEX_SECRET_CI }}
          OPENCLAW_QA_CREDENTIAL_ACQUIRE_TIMEOUT_MS: "1800000"
          OPENCLAW_QA_REDACT_PUBLIC_METADATA: "1"
          OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT: "1"
          CRABBOX_COORDINATOR: ${{ secrets.CRABBOX_COORDINATOR }}
          CRABBOX_COORDINATOR_TOKEN: ${{ secrets.CRABBOX_COORDINATOR_TOKEN }}
          OPENCLAW_QA_MANTIS_CRABBOX_COORDINATOR: ${{ secrets.OPENCLAW_QA_MANTIS_CRABBOX_COORDINATOR }}
--- a/.github/workflows/npm-telegram-beta-e2e.yml
+++ b/.github/workflows/npm-telegram-beta-e2e.yml
@@ -220,6 +220,7 @@ jobs:
          OPENCLAW_QA_CONVEX_SECRET_CI: ${{ secrets.OPENCLAW_QA_CONVEX_SECRET_CI }}
          OPENCLAW_QA_CREDENTIAL_ACQUIRE_TIMEOUT_MS: "1800000"
          OPENCLAW_QA_REDACT_PUBLIC_METADATA: "1"
          OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT: "1"
          INPUT_SCENARIO: ${{ inputs.scenario }}
          PACKAGE_ARTIFACT_NAME: ${{ inputs.package_artifact_name || '' }}
        run: |
--- a/.github/workflows/openclaw-live-and-e2e-checks-reusable.yml
+++ b/.github/workflows/openclaw-live-and-e2e-checks-reusable.yml
@@ -420,7 +420,6 @@ jobs:
              add_suite live-cache
              add_profile_suite native-live-src-agents "stable full"
              add_profile_suite native-live-src-agents-zai-coding "stable full"
              add_profile_suite native-live-src-gateway-core "beta minimum stable full"
              add_profile_suite native-live-src-gateway-profiles-anthropic "stable full"
              add_profile_suite native-live-src-gateway-profiles-anthropic-smoke "stable"
@@ -1957,12 +1956,6 @@ jobs:
            timeout_minutes: 60
            profile_env_only: false
            profiles: stable full
          - suite_id: native-live-src-agents-zai-coding
            label: Native live Z.AI Coding Plan
            command: ZAI_CODING_LIVE_TEST=1 node .release-harness/scripts/test-live-shard.mjs native-live-src-agents-zai-coding
            timeout_minutes: 15
            profile_env_only: false
            profiles: stable full
          - suite_id: native-live-src-gateway-core
            label: Native live gateway core
            command: OPENCLAW_LIVE_CODEX_HARNESS=1 OPENCLAW_LIVE_CODEX_HARNESS_AUTH=api-key node .release-harness/scripts/test-live-shard.mjs native-live-src-gateway-core
--- a/.github/workflows/openclaw-release-checks.yml
+++ b/.github/workflows/openclaw-release-checks.yml
@@ -1181,7 +1181,7 @@ jobs:
  runtime_tool_coverage_release_checks:
    name: Enforce QA Lab runtime tool coverage
    needs: [resolve_target, qa_lab_runtime_parity_release_checks]
-    if: contains(fromJSON('["all","qa","qa-parity"]'), needs.resolve_target.outputs.rerun_group)
+    if: always() && contains(fromJSON('["all","qa","qa-parity"]'), needs.resolve_target.outputs.rerun_group)
    runs-on: ubuntu-24.04
    timeout-minutes: 15
    permissions:
@@ -1204,35 +1204,13 @@ jobs:
          node-version: ${{ env.NODE_VERSION }}
          install-bun: "true"
      - name: Download runtime parity status
        uses: actions/download-artifact@v8
        with:
          name: release-check-status-qa-runtime-parity-${{ needs.resolve_target.outputs.revision }}
          path: .artifacts/release-check-status/
      - name: Verify runtime parity producer status
        id: verify_runtime_parity_status
        shell: bash
        run: |
          set -euo pipefail
          status_path=".artifacts/release-check-status/qa_lab_runtime_parity_release_checks.env"
          status="$(sed -n 's/^status=//p' "$status_path" | tail -n 1)"
          if [[ "$status" != "success" ]]; then
            echo "Runtime parity producer status is ${status:-missing}; skipping coverage artifact consumer."
            echo "ready=false" >> "$GITHUB_OUTPUT"
            exit 0
          fi
          echo "ready=true" >> "$GITHUB_OUTPUT"
      - name: Download runtime parity artifacts
        if: steps.verify_runtime_parity_status.outputs.ready == 'true'
        uses: actions/download-artifact@v8
        with:
          name: release-qa-runtime-parity-${{ needs.resolve_target.outputs.revision }}
          path: .artifacts/qa-e2e/
      - name: Enforce standard runtime tool coverage
        if: steps.verify_runtime_parity_status.outputs.ready == 'true'
        run: |
          set -euo pipefail
          pnpm openclaw qa coverage \
@@ -1434,6 +1412,7 @@ jobs:
          OPENCLAW_QA_CONVEX_SECRET_CI: ${{ secrets.OPENCLAW_QA_CONVEX_SECRET_CI }}
          OPENCLAW_QA_CREDENTIAL_ACQUIRE_TIMEOUT_MS: "1800000"
          OPENCLAW_QA_REDACT_PUBLIC_METADATA: "1"
          OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT: "1"
        run: |
          set -euo pipefail
--- a/.github/workflows/openclaw-release-publish.yml
+++ b/.github/workflows/openclaw-release-publish.yml
@@ -15,14 +15,6 @@ on:
        description: Successful Full Release Validation run id for this tag/SHA, required when publish_openclaw_npm=true
        required: false
        type: string
      windows_node_tag:
        description: Exact openclaw-windows-node release tag, required for stable OpenClaw publish
        required: false
        type: string
      windows_node_installer_digests:
        description: Candidate-approved compact JSON map of Windows installer names to pinned sha256 digests
        required: false
        type: string
      npm_telegram_run_id:
        description: Optional successful NPM Telegram Beta E2E run id to include in final release evidence
        required: false
@@ -89,15 +81,12 @@ jobs:
    outputs:
      sha: ${{ steps.manifest.outputs.sha || steps.ref.outputs.sha }}
      preflight_artifact_name: ${{ steps.preflight_artifact.outputs.name }}
      windows_node_installer_digests: ${{ steps.windows_source.outputs.installer_digests }}
    steps:
      - name: Validate inputs
        env:
          RELEASE_TAG: ${{ inputs.tag }}
          PREFLIGHT_RUN_ID: ${{ inputs.preflight_run_id }}
          FULL_RELEASE_VALIDATION_RUN_ID: ${{ inputs.full_release_validation_run_id }}
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
          WINDOWS_NODE_INSTALLER_DIGESTS: ${{ inputs.windows_node_installer_digests }}
          PUBLISH_OPENCLAW_NPM: ${{ inputs.publish_openclaw_npm && 'true' || 'false' }}
          PLUGIN_PUBLISH_SCOPE: ${{ inputs.plugin_publish_scope }}
          PLUGINS: ${{ inputs.plugins }}
@@ -126,22 +115,6 @@ jobs:
            echo "publish_openclaw_npm=true requires full_release_validation_run_id." >&2
            exit 1
          fi
          stable_release=true
          if [[ "${RELEASE_TAG}" == *"-alpha."* || "${RELEASE_TAG}" == *"-beta."* ]]; then
            stable_release=false
          fi
          if [[ -n "${WINDOWS_NODE_TAG}" && ! "${WINDOWS_NODE_TAG}" =~ ^v[0-9]+\.[0-9]+\.[0-9]+([-.][0-9A-Za-z]+([.-][0-9A-Za-z]+)*)?$ ]]; then
            echo "windows_node_tag must be an explicit openclaw-windows-node release tag, not latest: ${WINDOWS_NODE_TAG}" >&2
            exit 1
          fi
          if [[ "${PUBLISH_OPENCLAW_NPM}" == "true" && "${stable_release}" == "true" && -z "${WINDOWS_NODE_TAG}" ]]; then
            echo "Stable OpenClaw publish requires an explicit windows_node_tag." >&2
            exit 1
          fi
          if [[ "${PUBLISH_OPENCLAW_NPM}" == "true" && "${stable_release}" == "true" && -z "${WINDOWS_NODE_INSTALLER_DIGESTS}" ]]; then
            echo "Stable OpenClaw publish requires candidate-approved windows_node_installer_digests." >&2
            exit 1
          fi
          tideclaw_alpha_publish=false
          if [[ "${RELEASE_TAG}" == *"-alpha."* && "${RELEASE_NPM_DIST_TAG}" == "alpha" && "${WORKFLOW_REF}" =~ ^refs/heads/tideclaw/alpha/[0-9]{4}-[0-9]{2}-[0-9]{2}-[0-9]{4}Z$ ]]; then
            tideclaw_alpha_publish=true
@@ -170,73 +143,6 @@ jobs:
              ;;
          esac
      - name: Validate stable Windows source release
        id: windows_source
        if: ${{ inputs.publish_openclaw_npm }}
        env:
          GH_TOKEN: ${{ github.token }}
          RELEASE_TAG: ${{ inputs.tag }}
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
          APPROVED_INSTALLER_DIGESTS: ${{ inputs.windows_node_installer_digests }}
        run: |
          set -euo pipefail
          if [[ "${RELEASE_TAG}" == *"-alpha."* || "${RELEASE_TAG}" == *"-beta."* ]]; then
            exit 0
          fi
          source_json="$(gh release view "${WINDOWS_NODE_TAG}" \
            --repo openclaw/openclaw-windows-node \
            --json tagName,isDraft,isPrerelease,assets,url)"
          if [[ "$(printf '%s' "${source_json}" | jq -r '.tagName')" != "${WINDOWS_NODE_TAG}" ]]; then
            echo "Windows source release tag does not match ${WINDOWS_NODE_TAG}." >&2
            exit 1
          fi
          if [[ "$(printf '%s' "${source_json}" | jq -r '.isDraft')" == "true" ]]; then
            echo "Stable OpenClaw publish requires a published Windows source release." >&2
            exit 1
          fi
          if [[ "$(printf '%s' "${source_json}" | jq -r '.isPrerelease')" == "true" ]]; then
            echo "Stable OpenClaw publish requires a non-prerelease Windows source release." >&2
            exit 1
          fi
          required_assets=(
            "OpenClawCompanion-Setup-x64.exe"
            "OpenClawCompanion-Setup-arm64.exe"
          )
          required_assets_json="$(printf '%s\n' "${required_assets[@]}" | jq -R . | jq -sc .)"
          if ! approved_installer_digests="$(printf '%s' "${APPROVED_INSTALLER_DIGESTS}" | jq -ce --argjson names "${required_assets_json}" '
            if type == "object" and
              (keys | sort) == ($names | sort) and
              all(.[]; type == "string" and test("^sha256:[a-f0-9]{64}$"))
            then .
            else error("invalid candidate-approved Windows installer digest map")
            end
          ')"; then
            echo "windows_node_installer_digests must contain exactly the candidate-approved current installer asset contract." >&2
            exit 1
          fi
          for asset_name in "${required_assets[@]}"; do
            asset_matches="$(printf '%s' "${source_json}" | jq -c --arg name "${asset_name}" '[.assets[]? | select(.name == $name)]')"
            asset_match_count="$(printf '%s' "${asset_matches}" | jq 'length')"
            if [[ "${asset_match_count}" != "1" ]]; then
              echo "Windows source release ${WINDOWS_NODE_TAG} must contain exactly one required asset ${asset_name}; found ${asset_match_count}." >&2
              exit 1
            fi
            asset_digest="$(printf '%s' "${asset_matches}" | jq -r '.[0].digest // empty')"
            if [[ ! "${asset_digest}" =~ ^sha256:[a-f0-9]{64}$ ]]; then
              echo "Windows source release ${WINDOWS_NODE_TAG} asset ${asset_name} is missing its immutable SHA-256 digest." >&2
              exit 1
            fi
            approved_digest="$(printf '%s' "${approved_installer_digests}" | jq -r --arg name "${asset_name}" '.[$name]')"
            if [[ "${asset_digest}" != "${approved_digest}" ]]; then
              echo "Windows source release ${WINDOWS_NODE_TAG} asset ${asset_name} no longer matches its candidate-approved digest." >&2
              exit 1
            fi
          done
          echo "installer_digests=${approved_installer_digests}" >> "$GITHUB_OUTPUT"
          echo "- Windows Node source release: prevalidated \`${WINDOWS_NODE_TAG}\`" >> "$GITHUB_STEP_SUMMARY"
      - name: Download OpenClaw npm preflight manifest
        id: preflight_artifact
        if: ${{ inputs.publish_openclaw_npm }}
@@ -431,7 +337,6 @@ jobs:
          TARGET_SHA: ${{ steps.manifest.outputs.sha || steps.ref.outputs.sha }}
          RELEASE_PROFILE: ${{ steps.full_manifest.outputs.release_profile || inputs.release_profile }}
          FULL_RELEASE_VALIDATION_RUN_ID: ${{ inputs.full_release_validation_run_id }}
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
        run: |
          {
            echo "### Release target"
@@ -442,16 +347,13 @@ jobs:
            if [[ -n "${FULL_RELEASE_VALIDATION_RUN_ID// }" ]]; then
              echo "- Full release validation: \`${FULL_RELEASE_VALIDATION_RUN_ID}\`"
            fi
            if [[ -n "${WINDOWS_NODE_TAG// }" ]]; then
              echo "- Windows Node source release: \`${WINDOWS_NODE_TAG}\`"
            fi
          } >> "$GITHUB_STEP_SUMMARY"
  publish:
    name: Publish plugins, then OpenClaw
    needs: [resolve_release_target]
    runs-on: ubuntu-latest
-    timeout-minutes: 120
+    timeout-minutes: 60
    environment: npm-release
    steps:
      - name: Checkout release SHA
@@ -481,16 +383,10 @@ jobs:
          WAIT_FOR_CLAWHUB: ${{ inputs.wait_for_clawhub && 'true' || 'false' }}
          PREFLIGHT_ARTIFACT_NAME: ${{ needs.resolve_release_target.outputs.preflight_artifact_name }}
          NPM_TELEGRAM_RUN_ID: ${{ inputs.npm_telegram_run_id }}
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
          WINDOWS_NODE_INSTALLER_DIGESTS: ${{ needs.resolve_release_target.outputs.windows_node_installer_digests }}
          POSTPUBLISH_EVIDENCE_DIR: ${{ runner.temp }}/openclaw-release-postpublish-evidence
        run: |
          set -euo pipefail
          is_stable_release() {
            [[ "${RELEASE_TAG}" != *"-alpha."* && "${RELEASE_TAG}" != *"-beta."* ]]
          }
          dispatch_workflow_at_ref() {
            local workflow_ref="$1"
            shift
@@ -940,105 +836,10 @@ jobs:
          }
          publish_github_release() {
            if is_stable_release; then
              verify_windows_release_asset_contract
            fi
            gh release edit "${RELEASE_TAG}" --repo "$GITHUB_REPOSITORY" --draft=false
            echo "- GitHub release: https://github.com/${GITHUB_REPOSITORY}/releases/tag/${RELEASE_TAG}" >> "$GITHUB_STEP_SUMMARY"
          }
          verify_windows_release_asset_contract() {
            local actual_companion_assets actual_digest asset_name expected_companion_assets expected_digest expected_hash expected_installer_names manifest_dir manifest_json manifest_path release_json
            # Add future promoted installer names, such as MSIX x64/ARM64, here.
            local -a installer_assets=(
              "OpenClawCompanion-Setup-x64.exe"
              "OpenClawCompanion-Setup-arm64.exe"
            )
            local -a required_assets=(
              "${installer_assets[@]}"
              "OpenClawCompanion-SHA256SUMS.txt"
            )
            release_json="$(gh release view "${RELEASE_TAG}" --repo "$GITHUB_REPOSITORY" --json assets,url)"
            expected_companion_assets="$(printf '%s\n' "${required_assets[@]}" | jq -R . | jq -sc 'sort')"
            actual_companion_assets="$(printf '%s' "${release_json}" | jq -c '
              [.assets[]? | select(.name | startswith("OpenClawCompanion-")) | .name] | sort
            ')"
            if [[ "${actual_companion_assets}" != "${expected_companion_assets}" ]]; then
              echo "Stable release OpenClawCompanion asset names do not exactly match the current contract." >&2
              return 1
            fi
            for asset_name in "${required_assets[@]}"; do
              if ! printf '%s' "${release_json}" | jq -e --arg name "${asset_name}" 'any(.assets[]?; .name == $name)' >/dev/null; then
                echo "Stable release is missing required Windows asset ${asset_name}." >&2
                return 1
              fi
            done
            manifest_dir="${RUNNER_TEMP}/openclaw-windows-release-contract"
            manifest_path="${manifest_dir}/OpenClawCompanion-SHA256SUMS.txt"
            rm -rf "${manifest_dir}"
            mkdir -p "${manifest_dir}"
            gh release download "${RELEASE_TAG}" \
              --repo "$GITHUB_REPOSITORY" \
              --pattern "OpenClawCompanion-SHA256SUMS.txt" \
              --dir "${manifest_dir}"
            if ! manifest_json="$(jq -Rsc '
              split("\n") as $lines |
              (if $lines[-1] == "" then $lines[0:-1] else $lines end) |
              map(sub("\r$"; "")) |
              if all(.[]; test("^(?<hash>[a-f0-9]{64})  (?<name>[^/\\\\]+)$"))
              then map(capture("^(?<hash>[a-f0-9]{64})  (?<name>[^/\\\\]+)$"))
              else error("malformed Windows checksum manifest entry")
              end
            ' "${manifest_path}")"; then
              echo "Stable release Windows checksum manifest contains malformed entries." >&2
              return 1
            fi
            expected_installer_names="$(printf '%s\n' "${installer_assets[@]}" | jq -R . | jq -sc 'sort')"
            if ! printf '%s' "${manifest_json}" | jq -e --argjson expected "${expected_installer_names}" '
              length == ($expected | length) and
              ([.[].name] | sort) == $expected and
              ([.[].name] | unique | length) == length
            ' >/dev/null; then
              echo "Stable release Windows checksum manifest does not exactly match the installer asset contract." >&2
              return 1
            fi
            for asset_name in "${installer_assets[@]}"; do
              expected_digest="$(printf '%s' "${WINDOWS_NODE_INSTALLER_DIGESTS}" | jq -r --arg name "${asset_name}" '.[$name] // empty')"
              actual_digest="$(printf '%s' "${release_json}" | jq -r --arg name "${asset_name}" '.assets[]? | select(.name == $name) | .digest // empty')"
              if [[ -z "${expected_digest}" || "${actual_digest}" != "${expected_digest}" ]]; then
                echo "Stable release Windows asset ${asset_name} does not match its pinned digest." >&2
                return 1
              fi
              expected_hash="${expected_digest#sha256:}"
              if ! printf '%s' "${manifest_json}" | jq -e --arg name "${asset_name}" --arg hash "${expected_hash}" '
                any(.[]; .name == $name and .hash == $hash)
              ' >/dev/null; then
                echo "Stable release Windows checksum manifest does not match pinned digest for ${asset_name}." >&2
                return 1
              fi
            done
            echo "- Windows Hub asset contract: verified" >> "$GITHUB_STEP_SUMMARY"
          }
          promote_windows_release_assets() {
            if ! is_stable_release; then
              return 0
            fi
            if [[ -z "${WINDOWS_NODE_INSTALLER_DIGESTS// }" ]]; then
              echo "Stable release is missing prevalidated Windows installer digests." >&2
              return 1
            fi
            windows_node_run_id="$(dispatch_workflow windows-node-release.yml \
              -f tag="${RELEASE_TAG}" \
              -f windows_node_tag="${WINDOWS_NODE_TAG}" \
              -f expected_installer_digests="${WINDOWS_NODE_INSTALLER_DIGESTS}")"
            echo "- Windows Node release run ID: \`${windows_node_run_id}\`" >> "$GITHUB_STEP_SUMMARY"
            wait_for_run windows-node-release.yml "${windows_node_run_id}"
          }
          upload_dependency_evidence_release_asset() {
            local release_version download_dir asset_path asset_name artifact_name
            release_version="${RELEASE_TAG#v}"
@@ -1112,7 +913,7 @@ jobs:
          }
          append_release_proof_to_github_release() {
-            local release_version body_file notes_file tarball integrity telegram_line clawhub_line clawhub_bootstrap_line clawhub_runtime_state_path windows_line
+            local release_version body_file notes_file tarball integrity telegram_line clawhub_line clawhub_bootstrap_line clawhub_runtime_state_path
            release_version="${RELEASE_TAG#v}"
            body_file="${RUNNER_TEMP}/release-body.md"
@@ -1130,10 +931,6 @@ jobs:
            write_clawhub_runtime_state false "${clawhub_runtime_state_path}"
            clawhub_line="$(jq -r '.proofLines.normal' "${clawhub_runtime_state_path}")"
            clawhub_bootstrap_line="$(jq -r '.proofLines.bootstrap' "${clawhub_runtime_state_path}")"
            windows_line=""
            if [[ -n "${windows_node_run_id// }" ]]; then
              windows_line="- Windows Hub promotion: https://github.com/${GITHUB_REPOSITORY}/actions/runs/${windows_node_run_id} from openclaw/openclaw-windows-node@${WINDOWS_NODE_TAG}"
            fi
            RELEASE_BODY_FILE="${body_file}" \
              RELEASE_NOTES_FILE="${notes_file}" \
@@ -1151,7 +948,6 @@ jobs:
              CLAWHUB_LINE="${clawhub_line}" \
              CLAWHUB_BOOTSTRAP_LINE="${clawhub_bootstrap_line}" \
              TELEGRAM_LINE="${telegram_line}" \
              WINDOWS_LINE="${windows_line}" \
              node --input-type=module <<'NODE'
          import { readFileSync, writeFileSync } from "node:fs";
@@ -1178,7 +974,6 @@ jobs:
            process.env.CLAWHUB_BOOTSTRAP_LINE,
            `- OpenClaw npm publish: https://github.com/${process.env.RELEASE_REPO}/actions/runs/${process.env.OPENCLAW_NPM_RUN_ID}`,
            process.env.TELEGRAM_LINE,
            ...(process.env.WINDOWS_LINE ? [process.env.WINDOWS_LINE] : []),
          ].join("\n");
          const withoutOldProof = body.replace(/\n?### Release verification\n[\s\S]*?(?=\n### |\n## |$)/, "");
@@ -1203,9 +998,6 @@ jobs:
            else
              echo "- OpenClaw npm publish: skipped by input"
            fi
            if is_stable_release && [[ "${PUBLISH_OPENCLAW_NPM}" == "true" ]]; then
              echo "- Windows Hub promotion: required before the GitHub release can be published"
            fi
            if [[ "${WAIT_FOR_CLAWHUB}" == "true" ]]; then
              echo "- Workflow completion waits for ClawHub"
            else
@@ -1350,7 +1142,6 @@ jobs:
          failed=0
          openclaw_failed=0
          windows_node_run_id=""
          if [[ -n "${openclaw_pid}" ]] && ! wait "${openclaw_pid}"; then
            failed=1
            openclaw_failed=1
@@ -1381,9 +1172,6 @@ jobs:
            fi
            create_or_update_github_release
            upload_dependency_evidence_release_asset
            if ! promote_windows_release_assets; then
              failed=1
            fi
            append_release_proof_to_github_release
            if [[ "${failed}" == "0" ]]; then
              publish_github_release
--- a/.github/workflows/qa-live-transports-convex.yml
+++ b/.github/workflows/qa-live-transports-convex.yml
@@ -532,6 +532,7 @@ jobs:
          OPENCLAW_QA_CONVEX_SECRET_CI: ${{ secrets.OPENCLAW_QA_CONVEX_SECRET_CI }}
          OPENCLAW_QA_CREDENTIAL_ACQUIRE_TIMEOUT_MS: "1800000"
          OPENCLAW_QA_REDACT_PUBLIC_METADATA: "1"
          OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT: "1"
          INPUT_SCENARIO: ${{ github.event_name == 'workflow_dispatch' && inputs.scenario || '' }}
        run: |
          set -euo pipefail
--- a/.github/workflows/stale.yml
+++ b/.github/workflows/stale.yml
@@ -68,7 +68,7 @@ jobs:
          days-before-pr-close: 7
          stale-issue-label: stale
          stale-pr-label: stale
-          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle,clawsweeper:queueable-fix,clawsweeper:source-repro,clawsweeper:fix-shape-clear
+          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle
          exempt-pr-labels: maintainer,no-stale,bad-barnacle
          operations-per-run: 2000
          ascending: true
@@ -100,7 +100,7 @@ jobs:
          days-before-pr-stale: -1
          days-before-pr-close: -1
          stale-issue-label: stale
-          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle,clawsweeper:queueable-fix,clawsweeper:source-repro,clawsweeper:fix-shape-clear
+          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle
          operations-per-run: 2000
          ascending: true
          include-only-assigned: true
@@ -172,7 +172,7 @@ jobs:
          days-before-pr-close: 7
          stale-issue-label: stale
          stale-pr-label: stale
-          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle,clawsweeper:queueable-fix,clawsweeper:source-repro,clawsweeper:fix-shape-clear
+          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle
          exempt-pr-labels: maintainer,no-stale,bad-barnacle
          operations-per-run: 2000
          ascending: true
@@ -203,7 +203,7 @@ jobs:
          days-before-pr-stale: -1
          days-before-pr-close: -1
          stale-issue-label: stale
-          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle,clawsweeper:queueable-fix,clawsweeper:source-repro,clawsweeper:fix-shape-clear
+          exempt-issue-labels: enhancement,maintainer,pinned,security,no-stale,bad-barnacle
          operations-per-run: 2000
          ascending: true
          include-only-assigned: true
@@ -277,9 +277,6 @@ jobs:
              "security",
              "no-stale",
              "bad-barnacle",
              "clawsweeper:queueable-fix",
              "clawsweeper:source-repro",
              "clawsweeper:fix-shape-clear",
            ]);
            const prExemptLabels = new Set(["maintainer", "no-stale", "bad-barnacle"]);
            const maintainerAssociations = new Set(["OWNER", "MEMBER", "COLLABORATOR"]);
--- a/.github/workflows/windows-node-release.yml
+++ b/.github/workflows/windows-node-release.yml
@@ -8,12 +8,9 @@ on:
        required: true
        type: string
      windows_node_tag:
-        description: Exact openclaw-windows-node release tag to promote, for example v0.6.3
+        description: openclaw-windows-node release tag to promote, or latest
        required: true
        type: string
      expected_installer_digests:
        description: Compact JSON map of installer asset names to pinned source sha256 digests
        required: true
        default: latest
        type: string
 permissions:
@@ -34,129 +31,46 @@ jobs:
        env:
          RELEASE_TAG: ${{ inputs.tag }}
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
          EXPECTED_INSTALLER_DIGESTS: ${{ inputs.expected_installer_digests }}
          GH_TOKEN: ${{ github.token }}
        run: |
          if ($env:RELEASE_TAG -notmatch '^v[0-9]{4}\.[1-9][0-9]*\.[1-9][0-9]*((-(alpha|beta)\.[1-9][0-9]*)|(-[1-9][0-9]*))?$') {
            throw "Invalid OpenClaw release tag: $env:RELEASE_TAG"
          }
-          $stableRelease = -not (
+          if ($env:WINDOWS_NODE_TAG -ne "latest" -and $env:WINDOWS_NODE_TAG -notmatch '^v[0-9]+\.[0-9]+\.[0-9]+([-.][0-9A-Za-z.-]+)?$') {
-            $env:RELEASE_TAG.Contains("-alpha.") -or
+            throw "Invalid openclaw-windows-node release tag: $env:WINDOWS_NODE_TAG"
            $env:RELEASE_TAG.Contains("-beta.")
          )
          if ($env:WINDOWS_NODE_TAG -notmatch '^v[0-9]+\.[0-9]+\.[0-9]+([-.][0-9A-Za-z]+([.-][0-9A-Za-z]+)*)?$') {
            throw "windows_node_tag must be an explicit openclaw-windows-node release tag, not latest: $env:WINDOWS_NODE_TAG"
          }
          try {
            $expectedDigests = $env:EXPECTED_INSTALLER_DIGESTS | ConvertFrom-Json -AsHashtable
          } catch {
            throw "expected_installer_digests must be a JSON object: $_"
          }
          # Add future signed installer names, such as MSIX x64/ARM64, here.
          $requiredInstallerNames = @(
            "OpenClawCompanion-Setup-x64.exe",
            "OpenClawCompanion-Setup-arm64.exe"
          )
          $allowedTargetCompanionAssetNames = @(
            $requiredInstallerNames
            "OpenClawCompanion-SHA256SUMS.txt"
          )
          if ($expectedDigests.Count -ne $requiredInstallerNames.Count) {
            throw "expected_installer_digests must contain exactly the current installer asset contract."
          }
          foreach ($name in $requiredInstallerNames) {
            $digest = [string]$expectedDigests[$name]
            if ($digest -notmatch '^sha256:[A-Fa-f0-9]{64}$') {
              throw "expected_installer_digests is missing a valid pinned digest for $name."
            }
          }
          $targetRelease = gh release view $env:RELEASE_TAG --repo $env:GITHUB_REPOSITORY --json tagName,isDraft,isPrerelease,assets,url | ConvertFrom-Json
          if ($targetRelease.tagName -ne $env:RELEASE_TAG) {
            throw "OpenClaw release tag mismatch: expected $env:RELEASE_TAG, got $($targetRelease.tagName)"
          }
          $unexpectedTargetCompanionAssets = @(
            $targetRelease.assets |
              Where-Object {
                $_.name.StartsWith("OpenClawCompanion-") -and
                $_.name -notin $allowedTargetCompanionAssetNames
              } |
              ForEach-Object name |
              Sort-Object
          )
          if ($unexpectedTargetCompanionAssets.Count -ne 0) {
            throw "Target OpenClaw release contains unexpected OpenClawCompanion assets before upload: $($unexpectedTargetCompanionAssets -join ', ')"
          }
          $sourceRelease = gh release view $env:WINDOWS_NODE_TAG --repo openclaw/openclaw-windows-node --json tagName,isDraft,isPrerelease,assets,url | ConvertFrom-Json
          if ($sourceRelease.tagName -ne $env:WINDOWS_NODE_TAG) {
            throw "Windows source release tag mismatch: expected $env:WINDOWS_NODE_TAG, got $($sourceRelease.tagName)"
          }
          if ($sourceRelease.isDraft) {
            throw "Windows source release must be published: $($sourceRelease.url)"
          }
          if ($stableRelease -and $sourceRelease.isPrerelease) {
            throw "Stable OpenClaw releases require a non-prerelease Windows source release: $($sourceRelease.url)"
          }
          foreach ($name in $requiredInstallerNames) {
            $sourceAssets = @($sourceRelease.assets | Where-Object name -eq $name)
            if ($sourceAssets.Count -ne 1) {
              throw "Windows source release must contain exactly one required asset $name; found $($sourceAssets.Count)."
            }
            if ([string]$sourceAssets[0].digest -ne [string]$expectedDigests[$name]) {
              throw "Windows source release asset digest does not match the pinned digest: $name"
            }
          }
          gh release view $env:RELEASE_TAG --repo $env:GITHUB_REPOSITORY | Out-Null
      - name: Download Windows Hub release installers
        shell: pwsh
        env:
          WINDOWS_NODE_TAG: ${{ inputs.windows_node_tag }}
          EXPECTED_INSTALLER_DIGESTS: ${{ inputs.expected_installer_digests }}
          GH_TOKEN: ${{ github.token }}
        run: |
          New-Item -ItemType Directory -Force -Path dist | Out-Null
-          # Add future signed installer patterns, such as MSIX x64/ARM64, here.
+          $tagArgs = @()
-          # Every matched installer is signature-checked, checksummed, and promoted.
+          if ($env:WINDOWS_NODE_TAG -ne "latest") {
-          $installerPatterns = @(
+            $tagArgs += $env:WINDOWS_NODE_TAG
            "OpenClawCompanion-Setup-x64.exe",
            "OpenClawCompanion-Setup-arm64.exe"
          )
          $downloadArgs = @(
            $env:WINDOWS_NODE_TAG,
            "--repo", "openclaw/openclaw-windows-node",
            "--dir", "dist"
          )
          foreach ($pattern in $installerPatterns) {
            $downloadArgs += @("--pattern", $pattern)
          }
          gh release download @downloadArgs
          if ($LASTEXITCODE -ne 0) {
            throw "Failed to download Windows release assets from $env:WINDOWS_NODE_TAG."
          }
          gh release download @tagArgs `
            --repo openclaw/openclaw-windows-node `
            --pattern "OpenClawCompanion-Setup-*.exe" `
            --dir dist
-          foreach ($pattern in $installerPatterns) {
+          $expected = @(
-            $patternMatches = @(Get-ChildItem -LiteralPath dist -File | Where-Object Name -Like $pattern)
+            "dist/OpenClawCompanion-Setup-x64.exe",
-            if ($patternMatches.Count -ne 1) {
+            "dist/OpenClawCompanion-Setup-arm64.exe"
-              throw "Expected exactly one Windows installer matching '$pattern', found $($patternMatches.Count)."
+          )
-            }
+          foreach ($file in $expected) {
-          }
+            if (-not (Test-Path -LiteralPath $file)) {
-
+              throw "Missing expected Windows installer: $file"
          $expectedDigests = $env:EXPECTED_INSTALLER_DIGESTS | ConvertFrom-Json -AsHashtable
          foreach ($file in Get-ChildItem -LiteralPath dist -File) {
            $expectedHash = ([string]$expectedDigests[$file.Name]) -replace '^sha256:', ''
            $actualHash = (Get-FileHash -Algorithm SHA256 -LiteralPath $file.FullName).Hash
            if ($actualHash -ne $expectedHash) {
              throw "Downloaded Windows source asset does not match pinned digest: $($file.Name)"
            }
          }
      - name: Verify Authenticode signatures
        shell: pwsh
        run: |
-          $expectedSignerSubject = "CN=OpenClaw Foundation, O=OpenClaw Foundation, L=Mill Valley, S=California, C=US"
+          Get-ChildItem -LiteralPath dist -Filter "OpenClawCompanion-Setup-*.exe" | ForEach-Object {
          Get-ChildItem -LiteralPath dist -File | ForEach-Object {
            $signature = Get-AuthenticodeSignature -LiteralPath $_.FullName
            if ($signature.Status -ne "Valid") {
              throw "$($_.Name) Authenticode signature was $($signature.Status)."
@@ -164,9 +78,6 @@ jobs:
            if (-not $signature.SignerCertificate) {
              throw "$($_.Name) has no signer certificate."
            }
            if ($signature.SignerCertificate.Subject -ne $expectedSignerSubject) {
              throw "$($_.Name) has unexpected signer subject $($signature.SignerCertificate.Subject)."
            }
            [pscustomobject]@{
              File = $_.Name
              Signer = $signature.SignerCertificate.Subject
@@ -177,7 +88,7 @@ jobs:
      - name: Write SHA-256 manifest
        shell: pwsh
        run: |
-          Get-ChildItem -LiteralPath dist -File |
+          Get-ChildItem -LiteralPath dist -Filter "OpenClawCompanion-Setup-*.exe" |
            Sort-Object Name |
            ForEach-Object {
              $hash = Get-FileHash -Algorithm SHA256 -LiteralPath $_.FullName
@@ -190,81 +101,12 @@ jobs:
          RELEASE_TAG: ${{ inputs.tag }}
          GH_TOKEN: ${{ github.token }}
        run: |
-          $releaseAssets = @(Get-ChildItem -LiteralPath dist -File | Sort-Object Name | ForEach-Object FullName)
+          gh release upload $env:RELEASE_TAG `
-          gh release upload $env:RELEASE_TAG @releaseAssets --repo $env:GITHUB_REPOSITORY --clobber
+            dist/OpenClawCompanion-Setup-x64.exe `
-          if ($LASTEXITCODE -ne 0) {
+            dist/OpenClawCompanion-Setup-arm64.exe `
-            throw "Failed to upload Windows release assets to $env:RELEASE_TAG."
+            dist/OpenClawCompanion-SHA256SUMS.txt `
-          }
+            --repo $env:GITHUB_REPOSITORY `
-
+            --clobber
      - name: Verify promoted release asset contract
        shell: pwsh
        env:
          RELEASE_TAG: ${{ inputs.tag }}
          GH_TOKEN: ${{ github.token }}
        run: |
          New-Item -ItemType Directory -Force -Path verified | Out-Null
          $expectedAssets = @(Get-ChildItem -LiteralPath dist -File | Sort-Object Name)
          $expectedCompanionAssetNames = @($expectedAssets | ForEach-Object Name | Sort-Object)
          $targetRelease = gh release view $env:RELEASE_TAG --repo $env:GITHUB_REPOSITORY --json assets | ConvertFrom-Json
          $actualCompanionAssetNames = @(
            $targetRelease.assets |
              Where-Object { $_.name.StartsWith("OpenClawCompanion-") } |
              ForEach-Object name |
              Sort-Object
          )
          $assetContractDiff = @(
            Compare-Object `
              -ReferenceObject $expectedCompanionAssetNames `
              -DifferenceObject $actualCompanionAssetNames
          )
          if (
            $actualCompanionAssetNames.Count -ne $expectedCompanionAssetNames.Count -or
            $assetContractDiff.Count -ne 0
          ) {
            throw "Promoted OpenClawCompanion asset names do not exactly match the current contract."
          }
          foreach ($asset in $expectedAssets) {
            gh release download $env:RELEASE_TAG `
              --repo $env:GITHUB_REPOSITORY `
              --pattern $asset.Name `
              --dir verified
            if ($LASTEXITCODE -ne 0) {
              throw "Failed to download promoted Windows release asset $($asset.Name)."
            }
          }
          $manifestPath = "verified/OpenClawCompanion-SHA256SUMS.txt"
          $manifestEntries = @(Get-Content -LiteralPath $manifestPath | ForEach-Object {
            if ($_ -notmatch '^([A-Fa-f0-9]{64})  ([^\\/]+)$') {
              throw "Invalid Windows SHA-256 manifest entry: $_"
            }
            [PSCustomObject]@{
              Hash = $Matches[1]
              Name = $Matches[2]
            }
          })
          $expectedInstallerNames = @(
            $expectedAssets |
              Where-Object Name -ne "OpenClawCompanion-SHA256SUMS.txt" |
              ForEach-Object Name
          )
          $manifestInstallerNames = @($manifestEntries | ForEach-Object Name | Sort-Object)
          $contractDiff = @(
            Compare-Object `
              -ReferenceObject $expectedInstallerNames `
              -DifferenceObject $manifestInstallerNames
          )
          if ($contractDiff.Count -ne 0) {
            throw "Promoted Windows SHA-256 manifest does not match the installer asset contract."
          }
          foreach ($entry in $manifestEntries) {
            $hash = (Get-FileHash -Algorithm SHA256 -LiteralPath "verified/$($entry.Name)").Hash
            if ($hash -ne $entry.Hash) {
              throw "Promoted Windows release asset checksum mismatch: $($entry.Name)"
            }
          }
      - name: Summary
        shell: pwsh
@@ -277,9 +119,8 @@ jobs:
          OpenClaw release: $env:RELEASE_TAG
          Source release: openclaw/openclaw-windows-node@$env:WINDOWS_NODE_TAG
          - https://github.com/openclaw/openclaw/releases/download/$env:RELEASE_TAG/OpenClawCompanion-Setup-x64.exe
          - https://github.com/openclaw/openclaw/releases/download/$env:RELEASE_TAG/OpenClawCompanion-Setup-arm64.exe
          - https://github.com/openclaw/openclaw/releases/download/$env:RELEASE_TAG/OpenClawCompanion-SHA256SUMS.txt
          "@ >> $env:GITHUB_STEP_SUMMARY
          Get-ChildItem -LiteralPath dist -File |
            Sort-Object Name |
            ForEach-Object {
              "- https://github.com/openclaw/openclaw/releases/download/$env:RELEASE_TAG/$($_.Name)"
            } >> $env:GITHUB_STEP_SUMMARY
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -2,35 +2,6 @@
 Docs: https://docs.openclaw.ai
 ## 2026.6.8
 ### Highlights
 - Telegram and WhatsApp channel delivery are richer and less brittle: Telegram can send structured rich text with tables, lists, expandable blockquotes, prompt-preserving CLI backend delivery, retired native draft migration, and safer rich-media boundaries, while WhatsApp now honors configured ACP bindings. (#92679, #84082, #89421, #92513) Thanks @obviyus, @jzakirov, @spacegeologist, and @TurboTheTurtle.
 - Agent and Gateway recovery is sharper across account-scoped DM sends, generated media completions, restart shutdown aborts, yielded subagent pauses, yielded cron media, heartbeat dedupe, session identity prompts, and unknown OpenAI agent selector rejection. (#92788, #91246, #91357, #92631, #92146, #91287, #92468, #92510) Thanks @yetval, @TurboTheTurtle, @ooiuuii, @openperf, @IWhatsskill, @ZengWen-DT, and @zhangguiping-xydt.
 - Provider/model handling expands and tightens with GLM-5.2, Claude Haiku 4.5 catalog rows, OpenRouter and Google Vertex provider-prefix normalization, managed SecretRef auth, bounded model browse discovery, storeless OpenAI Responses replay gating, and Claude 4.5 Copilot tool-streaming safety. (#92796, #90116, #92627, #91218, #90686, #92247, #90706, #75393) Thanks @arkyu2077, @liuhao1024, @bymle, @rohitjavvadi, @samson910022, @snowzlm, and @Kailigithub.
 - `/usage` and reply payload hooks now have a native full footer renderer, default template, fixed-decimal formatting, credential-aware limits, better partial-count handling, and warnings for broken templates instead of silent bad output. (#92657, #89835, #89629) Thanks @Marvinthebored.
 - UI and mobile flows are steadier: workspace files can collapse and start collapsed, WebChat backscroll survives streaming, the sidebar session picker remains interactive above the desktop workbench, reset soft args survive UI dispatch, stale dashboard session parent lineage is preserved, and iOS reconnects stale foreground gateways. (#92779, #92622, #92705, #91353, #90658, #92552) Thanks @shakkernerd, @TurboTheTurtle, @NianJiuZst, @zhouhe-xydt, @luoyanglang, and @Solvely-Colin.
 - Memory, state, and diagnostics recover cleaner: oversized OpenAI embedding batches split before 431s, QMD memory search stays available in transient mode, SQLite avoids WAL on NFS state volumes, stuck-session recovery scheduling no longer resets warning backoff, and Infinity chunk limits stay genuinely unbounded. (#92650, #92618, #92639, #91247, #92752, #92735) Thanks @mushuiyu886, @TurboTheTurtle, @849261680, @gnanam1990, and @yhterrance.
 ### Changes
 - Providers/models: add GLM-5.2 support and Claude Haiku 4.5 catalog entries while keeping provider-qualified model IDs normalized across OpenRouter and Google Vertex paths. (#92796, #90116, #92627, #91218) Thanks @arkyu2077, @liuhao1024, and @bymle.
 - Channel plugins: ship Telegram rich-message delivery and WhatsApp ACP binding support, including rich prompt handoff to CLI backends and transport fixtures for richer drafts. (#92679, #92513) Thanks @obviyus and @TurboTheTurtle.
 - Agent commands: support `/btw` in CLI-backed sessions and keep CLI usage-error exits classified as usage failures instead of successful runs. (#92669, #92162) Thanks @joshavant and @Pandah97.
 - Usage hooks: add built-in full footer rendering, default footer templates, per-turn usage state, credential-aware limits, and fixed-decimal formatting for usage-bar templates. (#92657, #89835, #89629) Thanks @Marvinthebored.
 - Docs and operator guidance: document node config examples, clarify before-install hook scope, correct agent default concurrency comments, refresh ZAI provider docs, and update channel/group docs for current Telegram and WhatsApp behavior. (#92677, #92766, #92695) Thanks @liuhao1024, @sallyom, and @ArielSmoliar.
 ### Fixes
 - Channels and delivery: preserve account-scoped DM channel send policy, rich Telegram final replies, rich Telegram tables and lists, Telegram thread-create CLI remapping, Slack outbound `message_sent` hooks, contributed message-tool schema optionality, same-channel generated media completions, and channel chunking around surrogate pairs and Infinity limits. (#92788, #92679, #89421, #89943, #91137, #91246, #92735) Thanks @yetval, @obviyus, @spacegeologist, @rishitamrakar, @lundog, @TurboTheTurtle, and @yhterrance.
 - Discord: give generated auto-thread titles a 60-second timeout and 4,096-token reasoning-model output budget, clamped to the selected model output cap. (#64734) Thanks @hanamizuki.
 - Agent, cron, and Gateway runtime: mark active main sessions before restart shutdown aborts, pause yielded subagent runs whose terminal also signals abort, preserve yielded media completions, de-duplicate main-session heartbeat events, expose session identity in runtime prompts, reject unknown OpenAI agent selectors, keep generated media completions and slash-command block replies in WebChat, and require admin privileges for HTTP session/model override surfaces. (#91357, #92631, #92146, #91287, #92468, #92510, #91246, #92651, #92646) Thanks @ooiuuii, @openperf, @IWhatsskill, @ZengWen-DT, @zhangguiping-xydt, and @TurboTheTurtle.
 - Providers and model replay: preserve storeless OpenAI Responses replay compatibility, avoid eager tool streaming for Claude 4.5 in Copilot, honor profile auth for SecretRef model entries, bound model browsing, strip provider prefixes where runtimes need bare IDs, and surface nested embedding fetch failures. (#90706, #75393, #90686, #92247, #92627, #91218, #92628) Thanks @snowzlm, @Kailigithub, @rohitjavvadi, @samson910022, @liuhao1024, @bymle, and @mushuiyu886.
 - Memory, state, diagnostics, and config: split header-too-large embedding batches, keep QMD memory search enabled in transient mode, avoid SQLite WAL on NFS volumes, preserve recovery scheduling outside stuck-session warning backoff, and keep shell environment fallbacks contained in config write tests. (#92650, #92618, #92639, #91247, #92752) Thanks @mushuiyu886, @TurboTheTurtle, @849261680, and @gnanam1990.
 - UI/mobile/TUI: preserve dashboard session parent lineage, WebChat backscroll, reset soft command args, sidebar session picker interactivity, collapsed workspace files, resolved `/model` confirmation refs, and stale foreground iOS Gateway reconnects. (#90658, #92622, #91353, #92705, #92779, #92773, #92552) Thanks @luoyanglang, @TurboTheTurtle, @zhouhe-xydt, @NianJiuZst, @shakkernerd, @NarahariRaghava, and @Solvely-Colin.
 - Release and test reliability: extend slow Gateway/full-suite watchdogs, split local full-suite shards when throttled, stabilize plugin auth marker fixtures, avoid brittle provider-ref error text, and keep QA Lab bootstrap selection assertions aligned with flow-only scenarios. (#92652)
 ## 2026.6.6
 ### Highlights
--- a/4
+++ b/4
@@ -147,10 +147,6 @@ RUN --mount=type=cache,id=openclaw-pnpm-store,target=/root/.local/share/pnpm/sto
    OPENCLAW_EXTENSIONS="$OPENCLAW_EXTENSIONS" OPENCLAW_BUNDLED_PLUGIN_DIR="$OPENCLAW_BUNDLED_PLUGIN_DIR" node scripts/prune-docker-plugin-dist.mjs && \
    node scripts/postinstall-bundled-plugins.mjs && \
    find dist -type f \( -name '*.d.ts' -o -name '*.d.mts' -o -name '*.d.cts' -o -name '*.map' \) -delete && \
    rm -rf \
      /app/node_modules/openclaw \
      /app/node_modules/.bin/openclaw \
      /app/node_modules/.pnpm/openclaw@*/node_modules/openclaw && \
    node scripts/check-package-dist-imports.mjs /app
 # ── Runtime base image ──────────────────────────────────────────
--- a/apps/ios/Sources/Model/NodeAppModel.swift
+++ b/apps/ios/Sources/Model/NodeAppModel.swift
@@ -188,7 +188,6 @@ final class NodeAppModel {
    @ObservationIgnored private var backgroundGraceTaskTimer: Task<Void, Never>?
    private var backgroundReconnectSuppressed = false
    private var backgroundReconnectLeaseUntil: Date?
    @ObservationIgnored private var foregroundGatewayResumeCheckInFlight = false
    private var lastSignificantLocationWakeAt: Date?
    @ObservationIgnored private let watchReplyCoordinator = WatchReplyCoordinator()
    private var watchExecApprovalPromptsByID: [String: ExecApprovalPrompt] = [:]
@@ -215,7 +214,6 @@ final class NodeAppModel {
    private static let watchExecApprovalBridgeStateKey = "watch.execApproval.bridge.state.v1"
    private static let backgroundAliveLastSuccessAtMsKey = "gateway.backgroundAlive.lastSuccessAtMs"
    private static let backgroundAliveLastTriggerKey = "gateway.backgroundAlive.lastTrigger"
    private static let foregroundResumeHealthTimeoutSeconds = 1
    var cameraHUDText: String?
    var cameraHUDKind: CameraHUDKind?
@@ -419,7 +417,9 @@ final class NodeAppModel {
            self.isBackgrounded = false
            self.endBackgroundConnectionGracePeriod(reason: "scene_foreground")
            self.clearBackgroundReconnectSuppression(reason: "scene_foreground")
-            var shouldStartGatewayHealthMonitor = self.operatorConnected
+            if self.operatorConnected {
                self.startGatewayHealthMonitor()
            }
            if phase == .active {
                self.voiceWake.resumeAfterExternalAudioCapture(wasSuspended: self.backgroundVoiceWakeSuspended)
                self.backgroundVoiceWakeSuspended = false
@@ -444,8 +444,6 @@ final class NodeAppModel {
                // iOS may suspend network sockets in background without a clean close.
                // On foreground, force a fresh handshake to avoid "connected but dead" states.
                if backgroundedFor >= 3.0 {
                    shouldStartGatewayHealthMonitor = false
                    self.foregroundGatewayResumeCheckInFlight = true
                    Task { [weak self] in
                        guard let self else { return }
                        let operatorWasConnected = await MainActor.run { self.operatorConnected }
@@ -454,26 +452,31 @@ final class NodeAppModel {
                            let healthy = await (try? self.operatorGateway.request(
                                method: "health",
                                paramsJSON: nil,
-                                timeoutSeconds: Self.foregroundResumeHealthTimeoutSeconds)) != nil
+                                timeoutSeconds: 2)) != nil
                            if healthy {
-                                await MainActor.run {
+                                await MainActor.run { self.startGatewayHealthMonitor() }
                                    self.foregroundGatewayResumeCheckInFlight = false
                                    self.startGatewayHealthMonitor()
                                }
                                return
                            }
                        }
                        await self.operatorGateway.disconnect()
                        await self.nodeGateway.disconnect()
                        await MainActor.run {
-                            self.foregroundGatewayResumeCheckInFlight = false
+                            guard !self.isAppleReviewDemoModeEnabled else { return }
                            self.setOperatorConnected(false)
                            self.gatewayConnected = false
                            // Foreground recovery must actively restart the saved gateway config.
                            // Disconnecting stale sockets alone can leave us idle if the old
                            // reconnect tasks were suppressed or otherwise got stuck in background.
                            self.gatewayStatusText = "Reconnecting…"
                            self.talkMode.updateGatewayConnected(false)
                            if let cfg = self.activeGatewayConnectConfig {
                                self.applyGatewayConnectConfig(cfg)
                            }
                        }
                        await self.restartGatewaySessionsAfterForegroundStaleConnection()
                    }
                }
            }
            if shouldStartGatewayHealthMonitor {
                self.startGatewayHealthMonitor()
            }
        @unknown default:
            self.isBackgrounded = false
            self.endBackgroundConnectionGracePeriod(reason: "scene_unknown")
@@ -783,12 +786,6 @@ final class NodeAppModel {
    func refreshGatewayOverviewIfConnected() async {
        guard await self.isOperatorConnected() else { return }
        if self.foregroundGatewayResumeCheckInFlight {
            GatewayDiagnostics.log("gateway overview refresh deferred reason=foreground_resume_check")
            try? await Task.sleep(
                nanoseconds: UInt64(Self.foregroundResumeHealthTimeoutSeconds) * 1_000_000_000)
            guard await self.isOperatorConnected(), !self.foregroundGatewayResumeCheckInFlight else { return }
        }
        await self.refreshBrandingFromGateway()
        await self.refreshAgentsFromGateway()
    }
@@ -1989,33 +1986,12 @@ extension NodeAppModel {
    }
    func resetGatewaySessionsForForcedReconnect() async {
-        let nodeGatewayTask = self.nodeGatewayTask
+        self.nodeGatewayTask?.cancel()
        let operatorGatewayTask = self.operatorGatewayTask
        nodeGatewayTask?.cancel()
        self.nodeGatewayTask = nil
-        operatorGatewayTask?.cancel()
+        self.operatorGatewayTask?.cancel()
        self.operatorGatewayTask = nil
        await self.operatorGateway.disconnect()
        await self.nodeGateway.disconnect()
        // Foreground recovery reuses the same config immediately after reset.
        // Wait for canceled loops so their shutdown cleanup cannot clobber the new reconnect state.
        if let operatorGatewayTask {
            await operatorGatewayTask.value
        }
        if let nodeGatewayTask {
            await nodeGatewayTask.value
        }
    }
    private func restartGatewaySessionsAfterForegroundStaleConnection() async {
        await self.resetGatewaySessionsForForcedReconnect()
        guard !self.isAppleReviewDemoModeEnabled else { return }
        self.setOperatorConnected(false)
        self.gatewayConnected = false
        self.gatewayStatusText = "Reconnecting…"
        self.talkMode.updateGatewayConnected(false)
        guard let cfg = self.activeGatewayConnectConfig else { return }
        self.applyGatewayConnectConfig(cfg, forceReconnect: true)
    }
    func disconnectGateway() {
@@ -4850,10 +4826,6 @@ extension NodeAppModel {
        (self.nodeGatewayTask != nil, self.operatorGatewayTask != nil)
    }
    func _test_restartGatewaySessionsAfterForegroundStaleConnection() async {
        await self.restartGatewaySessionsAfterForegroundStaleConnection()
    }
    func _test_handleSuccessfulBootstrapGatewayOnboarding() async {
        await self.handleSuccessfulBootstrapGatewayOnboarding(
            url: URL(string: "wss://gateway.example")!,
--- a/apps/ios/Tests/GatewayConnectionControllerTests.swift
+++ b/apps/ios/Tests/GatewayConnectionControllerTests.swift
@@ -356,20 +356,6 @@ import UIKit
        #expect(!appModel._test_hasGatewayLoopTasks().operator)
    }
    @Test @MainActor func foregroundStaleConnectionRestartReappliesActiveGatewayConfig() async {
        let appModel = NodeAppModel()
        defer { appModel.disconnectGateway() }
        let config = Self.makeGatewayConnectConfig()
        appModel.applyGatewayConnectConfig(config)
        await appModel._test_restartGatewaySessionsAfterForegroundStaleConnection()
        #expect(appModel.gatewayStatusText == "Reconnecting…")
        #expect(appModel.activeGatewayConnectConfig?.hasSameConnectionInputs(as: config) == true)
        #expect(appModel._test_hasGatewayLoopTasks().node)
        #expect(appModel._test_hasGatewayLoopTasks().operator)
    }
    @Test @MainActor func loadLastConnectionReadsSavedValues() {
        let prior = KeychainStore.loadString(service: "ai.openclaw.gateway", account: "lastConnection")
        defer {
--- a/apps/macos/Package.resolved
+++ b/apps/macos/Package.resolved
@@ -1,5 +1,5 @@
 {
-  "originHash" : "ae9f37f50cff0d32d189e60948f61e2fa1704e997a6ef4ad5e37f6a11c165ea4",
+  "originHash" : "035a4fe955164c62c1628de75f6437a14443a947eea2a1b0176ba484d6fde6f8",
  "pins" : [
    {
      "identity" : "axorcist",
@@ -42,8 +42,8 @@
      "kind" : "remoteSourceControl",
      "location" : "https://github.com/steipete/Peekaboo.git",
      "state" : {
-        "revision" : "ee0e3185431788dad533ffca77cd75315aa3d26f",
+        "revision" : "3a56ed2aa769bfefb5a78722dfce3c34088cfba1",
-        "version" : "3.4.1"
+        "version" : "3.4.0"
      }
    },
    {
@@ -51,8 +51,8 @@
      "kind" : "remoteSourceControl",
      "location" : "https://github.com/sparkle-project/Sparkle",
      "state" : {
-        "revision" : "d46d456107feacc80711b21847b82b07bd9fb46e",
+        "revision" : "6276ba2b404829d139c45ff98427cf90e2efc59b",
-        "version" : "2.9.3"
+        "version" : "2.9.2"
      }
    },
    {
@@ -78,8 +78,8 @@
      "kind" : "remoteSourceControl",
      "location" : "https://github.com/apple/swift-log.git",
      "state" : {
-        "revision" : "92448c359f00ebe36ae97d3bd9086f13c7692b5a",
+        "revision" : "2aed77ae5ec9a86d8fe42c12275e4c2653a286ee",
-        "version" : "1.13.2"
+        "version" : "1.13.1"
      }
    },
    {
--- a/apps/macos/Package.swift
+++ b/apps/macos/Package.swift
@@ -19,7 +19,7 @@ let package = Package(
        .package(url: "https://github.com/swiftlang/swift-subprocess.git", from: "0.4.0"),
        .package(url: "https://github.com/apple/swift-log.git", from: "1.10.1"),
        .package(url: "https://github.com/sparkle-project/Sparkle", from: "2.9.0"),
-        .package(url: "https://github.com/steipete/Peekaboo.git", exact: "3.4.1"),
+        .package(url: "https://github.com/steipete/Peekaboo.git", exact: "3.4.0"),
        .package(path: "../shared/OpenClawKit"),
        .package(path: "../swabble"),
    ],
--- a/apps/macos/Sources/OpenClaw/VoiceWakeOverlayController+Window.swift
+++ b/apps/macos/Sources/OpenClaw/VoiceWakeOverlayController+Window.swift
@@ -92,13 +92,7 @@ extension VoiceWakeOverlayController {
        let contentHeight = ceil(used.height + (textInset.height * 2))
        let total = contentHeight + self.verticalPadding * 2
-        // Defer the overflow state mutation to break the SwiftUI onChange → measuredHeight →
+        self.model.isOverflowing = total > self.maxHeight
        // isOverflowing → re-render → onChange synchronous render loop (fixes #43480).
        let overflowing = total > self.maxHeight
        DispatchQueue.main.async { [weak self] in
            guard let self, self.model.isOverflowing != overflowing else { return }
            self.model.isOverflowing = overflowing
        }
        return max(self.minHeight, min(total, self.maxHeight))
    }
--- a/apps/macos/Tests/OpenClawIPCTests/ExecApprovalsStoreRefactorTests.swift
+++ b/apps/macos/Tests/OpenClawIPCTests/ExecApprovalsStoreRefactorTests.swift
@@ -4,64 +4,14 @@ import Testing
@Suite(.serialized)
 struct ExecApprovalsStoreRefactorTests {
    private var realTemporaryDirectory: URL {
        let path = FileManager().temporaryDirectory.path
        if path.hasPrefix("/var/") {
            return URL(fileURLWithPath: "/private\(path)", isDirectory: true)
        }
        return FileManager().temporaryDirectory.resolvingSymlinksInPath()
    }
    private func withLockedEnv(
        _ values: [String: String?],
        _ body: () async throws -> Void) async throws
    {
        func restoreEnv(_ values: [String: String?]) {
            for (key, value) in values {
                if let value {
                    setenv(key, value, 1)
                } else {
                    unsetenv(key)
                }
            }
        }
        await TestIsolationLock.shared.acquire()
        var previousEnv: [String: String?] = [:]
        for (key, value) in values {
            previousEnv[key] = getenv(key).map { String(cString: $0) }
            if let value {
                setenv(key, value, 1)
            } else {
                unsetenv(key)
            }
        }
        do {
            try await body()
            restoreEnv(previousEnv)
            await TestIsolationLock.shared.release()
        } catch {
            restoreEnv(previousEnv)
            await TestIsolationLock.shared.release()
            throw error
        }
    }
    private func withTempStateDir(
        _ body: @escaping @Sendable (URL) async throws -> Void) async throws
    {
-        let root = self.realTemporaryDirectory
+        let stateDir = FileManager().temporaryDirectory
            .appendingPathComponent("openclaw-state-\(UUID().uuidString)", isDirectory: true)
-        let home = root.appendingPathComponent("home", isDirectory: true)
+        defer { try? FileManager().removeItem(at: stateDir) }
        let stateDir = root.appendingPathComponent("state", isDirectory: true)
        defer { try? FileManager().removeItem(at: root) }
        try Self.seedCurrentApprovalsFile(in: stateDir)
-        try await self.withLockedEnv([
+        try await TestIsolation.withEnvValues(["OPENCLAW_STATE_DIR": stateDir.path]) {
            "OPENCLAW_HOME": home.path,
            "OPENCLAW_STATE_DIR": stateDir.path,
        ]) {
            try await body(stateDir)
        }
    }
@@ -69,13 +19,13 @@ struct ExecApprovalsStoreRefactorTests {
    private func withTempHomeAndStateDir(
        _ body: @escaping @Sendable (URL, URL) async throws -> Void) async throws
    {
-        let root = self.realTemporaryDirectory
+        let root = FileManager().temporaryDirectory
            .appendingPathComponent("openclaw-home-state-\(UUID().uuidString)", isDirectory: true)
        let home = root.appendingPathComponent("home", isDirectory: true)
        let stateDir = root.appendingPathComponent("state", isDirectory: true)
        defer { try? FileManager().removeItem(at: root) }
-        try await self.withLockedEnv([
+        try await TestIsolation.withEnvValues([
            "OPENCLAW_HOME": home.path,
            "OPENCLAW_STATE_DIR": stateDir.path,
        ]) {
@@ -197,19 +147,4 @@ struct ExecApprovalsStoreRefactorTests {
        }
        return identifier
    }
    private static func seedCurrentApprovalsFile(in stateDir: URL) throws {
        try FileManager().createDirectory(at: stateDir, withIntermediateDirectories: true)
        let file = ExecApprovalsFile(
            version: 1,
            socket: ExecApprovalsSocketConfig(
                path: stateDir.appendingPathComponent("exec-approvals.sock").path,
                token: "test-token"),
            defaults: nil,
            agents: [:])
        let encoder = JSONEncoder()
        encoder.outputFormatting = [.prettyPrinted, .sortedKeys]
        try encoder.encode(file)
            .write(to: stateDir.appendingPathComponent("exec-approvals.json"))
    }
 }
--- a/apps/shared/OpenClawKit/Sources/OpenClawProtocol/GatewayModels.swift
+++ b/apps/shared/OpenClawKit/Sources/OpenClawProtocol/GatewayModels.swift
@@ -2074,204 +2074,6 @@ public struct SessionsCompactionRestoreResult: Codable, Sendable {
    }
 }
 public struct SessionFileBrowserEntry: Codable, Sendable {
    public let path: String
    public let name: String
    public let kind: AnyCodable
    public let sessionkind: SessionFileRelevance?
    public let size: Int?
    public let updatedatms: Int?
    public init(
        path: String,
        name: String,
        kind: AnyCodable,
        sessionkind: SessionFileRelevance?,
        size: Int?,
        updatedatms: Int?)
    {
        self.path = path
        self.name = name
        self.kind = kind
        self.sessionkind = sessionkind
        self.size = size
        self.updatedatms = updatedatms
    }
    private enum CodingKeys: String, CodingKey {
        case path
        case name
        case kind
        case sessionkind = "sessionKind"
        case size
        case updatedatms = "updatedAtMs"
    }
 }
 public struct SessionFileBrowserResult: Codable, Sendable {
    public let path: String
    public let parentpath: String?
    public let search: String?
    public let entries: [SessionFileBrowserEntry]
    public let truncated: Bool?
    public init(
        path: String,
        parentpath: String?,
        search: String?,
        entries: [SessionFileBrowserEntry],
        truncated: Bool?)
    {
        self.path = path
        self.parentpath = parentpath
        self.search = search
        self.entries = entries
        self.truncated = truncated
    }
    private enum CodingKeys: String, CodingKey {
        case path
        case parentpath = "parentPath"
        case search
        case entries
        case truncated
    }
 }
 public struct SessionFileEntry: Codable, Sendable {
    public let path: String
    public let name: String
    public let kind: SessionFileKind
    public let missing: Bool
    public let size: Int?
    public let updatedatms: Int?
    public let content: String?
    public init(
        path: String,
        name: String,
        kind: SessionFileKind,
        missing: Bool,
        size: Int?,
        updatedatms: Int?,
        content: String?)
    {
        self.path = path
        self.name = name
        self.kind = kind
        self.missing = missing
        self.size = size
        self.updatedatms = updatedatms
        self.content = content
    }
    private enum CodingKeys: String, CodingKey {
        case path
        case name
        case kind
        case missing
        case size
        case updatedatms = "updatedAtMs"
        case content
    }
 }
 public struct SessionsFilesListParams: Codable, Sendable {
    public let sessionkey: String
    public let agentid: String?
    public let path: String?
    public let search: String?
    public init(
        sessionkey: String,
        agentid: String? = nil,
        path: String?,
        search: String?)
    {
        self.sessionkey = sessionkey
        self.agentid = agentid
        self.path = path
        self.search = search
    }
    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
        case agentid = "agentId"
        case path
        case search
    }
 }
 public struct SessionsFilesListResult: Codable, Sendable {
    public let sessionkey: String
    public let root: String?
    public let files: [SessionFileEntry]
    public let browser: SessionFileBrowserResult?
    public init(
        sessionkey: String,
        root: String?,
        files: [SessionFileEntry],
        browser: SessionFileBrowserResult?)
    {
        self.sessionkey = sessionkey
        self.root = root
        self.files = files
        self.browser = browser
    }
    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
        case root
        case files
        case browser
    }
 }
 public struct SessionsFilesGetParams: Codable, Sendable {
    public let sessionkey: String
    public let path: String
    public let agentid: String?
    public init(
        sessionkey: String,
        path: String,
        agentid: String? = nil)
    {
        self.sessionkey = sessionkey
        self.path = path
        self.agentid = agentid
    }
    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
        case path
        case agentid = "agentId"
    }
 }
 public struct SessionsFilesGetResult: Codable, Sendable {
    public let sessionkey: String
    public let root: String?
    public let file: SessionFileEntry
    public init(
        sessionkey: String,
        root: String?,
        file: SessionFileEntry)
    {
        self.sessionkey = sessionkey
        self.root = root
        self.file = file
    }
    private enum CodingKeys: String, CodingKey {
        case sessionkey = "sessionKey"
        case root
        case file
    }
 }
 public struct SessionsCreateParams: Codable, Sendable {
    public let key: String?
    public let agentid: String?
--- a/docs/.generated/config-baseline.sha256
+++ b/docs/.generated/config-baseline.sha256
@@ -1,4 +1,4 @@
-0485ba902d2afd89d2c41cde7180d0cec2900b2db6804b9f97d42b7d85cd3af5  config-baseline.json
+37b56008790612b8293930b6a29d74490e98daa90f954fca9d133fcc28645c4c  config-baseline.json
-72bb80be618406f3337eaa2560d2559a35e49bd29576de8dd4a3aec1a6a94d92  config-baseline.core.json
+75b64c2ea081369ba4306493313a8a4cd48b784145f92fed995e6b77a5df350d  config-baseline.core.json
-1218f5555541b61bd5ddcac6441f15061b44789e2471d4ffecbe3059777c55c1  config-baseline.channel.json
+17d64c9799dfa239a49493413f1100bdd9237e9b67aaeae331a4604dbc227023  config-baseline.channel.json
-a14ac4261e98403d1a7e047070e6f151938444e27382b860315bd0c74fda4861  config-baseline.plugin.json
+f9d1f50bfa8403891e76cd99dc1357cdece4a71e8ae18a39b190c2a14e6f97b0  config-baseline.plugin.json
--- a/docs/.generated/plugin-sdk-api-baseline.sha256
+++ b/docs/.generated/plugin-sdk-api-baseline.sha256
@@ -1,2 +1,2 @@
-b121079a0912b3051a9fc319a675ef920da9db23364ca0c0ccd3c9f0a05a3a49  plugin-sdk-api-baseline.json
+2c783beea6b3cda3d79060739a923f9f39e7e8b5942123dd6b08a09143a587ca  plugin-sdk-api-baseline.json
-61a0108da670e0f44ba4b861c002eb6eaa5cf63e392d4e7e7de42044cbe7d115  plugin-sdk-api-baseline.jsonl
+0b33af2cffb42abb46682fb71c8f214da220793f13d10a34d332e75ff99e8ce9  plugin-sdk-api-baseline.jsonl
--- a/docs/automation/tasks.md
+++ b/docs/automation/tasks.md
@@ -311,9 +311,7 @@ $OPENCLAW_STATE_DIR/tasks/runs.sqlite
 The registry loads into memory at gateway start and syncs writes to SQLite for durability across restarts.
 The Gateway keeps the SQLite write-ahead log bounded by using SQLite's default
-autocheckpoint threshold plus periodic `PASSIVE` checkpoints. Shutdown and
+autocheckpoint threshold plus periodic and shutdown `TRUNCATE` checkpoints.
 explicit maintenance checkpoints still use `TRUNCATE` so normal closes can
 reclaim WAL space without making the background sweeper wait on active readers.
 ### Automatic maintenance
--- a/docs/channels/broadcast-groups.md
+++ b/docs/channels/broadcast-groups.md
@@ -161,20 +161,17 @@ Control how agents process messages:
  <Step title="Incoming message arrives">
    A WhatsApp group or DM message arrives.
  </Step>
  <Step title="Route and admission">
    OpenClaw applies channel allowlists, group activation rules, and configured ACP binding ownership.
  </Step>
  <Step title="Broadcast check">
-    If no configured ACP binding owns the route, OpenClaw checks whether the peer ID is in `broadcast`.
+    System checks if peer ID is in `broadcast`.
  </Step>
-  <Step title="If broadcast applies">
+  <Step title="If in broadcast list">
    - All listed agents process the message.
    - Each agent has its own session key and isolated context.
    - Agents process in parallel (default) or sequentially.
  </Step>
-  <Step title="If broadcast does not apply">
+  <Step title="If not in broadcast list">
-    OpenClaw dispatches the ordinary route or the configured ACP session route selected during routing.
+    Normal routing applies (first matching binding).
  </Step>
 </Steps>
@@ -325,7 +322,7 @@ Broadcast groups work alongside existing routing:
 - `GROUP_B`: agent1 AND agent2 respond (broadcast).
 <Note>
-**Precedence:** `broadcast` takes priority over ordinary route bindings. Configured ACP bindings (`bindings[].type="acp"`) are exclusive: when one matches, OpenClaw dispatches to the configured ACP session instead of fan-out broadcast.
+**Precedence:** `broadcast` takes priority over `bindings`.
 </Note>
 ## Troubleshooting
@@ -346,9 +343,9 @@ Broadcast groups work alongside existing routing:
  </Accordion>
  <Accordion title="Only one agent responding">
-    **Cause:** Peer ID might be in ordinary route bindings but not `broadcast`, or it might match an exclusive configured ACP binding.
+    **Cause:** Peer ID might be in `bindings` but not `broadcast`.
-    **Fix:** Add ordinary route-bound peers to broadcast config, or remove/change the configured ACP binding if fan-out broadcast is desired.
+    **Fix:** Add to broadcast config or remove from bindings.
  </Accordion>
  <Accordion title="Performance issues">
--- a/docs/channels/feishu.md
+++ b/docs/channels/feishu.md
@@ -416,9 +416,7 @@ Enable `dynamicAgentCreation` to automatically create **isolated agent instances
 This is essential for public bots where you want each user to have their own private AI assistant experience.
 <Note>
-Dynamic bindings include the normalized Feishu `accountId`, so default and named accounts route each sender to the correct dynamic agent.
+**Account limitation**: `dynamicAgentCreation` currently works with the **default Feishu account only**. Named/multi-account setups are not yet fully supported — dynamic bindings are created without `accountId`, so messages to named accounts may still route to `agent:main`. Track progress in [Issue #42837](https://github.com/openclaw/openclaw/issues/42837).
 If a named account created an unscoped dynamic agent on an older release, that legacy agent still counts toward `maxAgents`. Confirm that it is not used by the default account before removing it, or temporarily increase `maxAgents`; OpenClaw cannot safely infer which account owns ambiguous legacy state.
 </Note>
 ### Quick setup
@@ -449,7 +447,7 @@ If a named account created an unscoped dynamic agent on an older release, that l
 When a new user sends their first DM:
-1. The channel generates a unique `agentId`: `feishu-{user_open_id}` for the default account, or a bounded account-prefixed identity digest for a named account
+1. The channel generates a unique `agentId` = `feishu-{user_open_id}`
 2. Creates a new workspace at `workspaceTemplate` path
 3. Registers the agent and creates a binding for this user
 4. The workspace helper ensures bootstrap files (`AGENTS.md`, `SOUL.md`, `USER.md`, etc.) on first access
@@ -466,23 +464,22 @@ When a new user sends their first DM:
 Template variables:
- `{agentId}` - the generated agent ID (e.g., `feishu-ou_xxxxxx` or `feishu-support-<identity_digest>`)
+- `{agentId}` - the generated agent ID (e.g., `feishu-ou_xxxxxx`)
 - `{userId}` - the sender's Feishu open_id (e.g., `ou_xxxxxx`)
 ### Session scope
 `session.dmScope` controls how direct messages are mapped to agent sessions. This is a **global setting** that affects all channels.
-| Value                        | Behavior                                                            | Best for                                                           |
+| Value                | Behavior                                                  | Best for                                                           |
-| ---------------------------- | ------------------------------------------------------------------- | ------------------------------------------------------------------ |
+| -------------------- | --------------------------------------------------------- | ------------------------------------------------------------------ |
-| `"main"`                     | Each user's DM maps to their agent's main session                   | Single-user bots where you want `USER.md` / `SOUL.md` to auto-load |
+| `"main"`             | Each user's DM maps to their agent's main session         | Single-user bots where you want `USER.md` / `SOUL.md` to auto-load |
-| `"per-channel-peer"`         | Each (channel + user) combination gets a separate session           | Public multi-user bots needing stronger isolation                  |
+| `"per-channel-peer"` | Each (channel + user) combination gets a separate session | Public multi-user bots needing stronger isolation                  |
 | `"per-account-channel-peer"` | Each (account + channel + user) combination gets a separate session | Multi-account bots needing account-level session isolation         |
 **Tradeoff**: Using `"main"` enables automatic bootstrap file loading (`USER.md`, `SOUL.md`, `MEMORY.md`), but means all DMs across all channels share the same session key pattern. For public multi-user bots where isolation matters more than bootstrap auto-loading, consider `"per-channel-peer"` and manage bootstrap files manually.
 <Note>
-Use `"per-account-channel-peer"` when named Feishu accounts should keep separate sessions for the same sender. Dynamic bindings preserve the account scope.
+`"per-account-channel-peer"` is not recommended with `dynamicAgentCreation` because dynamic bindings are created without `accountId`. Use it only with manual bindings.
 </Note>
 ```json5
--- a/docs/channels/groups.md
+++ b/docs/channels/groups.md
@@ -586,7 +586,7 @@ Group inbound payloads set:
 - `WasMentioned` (mention gating result)
 - Telegram forum topics also include `MessageThreadId` and `IsForum`.
-The agent system prompt includes a group intro on the first turn of a new group session. It reminds the model to respond like a human, minimize empty lines and follow normal chat spacing, and avoid typing literal `\n` sequences. Non-Telegram groups also discourage Markdown tables; Telegram rich-text guidance comes from the Telegram channel prompt. Channel-sourced group names and participant labels are rendered as fenced untrusted metadata, not inline system instructions.
+The agent system prompt includes a group intro on the first turn of a new group session. It reminds the model to respond like a human, avoid Markdown tables, minimize empty lines and follow normal chat spacing, and avoid typing literal `\n` sequences. Channel-sourced group names and participant labels are rendered as fenced untrusted metadata, not inline system instructions.
 ## iMessage specifics
--- a/docs/channels/telegram.md
+++ b/docs/channels/telegram.md
@@ -311,6 +311,7 @@ curl "https://api.telegram.org/bot<bot_token>/getUpdates"
    - direct chats: preview message + `editMessageText`
    - groups/topics: preview message + `editMessageText`
    - direct-chat tool progress: optional native `sendMessageDraft` status preview when enabled and supported
    Requirement:
@@ -319,10 +320,29 @@ curl "https://api.telegram.org/bot<bot_token>/getUpdates"
    - `streaming.preview.toolProgress` controls whether tool/progress updates reuse the same edited preview message (default: `true` when preview streaming is active)
    - `streaming.preview.commandText` controls command/exec detail inside those tool-progress lines: `raw` (default, preserves released behavior) or `status` (tool label only)
    - `streaming.progress.commentary` (default: `false`) opts into assistant commentary/preamble text in the temporary progress draft
-    - legacy `channels.telegram.streamMode`, boolean `streaming` values, and retired native draft preview keys are detected; run `openclaw doctor --fix` to migrate them to current streaming config
+    - legacy `channels.telegram.streamMode` and boolean `streaming` values are detected; run `openclaw doctor --fix` to migrate them to `channels.telegram.streaming.mode`
    Tool-progress preview updates are the short status lines shown while tools run, for example command execution, file reads, planning updates, patch summaries, or Codex preamble/commentary text in Codex app-server mode. Telegram keeps these enabled by default to match released OpenClaw behavior from `v2026.4.22` and later.
    Direct chats can use native Telegram drafts for these tool-progress lines without persisting tool chatter into chat history. Native drafts stop before answer text starts; final answers stay on the normal persistent delivery path. This lane is off by default and should be gated to trusted DM IDs first:
    ```json
    {
      "channels": {
        "telegram": {
          "streaming": {
            "mode": "partial",
            "preview": {
              "toolProgress": true,
              "nativeToolProgress": true,
              "nativeToolProgressAllowFrom": ["123456789"]
            }
          }
        }
      }
    }
    ```
    To keep the edited preview for answer text but hide tool-progress lines, set:
    ```json
@@ -400,16 +420,14 @@ curl "https://api.telegram.org/bot<bot_token>/getUpdates"
  </Accordion>
-  <Accordion title="Rich message formatting">
+  <Accordion title="Formatting and HTML fallback">
-    Outbound text uses Telegram rich messages.
+    Outbound text uses Telegram `parse_mode: "HTML"`.
-    - Markdown text is sent as rich Markdown without converting it to HTML.
+    - Markdown-ish text is rendered to Telegram-safe HTML.
-    - Explicit HTML payloads are sent as rich HTML.
+    - Supported Telegram HTML tags are preserved; unsupported HTML is escaped.
-    - Media captions still use Telegram HTML captions because rich messages do not replace captions.
+    - If Telegram rejects parsed HTML, OpenClaw retries as plain text.
-    Long rich text is split automatically across Telegram's rich text and rich block limits. Tables over Telegram's column limit are sent as code blocks.
+    Link previews are enabled by default and can be disabled with `channels.telegram.linkPreview: false`.
    Link previews are enabled by default. `channels.telegram.linkPreview: false` skips automatic entity detection for rich text.
  </Accordion>
--- a/docs/channels/whatsapp.md
+++ b/docs/channels/whatsapp.md
@@ -319,40 +319,6 @@ content and identifiers.
  </Tab>
 </Tabs>
 ## Configured ACP bindings
 WhatsApp supports persistent ACP bindings with top-level `bindings[]` entries:
 ```json5
 {
  bindings: [
    {
      type: "acp",
      agentId: "codex",
      match: {
        channel: "whatsapp",
        accountId: "work",
        peer: { kind: "direct", id: "+15555550123" },
      },
    },
    {
      type: "acp",
      agentId: "codex",
      match: {
        channel: "whatsapp",
        accountId: "work",
        peer: { kind: "group", id: "120363424282127706@g.us" },
      },
    },
  ],
 }
 ```
 - Direct chats match E.164 numbers such as `+15555550123`.
 - Groups match WhatsApp group JIDs such as `120363424282127706@g.us`.
 - Group allowlists, sender policy, and mention or activation gating run before OpenClaw ensures the configured ACP session exists.
 - A matched configured ACP binding owns the route. WhatsApp broadcast groups do not fan out that turn to ordinary WhatsApp sessions.
 ## Personal-number and self-chat behavior
 When the linked self number is also present in `allowFrom`, WhatsApp self-chat safeguards activate:
--- a/docs/ci.md
+++ b/docs/ci.md
@@ -200,19 +200,13 @@ from `release/YYYY.M.PATCH` or `main` after the release tag exists and after the
 OpenClaw npm preflight has succeeded. It verifies `pnpm plugins:sync:check`,
 dispatches `Plugin NPM Release` for all publishable plugin packages, dispatches
 `Plugin ClawHub Release` for the same release SHA, and only then dispatches
-`OpenClaw NPM Release` with the saved `preflight_run_id`. Stable publish also
+`OpenClaw NPM Release` with the saved `preflight_run_id`.
 requires an exact `windows_node_tag`; the workflow verifies the Windows source
 release and compares its x64/ARM64 installers with the candidate-approved
 `windows_node_installer_digests` input before any publish child, then promotes
 and verifies those same pinned installer digests plus the exact companion asset
 and checksum contract before publishing the GitHub release draft.
 ```bash
 gh workflow run openclaw-release-publish.yml \
  --ref release/YYYY.M.PATCH \
  -f tag=vYYYY.M.PATCH-beta.N \
  -f preflight_run_id=<successful-openclaw-npm-preflight-run-id> \
  -f full_release_validation_run_id=<successful-full-release-validation-run-id> \
  -f npm_dist_tag=beta
 ```
@@ -458,7 +452,7 @@ For normal PRs, follow scoped CI/check evidence instead of treating parity as a
 The `CodeQL` workflow is intentionally a narrow first-pass security scanner, not the full repository sweep. Daily, manual, and non-draft pull request guard runs scan Actions workflow code plus the highest-risk JavaScript/TypeScript surfaces with high-confidence security queries filtered to high/critical `security-severity`.
-The pull request guard stays light: it only starts for changes under `.github/actions`, `.github/codeql`, `.github/workflows`, `packages`, or `src`, and it runs the same high-confidence security matrix as the scheduled workflow. Android and macOS CodeQL stay out of PR defaults.
+The pull request guard stays light: it only starts for changes under `.github/actions`, `.github/codeql`, `.github/workflows`, `packages`, `scripts`, `src`, or process-owning bundled plugin runtime paths, and it runs the same high-confidence security matrix as the scheduled workflow. Android and macOS CodeQL stay out of PR defaults.
 ### Security categories
@@ -468,6 +462,7 @@ The pull request guard stays light: it only starts for changes under `.github/ac
 | `/codeql-security-high/channel-runtime-boundary`  | Core channel implementation contracts plus the channel plugin runtime, gateway, Plugin SDK, secrets, audit touchpoints              |
 | `/codeql-security-high/network-ssrf-boundary`     | Core SSRF, IP parsing, network guard, web-fetch, and Plugin SDK SSRF policy surfaces                                                |
 | `/codeql-security-high/mcp-process-tool-boundary` | MCP servers, process execution helpers, outbound delivery, and agent tool-execution gates                                           |
 | `/codeql-security-high/process-exec-boundary`     | Local shell, process spawn helpers, subprocess-owning bundled plugin runtimes, and workflow script glue                             |
 | `/codeql-security-high/plugin-trust-boundary`     | Plugin install, loader, manifest, registry, package-manager install, source-loading, and Plugin SDK package contract trust surfaces |
 ### Platform-specific security shards
--- a/docs/cli/browser.md
+++ b/docs/cli/browser.md
@@ -174,22 +174,7 @@ Notes:
  or `--element`.
 - `existing-session` / `user` profiles support page screenshots and `--ref`
  screenshots from snapshot output, but not CSS `--element` screenshots.
- `--labels` overlays current snapshot refs on the screenshot. On
+- `--labels` overlays current snapshot refs on the screenshot.
  Playwright-backed profiles, it works with `--full-page` (full-page label
  overlay), `--ref` (element-clip label overlay by ARIA ref), and `--element`
  (element-clip label overlay by CSS selector); in element-clip modes, labels
  are projected relative to the element. The response also includes an
  `annotations` array with each ref's bounding box. Each item has `ref`,
  `number`, `role`, optional `name`, and `box: {x, y, width, height}`;
  coordinates are in the captured image's space (viewport / fullpage /
  element-relative). The field is omitted when empty.
  `existing-session` profiles render a chrome-mcp overlay on page screenshots
  but do not use the Playwright projection helper and do not include
  `annotations`; CSS `--element` screenshots are unsupported there. Without
  Playwright or chrome-mcp, labeled screenshots are not available. Prior
  releases ignored `--full-page`, `--ref`, and `--element` on labeled
  Playwright screenshots and always returned a viewport capture; labeled
  screenshots now honor those scopes.
 - `snapshot --urls` appends discovered link destinations to AI snapshots so
  agents can choose direct navigation targets instead of guessing from link
  text alone.
--- a/docs/cli/onboard.md
+++ b/docs/cli/onboard.md
@@ -182,10 +182,7 @@ Interactive onboarding behavior with reference mode:
 ### Non-interactive Z.AI endpoint choices
 <Note>
-`--auth-choice zai-api-key` auto-detects the best Z.AI endpoint and model for
+`--auth-choice zai-api-key` auto-detects the best Z.AI endpoint for your key (prefers the general API with `zai/glm-5.1`). If you specifically want the GLM Coding Plan endpoints, pick `zai-coding-global` or `zai-coding-cn`.
 your key. Coding Plan endpoints prefer `zai/glm-5.2`; general API endpoints use
 `zai/glm-5.1`. To force a Coding Plan endpoint, pick `zai-coding-global` or
 `zai-coding-cn`.
 </Note>
 ```bash
--- a/docs/cli/plugins.md
+++ b/docs/cli/plugins.md
@@ -159,7 +159,7 @@ is available, then fall back to `latest`.
  <Accordion title="--dangerously-force-unsafe-install">
    `--dangerously-force-unsafe-install` is deprecated and is now a no-op. OpenClaw no longer runs built-in install-time dangerous-code blocking for plugin installs.
-    Use the shared operator-owned `security.installPolicy` surface when host-specific install policy is required. Plugin `before_install` hooks are plugin-runtime lifecycle hooks and are not the primary policy boundary for CLI installs.
+    Use the shared operator-owned `security.installPolicy` surface when host-specific install policy is required. Plugin `before_install` hooks and `security.installPolicy` can still block installs.
    If a plugin you published on ClawHub is hidden or blocked by a registry scan, use the publisher steps in [ClawHub publishing](/clawhub/publishing). `--dangerously-force-unsafe-install` does not ask ClawHub to rescan the plugin or make a blocked release public.
@@ -405,7 +405,7 @@ Updates apply to tracked plugin installs in the managed plugin index and tracked
  </Accordion>
  <Accordion title="--dangerously-force-unsafe-install on update">
-    `--dangerously-force-unsafe-install` is also accepted on `plugins update` for compatibility, but it is deprecated and no longer changes plugin update behavior. Operator `security.installPolicy` can still block updates; plugin `before_install` hooks only apply in processes where plugin hooks are loaded.
+    `--dangerously-force-unsafe-install` is also accepted on `plugins update` for compatibility, but it is deprecated and no longer changes plugin update behavior. Operator `security.installPolicy` and plugin `before_install` hooks can still block updates.
  </Accordion>
 </AccordionGroup>
--- a/docs/concepts/active-memory.md
+++ b/docs/concepts/active-memory.md
@@ -479,9 +479,6 @@ names that plugin registers. Active Memory lists those tools in the recall
 prompt and passes the same list to the embedded sub-agent. If none of the
 configured tools are available, or the memory sub-agent fails, Active Memory
 skips recall for that turn and the main reply continues without memory context.
 For custom recall tools, non-empty model-visible tool output counts as recall
 evidence unless structured result fields explicitly report an empty result or
 failure.
 `toolsAllow` only accepts concrete memory tool names. Wildcards, `group:*`
 entries, and core agent tools such as `read`, `exec`, `message`, and
 `web_search` are ignored before the hidden memory sub-agent starts.
@@ -746,11 +743,7 @@ Before v2026.5.2 the plugin silently extended your configured `timeoutMs` by an
 extra 30000 ms during cold-start so model warm-up, embedding-index load, and
 the first recall could share one larger budget. v2026.5.2 moved that grace
 behind an explicit `setupGraceTimeoutMs` config — your configured `timeoutMs`
-is now the recall-work budget by default, unless you opt in. The blocking hook
+is now the budget by default, unless you opt in.
 uses two bounded phases around that budget: up to 1500 ms for session/config
 preflight before recall starts, then a separate fixed 1500 ms for abort
 settlement and transcript recovery after recall work stops. Neither allowance
 extends model or tool execution.
 If you upgraded from v2026.4.x and you set `timeoutMs` to a value tuned for the
 old implicit-grace world (the recommended starter `timeoutMs: 15000` is one
@@ -772,16 +765,14 @@ outer watchdog budgets back to the pre-v5.2 effective values:
 }
 ```
-The v2026.5.2 change removed the old implicit 30000 ms cold-start extension.
+Per the v2026.5.2 changelog: _"use the configured recall timeout as the
-Beyond the configured recall-work budget, the hook can use up to 1500 ms for
+blocking prompt-build hook budget by default and move cold-start setup grace
-preflight and another 1500 ms for post-recall completion. Its worst-case
+behind explicit `setupGraceTimeoutMs` config, so the plugin no longer silently
-blocking time is therefore `timeoutMs + setupGraceTimeoutMs + 3000` ms.
+extends 15000 ms configs to 45000 ms on the main lane."_
 The embedded recall runner uses the same effective timeout budget, so
 `setupGraceTimeoutMs` covers both the outer prompt-build watchdog and the inner
-blocking recall run. The preflight cap covers session/config checks before that
+blocking recall run.
 budget begins. The post-recall allowance lets the outer hook settle abort
 cleanup and read any final transcript state.
 For resource-tight gateways where cold-start latency is a known trade-off,
 lower values (5000–15000 ms) work too — the trade-off is a higher chance of
--- a/docs/concepts/agent-loop.md
+++ b/docs/concepts/agent-loop.md
@@ -97,7 +97,7 @@ These run inside the agent loop or gateway pipeline:
 - **`agent_end`**: inspect the final message list and run metadata after completion.
 - **`before_compaction` / `after_compaction`**: observe or annotate compaction cycles.
 - **`before_tool_call` / `after_tool_call`**: intercept tool params/results.
- **`before_install`**: inspect staged skill or plugin install material after operator install policy runs, when plugin hooks are loaded in the current OpenClaw process.
+- **`before_install`**: inspect install context and optionally block skill or plugin installs after operator install policy runs.
 - **`tool_result_persist`**: synchronously transform tool results before they are written to an OpenClaw-owned session transcript.
 - **`message_received` / `message_sending` / `message_sent`**: inbound + outbound message hooks.
 - **`session_start` / `session_end`**: session lifecycle boundaries.
@@ -109,7 +109,6 @@ Hook decision rules for outbound/tool guards:
 - `before_tool_call`: `{ block: false }` is a no-op and does not clear a prior block.
 - `before_install`: `{ block: true }` is terminal and stops lower-priority handlers.
 - `before_install`: `{ block: false }` is a no-op and does not clear a prior block.
 - Use `security.installPolicy`, not `before_install`, for operator-owned install allow/block decisions that must cover CLI install and update paths.
 - `message_sending`: `{ cancel: true }` is terminal and stops lower-priority handlers.
 - `message_sending`: `{ cancel: false }` is a no-op and does not clear a prior cancel.
--- a/docs/concepts/mantis.md
+++ b/docs/concepts/mantis.md
@@ -247,13 +247,12 @@ of only a bot-to-bot Slack transcript.
 evidence pipeline. It checks out the trusted candidate ref in a separate
 worktree, runs `pnpm openclaw qa telegram --credential-source convex
 --credential-role ci`, writes a `mantis-evidence.json` manifest from the
-Telegram QA summary, `qa-evidence.json`, and report artifacts, renders the
+Telegram QA summary and observed-message artifact, renders the redacted
-redacted evidence HTML through a Crabbox desktop browser, generates a
+transcript HTML through a Crabbox desktop browser, generates a motion-trimmed GIF
-motion-trimmed GIF with `crabbox media preview`, and posts the inline PR
+with `crabbox media preview`, and posts the inline PR evidence comment when a PR
-evidence comment when a PR number is available. This lane is QA-evidence visual
+number is available. This lane is transcript-visual rather than logged-in
-rather than logged-in Telegram Web proof: the Telegram Bot API gives stable live
+Telegram Web proof: the Telegram Bot API gives stable live message evidence, but
-message evidence, but Telegram Web login state is not required for normal Mantis
+Telegram Web login state is not required for normal Mantis automation.
 automation.
 `Mantis Telegram Desktop Proof` is the agentic native Telegram Desktop
 before/after wrapper. A maintainer can trigger it from a PR comment with
@@ -495,8 +494,8 @@ zero:
 - `pnpm openclaw qa discord` already runs a live Discord lane with driver and
  SUT bots.
- The live transport runner already writes reports, QA evidence, and
+- The live transport runner already writes reports and observed-message
-  transport-specific artifacts under `.artifacts/qa-e2e/`.
+  artifacts under `.artifacts/qa-e2e/`.
 - Convex credential leases already provide exclusive access to shared live
  transport credentials.
 - The browser control service already supports screenshots, snapshots,
--- a/docs/concepts/model-providers.md
+++ b/docs/concepts/model-providers.md
@@ -264,7 +264,7 @@ Gemini CLI JSON replies are parsed from `response`; usage falls back to `stats`,
 - Provider: `zai`
 - Auth: `ZAI_API_KEY`
- Example model: `zai/glm-5.2`
+- Example model: `zai/glm-5.1`
 - CLI: `openclaw onboard --auth-choice zai-api-key`
  - Model refs use the canonical `zai/*` provider ID.
  - `zai-api-key` auto-detects the matching Z.AI endpoint; `zai-coding-global`, `zai-coding-cn`, `zai-global`, and `zai-cn` force a specific surface
--- a/docs/concepts/qa-e2e-automation.md
+++ b/docs/concepts/qa-e2e-automation.md
@@ -318,17 +318,17 @@ Matrix has a [dedicated page](/concepts/qa-matrix) because of its scenario count
 These lanes register through `extensions/qa-lab/src/live-transports/shared/live-transport-cli.ts` and accept the same flags:
-| Flag                                  | Default                                            | Description                                                                                                                                     |
+| Flag                                  | Default                                            | Description                                                                                                           |
-| ------------------------------------- | -------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------- |
+| ------------------------------------- | -------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------- |
-| `--scenario <id>`                     | -                                                  | Run only this scenario. Repeatable.                                                                                                             |
+| `--scenario <id>`                     | -                                                  | Run only this scenario. Repeatable.                                                                                   |
-| `--output-dir <path>`                 | `<repo>/.artifacts/qa-e2e/<transport>-<timestamp>` | Where reports, summaries, evidence, transport-specific artifacts, and the output log are written. Relative paths resolve against `--repo-root`. |
+| `--output-dir <path>`                 | `<repo>/.artifacts/qa-e2e/<transport>-<timestamp>` | Where reports/summary/observed messages and the output log are written. Relative paths resolve against `--repo-root`. |
-| `--repo-root <path>`                  | `process.cwd()`                                    | Repository root when invoking from a neutral cwd.                                                                                               |
+| `--repo-root <path>`                  | `process.cwd()`                                    | Repository root when invoking from a neutral cwd.                                                                     |
-| `--sut-account <id>`                  | `sut`                                              | Temporary account id inside the QA gateway config.                                                                                              |
+| `--sut-account <id>`                  | `sut`                                              | Temporary account id inside the QA gateway config.                                                                    |
-| `--provider-mode <mode>`              | `live-frontier`                                    | `mock-openai` or `live-frontier` (legacy `live-openai` still works).                                                                            |
+| `--provider-mode <mode>`              | `live-frontier`                                    | `mock-openai` or `live-frontier` (legacy `live-openai` still works).                                                  |
-| `--model <ref>` / `--alt-model <ref>` | provider default                                   | Primary/alternate model refs.                                                                                                                   |
+| `--model <ref>` / `--alt-model <ref>` | provider default                                   | Primary/alternate model refs.                                                                                         |
-| `--fast`                              | off                                                | Provider fast mode where supported.                                                                                                             |
+| `--fast`                              | off                                                | Provider fast mode where supported.                                                                                   |
-| `--credential-source <env\|convex>`   | `env`                                              | See [Convex credential pool](#convex-credential-pool).                                                                                          |
+| `--credential-source <env\|convex>`   | `env`                                              | See [Convex credential pool](#convex-credential-pool).                                                                |
-| `--credential-role <maintainer\|ci>`  | `ci` in CI, `maintainer` otherwise                 | Role used when `--credential-source convex`.                                                                                                    |
+| `--credential-role <maintainer\|ci>`  | `ci` in CI, `maintainer` otherwise                 | Role used when `--credential-source convex`.                                                                          |
 Each lane exits non-zero on any failed scenario. `--allow-failures` writes artifacts without setting a failing exit code.
@@ -346,6 +346,10 @@ Required env when `--credential-source env`:
 - `OPENCLAW_QA_TELEGRAM_DRIVER_BOT_TOKEN`
 - `OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN`
 Optional:
 - `OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT=1` keeps message bodies in observed-message artifacts (default redacts).
 Scenarios (`extensions/qa-lab/src/live-transports/telegram/telegram-live.runtime.ts`):
 - `telegram-canary`
@@ -371,26 +375,26 @@ Output artifacts:
 - `telegram-qa-report.md`
 - `qa-evidence.json` - evidence entries for the live transport checks, including profile, coverage, provider, channel, artifacts, result, and RTT fields.
 - `telegram-qa-observed-messages.json` - bodies redacted unless `OPENCLAW_QA_TELEGRAM_CAPTURE_CONTENT=1`.
-Package Telegram runs use the same Telegram credential contract. Repeated RTT
+Package RTT comparison uses the same Telegram credential contract while keeping
-measurement is part of the normal package Telegram live lane; the RTT
+its RTT sample controls on the RTT harness path:
 distribution is folded into `qa-evidence.json` under `result.timing` for the
 selected RTT check.
 ```bash
-OPENCLAW_QA_CREDENTIAL_SOURCE=convex \
+pnpm rtt openclaw@beta \
-pnpm test:docker:npm-telegram-live
+  --credential-source convex \
  --credential-role maintainer \
  --samples 20 \
  --sample-timeout-ms 30000
 ```
-When `OPENCLAW_QA_CREDENTIAL_SOURCE=convex` is set, the package live wrapper
+When `--credential-source convex` is set, the RTT Docker wrapper leases a
-leases a `kind: "telegram"` credential, exports the leased group/driver/SUT bot
+`kind: "telegram"` credential, exports the leased group/driver/SUT bot env into
-env into the installed-package run, heartbeats the lease, and releases it on
+the installed-package run, heartbeats the lease, and releases it on shutdown.
-shutdown. The package wrapper defaults to 20 RTT checks of
+`--samples` and `--sample-timeout-ms` still feed
-`telegram-mentioned-message-reply`, a 30s RTT timeout, and Convex role
+`OPENCLAW_NPM_TELEGRAM_WARM_SAMPLES` and
-`maintainer` outside CI when Convex is selected. Override
+`OPENCLAW_NPM_TELEGRAM_SAMPLE_TIMEOUT_MS`, so `result.json` remains comparable
-`OPENCLAW_NPM_TELEGRAM_RTT_SAMPLES`, `OPENCLAW_NPM_TELEGRAM_RTT_TIMEOUT_MS`,
+across env-backed and Convex-backed RTT runs.
 or `OPENCLAW_NPM_TELEGRAM_RTT_MAX_FAILURES` to tune RTT measurement without
 creating a separate RTT command or Telegram-specific summary format.
 ### Discord QA
--- a/docs/concepts/usage-tracking.md
+++ b/docs/concepts/usage-tracking.md
@@ -32,13 +32,8 @@ title: "Usage tracking"
 ## Custom `/usage full` footer
-`/usage full` shows a built-in compact footer with model, reasoning, fast/slow,
+Set `messages.usageTemplate` to customize the per-response `/usage full`
-context window, turn tokens, cache, and cost when those fields are available. No
+footer. The value can be an inline template object or a JSON file path:
 template file is required.
 `messages.usageTemplate` is only for advanced custom layouts. The value is a
 JSON file path (supports `~`) or an inline object, and it replaces the built-in
 footer when valid:
 ```json
 {
@@ -48,182 +43,9 @@ footer when valid:
 }
 ```
-Missing or empty templates fall back to the built-in footer quietly. Unreadable
+Templates read the `openclaw.usageLine.v1` contract and can use `scales`,
-or invalid configured templates also fall back to the built-in footer and emit an
+`aliases`, and `output.surfaces` to render channel-specific footers. Missing,
-operator warning.
+unreadable, invalid, or empty templates fall back to the built-in usage line.
 Start custom templates from the built-in shape, then edit the parts you want to
 change:
 ```jsonc
 {
  "schema": "openclaw.usageBar.v1",
  "scales": {
    "braille": "⠐⡀⡄⡆⡇⣇⣧⣷⣿",
    "block": "░▏▎▍▌▋▊▉█",
    "shade": "░▒▓█",
    "moon": "🌑🌘🌗🌖🌕",
    "level": "▁▂▃▄▅▆▇█",
    "weather": ["🥶", "☁️", "🌥", "⛅️", "🌤", "☀️"],
    "plants": ["🪾", "🍂", "🌱", "☘️", "🍀", "🌿"],
    "moons6": ["🌑", "🌚", "🌘", "🌗", "🌖", "🌝"],
  },
  "aliases": {
    "models": {
      "claude-opus-4-6": "opus46",
      "claude-opus-4-8": "opus48",
      "claude-sonnet-4-6": "sonnet46",
      "claude-haiku-4-5": "haiku45",
      "gpt-5.5": "gpt5.5",
    },
    "reasoning": {
      "off": "🌑",
      "minimal": "🌚",
      "low": "🌘",
      "medium": "🌗",
      "high": "🌕",
      "xhigh": "🌝",
    },
  },
  "output": {
    "sep": "",
    "default": [
      { "text": "{model.provider}{identity.emoji|🤖} {model.display_name|alias:models}" },
      { "map": "model.is_fallback", "cases": { "true": " 🔄" } },
      { "map": "model.is_override", "cases": { "true": " 📌" } },
      { "when": "model.reasoning", "text": " {model.reasoning|alias:reasoning}" },
      { "map": "state.fast_mode", "cases": { "true": " ⚡", "false": " 🐌" } },
      {
        "when": "context.max_tokens",
        "text": " | 📚 [{context.pct_used|meter:5:braille}]{context.max_tokens|num}",
      },
      {
        "when": "usage.has_split_tokens",
        "text": " ↕️ {usage.input_tokens|num|?}/{usage.output_tokens|num|?}",
      },
      { "when": "usage.has_total_only_tokens", "text": " ↕️ {usage.total_tokens|num}" },
      { "when": "usage.cache_hit_pct", "text": " 🗄 {usage.cache_hit_pct|pct}" },
      { "when": "cost.turn_usd", "text": " 💰{cost.turn_usd|fixed:4}" },
    ],
    "surfaces": {
      "discord": [
        { "text": "-# -\n" },
        { "text": "-# {model.provider}{identity.emoji|🤖} {model.display_name|alias:models}" },
        { "map": "model.is_fallback", "cases": { "true": "🔄" } },
        { "map": "model.is_override", "cases": { "true": "📌" } },
        { "when": "model.reasoning", "text": " {model.reasoning|alias:reasoning}" },
        { "map": "state.fast_mode", "cases": { "true": " ⚡️", "false": " 🐌" } },
        {
          "when": "context.max_tokens",
          "text": " | 📚 [{context.pct_used|meter:5:braille}]{context.max_tokens|num}",
        },
        {
          "when": "usage.has_split_tokens",
          "text": " ↕️ {usage.input_tokens|num|?}/{usage.output_tokens|num|?}",
        },
        { "when": "usage.has_total_only_tokens", "text": " ↕️ {usage.total_tokens|num}" },
        { "when": "usage.cache_hit_pct", "text": " 🗄 {usage.cache_hit_pct|pct}" },
        { "when": "cost.turn_usd", "text": " 💰{cost.turn_usd|fixed:4}" },
      ],
    },
  },
 }
 ```
 ### Shape
 ```jsonc
 {
  "schema": "openclaw.usageBar.v1",
  "scales": { "<name>": "low-to-high glyphs" }, // string (1 glyph/char) or array
  "aliases": { "<table>": { "<value>": "<label>" } },
  "output": {
    "sep": "", // joins surviving pieces
    "default": [
      /* pieces */
    ], // fallback for any surface
    "surfaces": {
      "discord": [
        /* pieces */
      ],
      "telegram": [
        /* pieces */
      ],
    },
  },
 }
 ```
 Each surface is an ordered list of **pieces**; the engine renders each, drops
 empties, and joins survivors with `sep`. A surface with no entry uses
 `output.default`.
 ### Contract Paths
 A piece reads values from the per-turn contract by dot-path. Absent values are
 empty (so a `when` guard or a `|fallback` keeps the piece clean).
 | Path                                                                                | Meaning                                |
 | ----------------------------------------------------------------------------------- | -------------------------------------- |
 | `surface`                                                                           | channel id (`discord`/`telegram`/etc.) |
 | `model.provider` / `model.display_name`                                             | provider id / model id                 |
 | `model.reasoning`                                                                   | effort (`off` through `xhigh`)         |
 | `model.is_fallback` / `model.is_override`                                           | bool: fallback used / model pinned     |
 | `state.fast_mode`                                                                   | bool: fast vs slow                     |
 | `context.max_tokens` / `context.pct_used`                                           | window budget / 0-100 used             |
 | `usage.input_tokens` / `usage.output_tokens` / `usage.total_tokens`                 | turn aggregate                         |
 | `usage.has_split_tokens` / `usage.has_total_only_tokens` / `usage.cache_hit_pct`    | token display guards and cache percent |
 | `usage.last.input_tokens` / `usage.last.output_tokens` / `usage.last.cache_hit_pct` | final model call only                  |
 | `cost.turn_usd`                                                                     | estimated turn cost                    |
 | `identity.name` / `identity.emoji`                                                  | agent name / chosen emoji              |
 (Provider rate-limit windows are **not** in this contract.)
 ### Verbs
 Pipe a value through verbs left to right; a non-verb segment is the fallback.
 | Verb            | Effect                                | Example                           |
 | --------------- | ------------------------------------- | --------------------------------- |
 | `num`           | compact count                         | `272000 -> 272k`                  |
 | `fixed:N`       | N decimals (default 2)                | `0.0377`                          |
 | `dur`           | seconds to duration                   | `14820 -> 4h07m`                  |
 | `pct`           | append `%`                            | `96 -> 96%`                       |
 | `inv`           | `100 - x`                             | for used to remaining             |
 | `alias:TABLE`   | lookup in `aliases`, echo if unlisted | `medium -> 🌗`                    |
 | `meter:W:SCALE` | W-cell glyph bar over a 0-100 value   | `[⣿⣿⠐⠐⠐]` (`meter:1` = one glyph) |
 ### Piece forms
 - `{ "text": "📚 {context.max_tokens|num}" }`: literal + interpolation.
 - `{ "when": "<path>", "text": "..." }`: render only if the path is truthy.
 - `{ "map": "<path>", "cases": { "true": "⚡", "false": "🐌" } }`: value to glyph.
 - `{ "each": "limits.windows", "item": "{label}" }`: iterate an array.
 ### Example
 ```jsonc
 {
  "schema": "openclaw.usageBar.v1",
  "scales": { "braille": "⠐⡀⡄⡆⡇⣇⣧⣷⣿" },
  "aliases": { "reasoning": { "medium": "🌗", "high": "🌕" } },
  "output": {
    "surfaces": {
      "discord": [
        { "text": "{model.display_name}" },
        { "when": "model.reasoning", "text": " {model.reasoning|alias:reasoning}" },
        { "map": "state.fast_mode", "cases": { "true": " ⚡", "false": " 🐌" } },
        {
          "when": "context.max_tokens",
          "text": " | 📚 [{context.pct_used|meter:5:braille}]{context.max_tokens|num}",
        },
      ],
    },
  },
 }
 ```
 renders e.g. `claude-sonnet-4-6 🌗 🐌 | 📚 [⣿⣿⣿⣿⣧]272k`.
 ## Providers + credentials
--- a/docs/gateway/config-channels.md
+++ b/docs/gateway/config-channels.md
@@ -130,8 +130,6 @@ WhatsApp runs through the gateway's web channel (Baileys Web). It starts automat
 }
 ```
 - Top-level `bindings[]` entries with `type: "acp"` configure persistent ACP bindings for WhatsApp DMs and groups. Use an E.164 direct number or WhatsApp group JID in `match.peer.id`. Field semantics are shared in [ACP Agents](/tools/acp-agents#persistent-channel-bindings).
 <Accordion title="Multi-account WhatsApp">
 ```json5
--- a/docs/gateway/config-tools.md
+++ b/docs/gateway/config-tools.md
@@ -339,7 +339,7 @@ Configures inbound media understanding (image/audio/video):
    - `capabilities`: optional list (`image`, `audio`, `video`). Defaults: `openai`/`anthropic`/`minimax` → image, `google` → image+audio+video, `groq` → audio.
    - `prompt`, `maxChars`, `maxBytes`, `timeoutSeconds`, `language`: per-entry overrides.
-    - `tools.media.image.timeoutSeconds` and matching image model `timeoutSeconds` entries also apply when the agent calls the explicit `image` tool. For image understanding, this timeout applies to the request itself and is not reduced by earlier preparation work.
+    - `tools.media.image.timeoutSeconds` and matching image model `timeoutSeconds` entries also apply when the agent calls the explicit `image` tool.
    - Failures fall back to the next entry.
    Provider auth follows standard order: `auth-profiles.json` → env vars → `models.providers.*.apiKey`.
--- a/docs/help/testing-live.md
+++ b/docs/help/testing-live.md
@@ -73,7 +73,7 @@ Live tests are split into two layers so we can isolate failures:
  - `pnpm test:live` (or `OPENCLAW_LIVE_TEST=1` if invoking Vitest directly)
 - Set `OPENCLAW_LIVE_MODELS=modern`, `small`, or `all` (alias for modern) to actually run this suite; otherwise it skips to keep `pnpm test:live` focused on gateway smoke
 - How to select models:
-  - `OPENCLAW_LIVE_MODELS=modern` to run the modern allowlist (Opus/Sonnet 4.6+, GPT-5.2 + Codex, Gemini 3, DeepSeek V4, GLM 5.1, MiniMax M3, Grok 4.3)
+  - `OPENCLAW_LIVE_MODELS=modern` to run the modern allowlist (Opus/Sonnet 4.6+, GPT-5.2 + Codex, Gemini 3, DeepSeek V4, GLM 4.7, MiniMax M3, Grok 4.3)
  - `OPENCLAW_LIVE_MODELS=small` to run the constrained small-model allowlist (Qwen 8B/9B local-compatible routes, Ollama Gemma, OpenRouter Qwen/GLM, and Z.AI GLM)
  - `OPENCLAW_LIVE_MODELS=all` is an alias for the modern allowlist
  - or `OPENCLAW_LIVE_MODELS="openai/gpt-5.5,anthropic/claude-opus-4-6,..."` (comma allowlist)
@@ -357,9 +357,6 @@ Narrow, explicit allowlists are fastest and least flaky:
 - Tool calling across several providers:
  - `OPENCLAW_LIVE_GATEWAY_MODELS="openai/gpt-5.5,anthropic/claude-opus-4-6,google/gemini-3-flash-preview,deepseek/deepseek-v4-flash,zai/glm-5.1,minimax/MiniMax-M3" pnpm test:live src/gateway/gateway-models.profiles.live.test.ts`
 - Z.AI Coding Plan GLM-5.2 direct smoke:
  - `ZAI_CODING_LIVE_TEST=1 pnpm test:live src/agents/zai.live.test.ts`
 - Google focus (Gemini API key + Antigravity):
  - Gemini (API key): `OPENCLAW_LIVE_GATEWAY_MODELS="google/gemini-3-flash-preview" pnpm test:live src/gateway/gateway-models.profiles.live.test.ts`
  - Antigravity (OAuth): `OPENCLAW_LIVE_GATEWAY_MODELS="google-antigravity/claude-opus-4-6-thinking,google-antigravity/gemini-3-pro-high" pnpm test:live src/gateway/gateway-models.profiles.live.test.ts`
@@ -391,7 +388,7 @@ This is the "common models" run we expect to keep working:
 - Google (Gemini API): `google/gemini-3.1-pro-preview` and `google/gemini-3-flash-preview` (avoid older Gemini 2.x models)
 - Google (Antigravity): `google-antigravity/claude-opus-4-6-thinking` and `google-antigravity/gemini-3-flash`
 - DeepSeek: `deepseek/deepseek-v4-flash` and `deepseek/deepseek-v4-pro`
- Z.AI (GLM): `zai/glm-5.1` (general API) or `zai/glm-5.2` (Coding Plan)
+- Z.AI (GLM): `zai/glm-5.1`
 - MiniMax: `minimax/MiniMax-M3`
 Run gateway smoke with tools + image:
@@ -405,7 +402,7 @@ Pick at least one per provider family:
 - Anthropic: `anthropic/claude-opus-4-6` (or `anthropic/claude-sonnet-4-6`)
 - Google: `google/gemini-3-flash-preview` (or `google/gemini-3.1-pro-preview`)
 - DeepSeek: `deepseek/deepseek-v4-flash`
- Z.AI (GLM): `zai/glm-5.1` (general API) or `zai/glm-5.2` (Coding Plan)
+- Z.AI (GLM): `zai/glm-5.1`
 - MiniMax: `minimax/MiniMax-M3`
 Optional additional coverage (nice to have):
--- a/docs/help/testing.md
+++ b/docs/help/testing.md
@@ -218,27 +218,17 @@ inside every shard.
    `OPENCLAW_NPM_TELEGRAM_PACKAGE_TGZ=/path/to/openclaw-current.tgz` or
    `OPENCLAW_CURRENT_PACKAGE_TGZ` to test a resolved local tarball instead of
    installing from the registry.
  - Emits repeated RTT timing in `qa-evidence.json` by default with
    `OPENCLAW_NPM_TELEGRAM_RTT_SAMPLES=20`. Override
    `OPENCLAW_NPM_TELEGRAM_RTT_SAMPLES`,
    `OPENCLAW_NPM_TELEGRAM_RTT_TIMEOUT_MS`, or
    `OPENCLAW_NPM_TELEGRAM_RTT_MAX_FAILURES` to tune the RTT run.
    `OPENCLAW_NPM_TELEGRAM_RTT_CHECKS` accepts a comma-separated list of
    Telegram QA check IDs to sample; when unset, the default RTT-capable check
    is `telegram-mentioned-message-reply`.
  - Uses the same Telegram env credentials or Convex credential source as
    `pnpm openclaw qa telegram`. For CI/release automation, set
    `OPENCLAW_NPM_TELEGRAM_CREDENTIAL_SOURCE=convex` plus
-    `OPENCLAW_QA_CONVEX_SITE_URL` and a role secret. If
+    `OPENCLAW_QA_CONVEX_SITE_URL` and the role secret. If
    `OPENCLAW_QA_CONVEX_SITE_URL` and a Convex role secret are present in CI,
    the Docker wrapper selects Convex automatically.
  - The wrapper validates Telegram or Convex credential env on the host before
    Docker build/install work. Set `OPENCLAW_NPM_TELEGRAM_SKIP_CREDENTIAL_PREFLIGHT=1`
    only when deliberately debugging pre-credential setup.
  - `OPENCLAW_NPM_TELEGRAM_CREDENTIAL_ROLE=ci|maintainer` overrides the shared
-    `OPENCLAW_QA_CREDENTIAL_ROLE` for this lane only. When Convex credentials
+    `OPENCLAW_QA_CREDENTIAL_ROLE` for this lane only.
    are selected and no role is set, the wrapper uses `ci` in CI and
    `maintainer` outside CI.
  - GitHub Actions exposes this lane as the manual maintainer workflow
    `NPM Telegram Beta E2E`. It does not run on merge. The workflow uses the
    `qa-live-shared` environment and Convex CI credential leases.
@@ -354,11 +344,11 @@ gh workflow run package-acceptance.yml --ref main \
    want artifacts without a failing exit code.
  - Requires two distinct bots in the same private group, with the SUT bot exposing a Telegram username.
  - For stable bot-to-bot observation, enable Bot-to-Bot Communication Mode in `@BotFather` for both bots and ensure the driver bot can observe group bot traffic.
-  - Writes a Telegram QA report, summary, and `qa-evidence.json` under `.artifacts/qa-e2e/...`. Replying scenarios include RTT from driver send request to observed SUT reply.
+  - Writes a Telegram QA report, summary, and observed-messages artifact under `.artifacts/qa-e2e/...`. Replying scenarios include RTT from driver send request to observed SUT reply.
 `Mantis Telegram Live` is the PR-evidence wrapper around this lane. It runs the
-candidate ref with Convex-leased Telegram credentials, renders the redacted QA
+candidate ref with Convex-leased Telegram credentials, renders the redacted
-report/evidence bundle in a Crabbox desktop browser, records MP4 evidence,
+observed-message transcript in a Crabbox desktop browser, records MP4 evidence,
 generates a motion-trimmed GIF, uploads the artifact bundle, and posts inline PR
 evidence through the Mantis GitHub App when `pr_number` is set. Maintainers can
 start it from the Actions UI through `Mantis Scenario` (`scenario_id:
--- a/docs/nodes/index.md
+++ b/docs/nodes/index.md
@@ -214,59 +214,6 @@ permission boundary. Dangerous plugin node commands still require explicit
 After a node changes its declared command list, reject the old device pairing
 and approve the new request so the gateway stores the updated command snapshot.
 ## Config (`openclaw.json`)
 Node-related settings live under `gateway.nodes` and `tools.exec`:
 ```json5
 {
  gateway: {
    nodes: {
      // Auto-approve first-time node pairing from trusted networks (CIDR list).
      // Disabled when unset. Only applies to first-time role:node requests
      // with no requested scopes; does not auto-approve upgrades.
      pairing: {
        autoApproveCidrs: ["192.168.1.0/24"],
      },
      // Opt into dangerous/privacy-heavy node commands (camera.snap, etc.).
      allowCommands: ["camera.snap", "screen.record"],
      // Block exact command names even if defaults or allowCommands include them.
      denyCommands: ["camera.clip"],
    },
  },
  tools: {
    exec: {
      // Default exec host: "node" routes all exec calls to a paired node.
      host: "node",
      // Security mode for node exec: allow only approved/allowlisted commands.
      security: "allowlist",
      // Pin exec to a specific node (id or name). Omit to allow any node.
      node: "build-node",
    },
  },
 }
 ```
 Use exact node command names. `denyCommands` removes a command even when a
 platform default or `allowCommands` entry would otherwise allow it. See
 [Gateway configuration reference](/gateway/configuration-reference#gateway-field-details)
 for gateway node pairing and command-policy field details.
 Per-agent exec node override:
 ```json5
 {
  agents: {
    list: [
      {
        id: "main",
        tools: { exec: { node: "build-node" } },
      },
    ],
  },
 }
 ```
 ## Screenshots (canvas snapshots)
 If the node is showing the Canvas (WebView), `canvas.snapshot` returns `{ format, base64 }`.
--- a/docs/plugins/cli-backend-plugins.md
+++ b/docs/plugins/cli-backend-plugins.md
@@ -197,30 +197,22 @@ only for behavior that really belongs to the backend.
 `CliBackendPlugin` can also define:
-| Hook                               | Use                                                                         |
+| Hook                               | Use                                                    |
-| ---------------------------------- | --------------------------------------------------------------------------- |
+| ---------------------------------- | ------------------------------------------------------ |
-| `normalizeConfig(config, context)` | Rewrite legacy user config after merge                                      |
+| `normalizeConfig(config, context)` | Rewrite legacy user config after merge                 |
-| `resolveExecutionArgs(ctx)`        | Add request-scoped flags such as thinking effort or side-question isolation |
+| `resolveExecutionArgs(ctx)`        | Add request-scoped flags such as thinking effort       |
-| `prepareExecution(ctx)`            | Create temporary auth or config bridges before launch                       |
+| `prepareExecution(ctx)`            | Create temporary auth or config bridges before launch  |
-| `transformSystemPrompt(ctx)`       | Apply a final CLI-specific system prompt transform                          |
+| `transformSystemPrompt(ctx)`       | Apply a final CLI-specific system prompt transform     |
-| `textTransforms`                   | Bidirectional prompt/output replacements                                    |
+| `textTransforms`                   | Bidirectional prompt/output replacements               |
-| `defaultAuthProfileId`             | Prefer a specific OpenClaw auth profile                                     |
+| `defaultAuthProfileId`             | Prefer a specific OpenClaw auth profile                |
-| `authEpochMode`                    | Decide how auth changes invalidate stored CLI sessions                      |
+| `authEpochMode`                    | Decide how auth changes invalidate stored CLI sessions |
-| `nativeToolMode`                   | Declare whether the CLI has always-on native tools                          |
+| `nativeToolMode`                   | Declare whether the CLI has always-on native tools     |
-| `sideQuestionToolMode`             | Declare disabled native tools for `/btw` side questions                     |
+| `bundleMcp` / `bundleMcpMode`      | Opt into OpenClaw's loopback MCP tool bridge           |
-| `bundleMcp` / `bundleMcpMode`      | Opt into OpenClaw's loopback MCP tool bridge                                |
+| `ownsNativeCompaction`             | Backend owns its own compaction - OpenClaw defers      |
 | `ownsNativeCompaction`             | Backend owns its own compaction - OpenClaw defers                           |
 Keep these hooks provider-owned. Do not add CLI-specific branches to core when a
 backend hook can express the behavior.
 `ctx.executionMode` is `"agent"` for normal turns and `"side-question"` for
 ephemeral `/btw` calls. Use it when the CLI needs different one-shot flags, such
 as disabling native tools, session persistence, or resume behavior for BTW. If a
 backend normally has `nativeToolMode: "always-on"` but its side-question argv
 reliably disables those tools, also set `sideQuestionToolMode: "disabled"`;
 otherwise OpenClaw fails closed when BTW requires a no-tools CLI run.
 ### `ownsNativeCompaction`: opting out of OpenClaw compaction
 If your backend runs an agent that compacts its **own** transcript, set
--- a/docs/plugins/codex-harness-reference.md
+++ b/docs/plugins/codex-harness-reference.md
@@ -313,13 +313,9 @@ available timeout in this order:
 - For `image_generate` without a configured timeout, the 120 second
  image-generation default.
 - For the media-understanding `image` tool, `tools.media.image.timeoutSeconds`
-  converted to milliseconds, or the 60 second media default. For image
+  converted to milliseconds, or the 60 second media default.
  understanding, this applies to the request itself and is not reduced by
  earlier preparation work.
 - The 90 second dynamic-tool default.
 This watchdog is the outer dynamic `item/tool/call` budget. Provider-specific
 request timeouts run inside that call and keep their own timeout semantics.
 Dynamic tool budgets are capped at 600000 ms. On timeout, OpenClaw aborts the
 tool signal where supported and returns a failed dynamic-tool response to Codex
 so the turn can continue instead of leaving the session in `processing`.
--- a/docs/plugins/codex-harness.md
+++ b/docs/plugins/codex-harness.md
@@ -143,39 +143,12 @@ The native Codex app-server harness supports context engines that require
 pre-prompt assembly. Generic CLI backends, including `codex-cli`, do not provide
 that host capability.
 Codex thread bindings live in OpenClaw's SQLite plugin state and use the stable
 agent-scoped OpenClaw session key, or an opaque conversation-binding id, as
 their owner. Physical session ids fence delayed cleanup but may rotate without
 losing the Codex thread. Context-engine compaction adopts the successor id
 before continuing native Codex compaction. The bounded store rejects a new
 binding at its safety limit instead of evicting an existing thread's continuity
 record.
 Conversation binds create or resume their Codex thread on the first bound
 message after channel approval; an abandoned approval consumes no thread row.
 That first message carries the prepared thread directly into its turn.
 Subsequent messages use a metadata-only resume to subscribe the shared client,
 then unsubscribe after the turn completes.
 The runtime does not poll transcript-adjacent binding files. Upgrades from
 releases that used `*.jsonl.codex-app-server.json` sidecars migrate them during
 normal startup preflight. `openclaw doctor --fix` can run the same migration
 manually.
 Successfully matched sidecars are archived before the new runtime resumes their
 threads. Migration imports durable thread ownership only; it does not infer
 Codex context usage from OpenClaw counters or crawl Codex rollout files. For
 agent-session harness bindings, the next resume attempts to restore a cached
 native snapshot when Codex has one, and ongoing turns persist the current-context
 usage reported by app-server notifications, not the cumulative thread lifetime
 total. Conversation bindings
 keep metadata-only resumes and leave continuity and compaction with the native
 Codex thread. Conflicting or ambiguous sidecars stay in place with a warning for
 operator review.
 For Codex-backed agents, `/compact` starts native Codex app-server compaction on
-the bound thread. OpenClaw bounds the request-acceptance RPC but does not wait
+the bound thread. OpenClaw does not wait for completion, impose an OpenClaw
-for compaction completion, restart the shared app-server, or fall back to a
+timeout, restart the shared app-server, or fall back to a context-engine or
-context-engine or public OpenAI summarizer. If the native Codex thread binding
+public OpenAI summarizer. If the native Codex thread binding is missing or
-is missing or stale, the command fails closed so the operator sees the real
+stale, the command fails closed so the operator sees the real runtime boundary
-runtime boundary instead of silently switching compaction backends.
+instead of silently switching compaction backends.
 ```json5
 {
@@ -584,14 +557,10 @@ or shortens that specific tool budget. The `image_generate` tool uses
 `agents.defaults.imageGenerationModel.timeoutMs` when the tool call does not
 provide its own timeout, or a 120 second image-generation default otherwise.
 The media-understanding `image` tool uses
-`tools.media.image.timeoutSeconds` or its 60 second media default. For image
+`tools.media.image.timeoutSeconds` or its 60 second media default. Dynamic tool
-understanding, that timeout applies to the request itself and is not
+budgets are capped at 600000 ms. On timeout, OpenClaw aborts the tool signal
 reduced by earlier preparation work. Dynamic tool budgets are
 capped at 600000 ms. On timeout, OpenClaw aborts the tool signal
 where supported and returns a failed dynamic-tool response to Codex so the turn
 can continue instead of leaving the session in `processing`.
 This watchdog is the outer dynamic `item/tool/call` budget; provider-specific
 request timeouts run inside that call and keep their own timeout semantics.
 After Codex accepts a turn, and after OpenClaw responds to a turn-scoped
 app-server request, the harness expects Codex to make current-turn progress and
--- a/docs/plugins/hooks.md
+++ b/docs/plugins/hooks.md
@@ -152,8 +152,7 @@ observation-only.
 - `gateway_start` / `gateway_stop` - start or stop plugin-owned services with the Gateway
 - `deactivate` - deprecated compatibility alias for `gateway_stop`; use `gateway_stop` in new plugins
 - `cron_changed` - observe gateway-owned cron lifecycle changes (added, updated, removed, started, finished, scheduled)
- **`before_install`** - inspect staged skill or plugin install material from a loaded
+- **`before_install`** - inspect skill or plugin install context and optionally block
  plugin runtime
 ## Debug runtime hooks
@@ -463,19 +462,11 @@ Decision rules:
 ## Install hooks
-Use `security.installPolicy` for operator-owned allow/block decisions. That
+`before_install` runs after the operator-owned `security.installPolicy` check
-policy runs from OpenClaw config, covers CLI install and update paths, and fails
+when one is configured. The `builtinScan` field remains in the event payload for
-closed when enabled but unavailable.
+compatibility, but OpenClaw no longer runs built-in install-time dangerous-code
-
+blocking, so it is an empty `ok` result. Return additional findings or
-`before_install` is a plugin-runtime lifecycle hook. It runs after
+`{ block: true, blockReason }` to stop the install.
 `security.installPolicy` only in the OpenClaw process where plugin hooks have
 already been loaded, such as Gateway-backed install flows. It is useful for
 plugin-owned observations, warnings, and compatibility checks, but it is not the
 primary enterprise or host security boundary for installs. The `builtinScan`
 field remains in the event payload for compatibility, but OpenClaw no longer
 runs built-in install-time dangerous-code blocking, so it is an empty `ok`
 result. Return additional findings or `{ block: true, blockReason }` to stop the
 install in that process.
 `block: true` is terminal. `block: false` is treated as no decision.
 Handler failures block the install fail-closed.
--- a/docs/plugins/sdk-overview.md
+++ b/docs/plugins/sdk-overview.md
@@ -378,10 +378,7 @@ AI CLI backend such as `claude-cli` or `my-cli`.
  (for example normalizing old flag shapes).
 - Use `resolveExecutionArgs` for request-scoped argv rewrites that belong to
  the CLI dialect, such as mapping OpenClaw thinking levels to a native effort
-  flag. The hook receives `ctx.executionMode`; use `"side-question"` to add
+  flag.
  backend-native isolation flags for ephemeral `/btw` calls. If those flags
  reliably disable native tools for an otherwise always-on CLI, declare
  `sideQuestionToolMode: "disabled"` too.
 For an end-to-end authoring guide, see
 [CLI backend plugins](/plugins/cli-backend-plugins).
@@ -431,10 +428,6 @@ semantics.
 ### Hook decision semantics
 `before_install` is a plugin-runtime lifecycle hook, not the operator install
 policy surface. Use `security.installPolicy` when an allow/block decision must
 cover CLI and Gateway-backed install or update paths.
 - `before_tool_call`: returning `{ block: true }` is terminal. Once any handler sets it, lower-priority handlers are skipped.
 - `before_tool_call`: returning `{ block: false }` is treated as no decision (same as omitting `block`), not as an override.
 - `before_install`: returning `{ block: true }` is terminal. Once any handler sets it, lower-priority handlers are skipped.
--- a/docs/plugins/sdk-provider-plugins.md
+++ b/docs/plugins/sdk-provider-plugins.md
@@ -515,7 +515,6 @@ API key auth, and dynamic model resolution.
      - `openclaw/plugin-sdk/provider-model-shared` - `ProviderReplayFamily`, `buildProviderReplayFamilyHooks(...)`, and the raw replay builders (`buildOpenAICompatibleReplayPolicy`, `buildAnthropicReplayPolicyForModel`, `buildGoogleGeminiReplayPolicy`, `buildHybridAnthropicOrOpenAIReplayPolicy`). Also exports Gemini replay helpers (`sanitizeGoogleGeminiReplayHistory`, `resolveTaggedReasoningOutputMode`) and endpoint/model helpers (`resolveProviderEndpoint`, `normalizeProviderId`, `normalizeGooglePreviewModelId`).
      - `openclaw/plugin-sdk/provider-stream` - `ProviderStreamFamily`, `buildProviderStreamFamilyHooks(...)`, `composeProviderStreamWrappers(...)`, plus the shared OpenAI/Codex wrappers (`createOpenAIAttributionHeadersWrapper`, `createOpenAIFastModeWrapper`, `createOpenAIServiceTierWrapper`, `createOpenAIResponsesContextManagementWrapper`, `createCodexNativeWebSearchWrapper`), DeepSeek V4 OpenAI-compatible wrapper (`createDeepSeekV4OpenAICompatibleThinkingWrapper`), Anthropic Messages thinking prefill cleanup (`createAnthropicThinkingPrefillPayloadWrapper`), plain-text tool-call compat (`createPlainTextToolCallCompatWrapper`), and shared proxy/provider wrappers (`createOpenRouterWrapper`, `createToolStreamWrapper`, `createMinimaxFastModeWrapper`).
      - `openclaw/plugin-sdk/provider-stream-shared` - lightweight payload and event wrappers for hot provider paths, including `createOpenAICompatibleCompletionsThinkingOffWrapper`, `createPayloadPatchStreamWrapper`, and `createPlainTextToolCallCompatWrapper`.
      - `openclaw/plugin-sdk/provider-tools` - `ProviderToolCompatFamily`, `buildProviderToolCompatFamilyHooks("deepseek" | "gemini" | "openai")`, and underlying provider schema helpers.
      For Gemini-family providers, keep the reasoning-output mode aligned with
--- a/docs/plugins/sdk-subpaths.md
+++ b/docs/plugins/sdk-subpaths.md
@@ -164,7 +164,7 @@ and pairing-path families.
    | `plugin-sdk/provider-tools` | `ProviderToolCompatFamily`, `buildProviderToolCompatFamilyHooks`, and DeepSeek/Gemini/OpenAI schema cleanup + diagnostics |
    | `plugin-sdk/provider-usage` | Provider usage snapshot types, shared usage fetch helpers, and provider fetchers such as `fetchClaudeUsage` |
    | `plugin-sdk/provider-stream` | `ProviderStreamFamily`, `buildProviderStreamFamilyHooks`, `composeProviderStreamWrappers`, stream wrapper types, plain-text tool-call compat, and shared Anthropic/Bedrock/DeepSeek V4/Google/Kilocode/Moonshot/OpenAI/OpenRouter/Z.A.I/MiniMax/Copilot wrapper helpers |
-    | `plugin-sdk/provider-stream-shared` | Public shared provider stream wrapper helpers including `composeProviderStreamWrappers`, `createOpenAICompatibleCompletionsThinkingOffWrapper`, `createPlainTextToolCallCompatWrapper`, `createPayloadPatchStreamWrapper`, `createToolStreamWrapper`, and Anthropic/DeepSeek/OpenAI-compatible stream utilities |
+    | `plugin-sdk/provider-stream-shared` | Public shared provider stream wrapper helpers including `composeProviderStreamWrappers`, `createPlainTextToolCallCompatWrapper`, `createPayloadPatchStreamWrapper`, `createToolStreamWrapper`, and Anthropic/DeepSeek/OpenAI-compatible stream utilities |
    | `plugin-sdk/provider-transport-runtime` | Native provider transport helpers such as guarded fetch, transport message transforms, and writable transport event streams |
    | `plugin-sdk/provider-onboard` | Onboarding config patch helpers |
    | `plugin-sdk/global-singleton` | Process-local singleton/map/cache helpers |
@@ -236,7 +236,6 @@ usage endpoint failed or returned no usable usage data.
    | `plugin-sdk/config-contracts` | Focused type-only config surface for plugin config shapes such as `OpenClawConfig` and channel/provider config types |
    | `plugin-sdk/plugin-config-runtime` | Runtime plugin-config lookup helpers such as `requireRuntimeConfig`, `resolvePluginConfigObject`, and `resolveLivePluginConfigObject` |
    | `plugin-sdk/config-mutation` | Transactional config mutation helpers such as `mutateConfigFile`, `replaceConfigFile`, and `logConfigUpdated` |
    | `plugin-sdk/message-tool-delivery-hints` | Shared message-tool delivery metadata hint strings |
    | `plugin-sdk/runtime-config-snapshot` | Current process config snapshot helpers such as `getRuntimeConfig`, `getRuntimeConfigSnapshot`, and test snapshot setters |
    | `plugin-sdk/telegram-command-config` | Telegram command-name/description normalization and duplicate/conflict checks, even when the bundled Telegram contract surface is unavailable |
    | `plugin-sdk/text-autolink-runtime` | File-reference autolink detection without the broad text barrel |
--- a/docs/providers/zai.md
+++ b/docs/providers/zai.md
@@ -19,7 +19,7 @@ OpenClaw uses the `zai` provider with a Z.AI API key.
 ## GLM models
 GLM is a model family, not a separate provider. In OpenClaw, GLM models use
-refs such as `zai/glm-5.2`: provider `zai`, model id `glm-5.2`.
+refs such as `zai/glm-5.1`: provider `zai`, model id `glm-5.1`.
 ## Getting started
@@ -85,12 +85,12 @@ you want to force a specific Coding Plan or general API surface.
  models: {
    providers: {
      zai: {
-        // GLM-5.2 uses the Coding Plan endpoint.
+        // Example value. Onboarding writes the matching baseUrl for your endpoint.
-        baseUrl: "https://api.z.ai/api/coding/paas/v4",
+        baseUrl: "https://api.z.ai/api/paas/v4",
      },
    },
  },
-  agents: { defaults: { model: { primary: "zai/glm-5.2" } } },
+  agents: { defaults: { model: { primary: "zai/glm-5.1" } } },
 }
 ```
@@ -105,31 +105,28 @@ openclaw models list --all --provider zai
 The manifest-backed catalog currently includes:
-| Model ref            | Notes                           |
+| Model ref            | Notes         |
-| -------------------- | ------------------------------- |
+| -------------------- | ------------- |
-| `zai/glm-5.2`        | Coding Plan default; 1M context |
+| `zai/glm-5.1`        | Default model |
-| `zai/glm-5.1`        | General API default             |
+| `zai/glm-5`          |               |
-| `zai/glm-5`          |                                 |
+| `zai/glm-5-turbo`    |               |
-| `zai/glm-5-turbo`    |                                 |
+| `zai/glm-5v-turbo`   |               |
-| `zai/glm-5v-turbo`   |                                 |
+| `zai/glm-4.7`        |               |
-| `zai/glm-4.7`        |                                 |
+| `zai/glm-4.7-flash`  |               |
-| `zai/glm-4.7-flash`  |                                 |
+| `zai/glm-4.7-flashx` |               |
-| `zai/glm-4.7-flashx` |                                 |
+| `zai/glm-4.6`        |               |
-| `zai/glm-4.6`        |                                 |
+| `zai/glm-4.6v`       |               |
-| `zai/glm-4.6v`       |                                 |
+| `zai/glm-4.5`        |               |
-| `zai/glm-4.5`        |                                 |
+| `zai/glm-4.5-air`    |               |
-| `zai/glm-4.5-air`    |                                 |
+| `zai/glm-4.5-flash`  |               |
-| `zai/glm-4.5-flash`  |                                 |
+| `zai/glm-4.5v`       |               |
 | `zai/glm-4.5v`       |                                 |
 <Tip>
 GLM models are available as `zai/<model>` (example: `zai/glm-5`).
 </Tip>
 <Note>
-Coding Plan setup defaults to `zai/glm-5.2`; general API setup keeps
+The default bundled model ref is `zai/glm-5.1`. GLM versions and availability
 `zai/glm-5.1`. Endpoint auto-detection falls back to `glm-5.1` or `glm-4.7`
 when the selected plan does not expose GLM-5.2. GLM versions and availability
 can change; run `openclaw models list --all --provider zai` to see the catalog
 known to your installed version.
 </Note>
@@ -176,7 +173,7 @@ known to your installed version.
      agents: {
        defaults: {
          models: {
-            "zai/glm-5.2": {
+            "zai/glm-5.1": {
              params: { preserveThinking: true },
            },
          },
--- a/docs/reference/RELEASING.md
+++ b/docs/reference/RELEASING.md
@@ -99,14 +99,10 @@ the maintainer-only release runbook.
   file, lane, workflow job, package profile, provider, or model allowlist that
   proves the fix. Rerun the full umbrella only when the changed surface makes
   prior evidence stale.
-9. For a tagged beta candidate, run
+9. For beta, tag `vYYYY.M.PATCH-beta.N`, then run `pnpm release:candidate -- --tag
-   `pnpm release:candidate -- --tag vYYYY.M.PATCH-beta.N` from the matching
+vYYYY.M.PATCH-beta.N` from the matching `release/YYYY.M.PATCH` branch. The helper runs
-   `release/YYYY.M.PATCH` branch. For stable, pass the required Windows source
+   the local generated-release checks, dispatches or verifies the full release
-   release too:
+   validation and npm preflight evidence, runs Parallels and Telegram package
   `pnpm release:candidate -- --tag vYYYY.M.PATCH --windows-node-tag vX.Y.Z`.
   The helper runs the local generated-release checks, dispatches or verifies
   the full release validation and npm preflight evidence, runs Parallels
   fresh/update proof against the exact prepared tarball plus Telegram package
   proof, records plugin npm and ClawHub plans, and prints the exact
   `OpenClaw Release Publish` command only after the evidence bundle is green.
   `OpenClaw Release Publish` dispatches the selected or all-publishable plugin
@@ -146,12 +142,9 @@ the maintainer-only release runbook.
    direct push, it opens or updates an appcast PR. Stable Windows Hub
    readiness requires the signed `OpenClawCompanion-Setup-x64.exe`,
    `OpenClawCompanion-Setup-arm64.exe`, and
-    `OpenClawCompanion-SHA256SUMS.txt` assets on the OpenClaw GitHub release.
+    `OpenClawCompanion-SHA256SUMS.txt` assets on the OpenClaw GitHub release;
-    Pass the exact signed `openclaw/openclaw-windows-node` release tag as
+    promote them with the `Windows Node Release` workflow after the matching
-    `windows_node_tag` and its candidate-approved installer digest map as
+    `openclaw/openclaw-windows-node` release has passed its signing workflow.
    `windows_node_installer_digests`; `OpenClaw Release Publish` keeps the
    release draft, dispatches `Windows Node Release`, and verifies all three
    assets before publication.
 11. After publish, run the npm post-publish verifier, optional standalone
    published-npm Telegram E2E when you need post-publish channel proof,
    dist-tag promotion when needed, verify the generated GitHub release page,
@@ -260,36 +253,21 @@ the maintainer-only release runbook.
  to the GitHub release as `openclaw-<version>-dependency-evidence.zip`.
 - Run `OpenClaw Release Publish` for the mutating publish sequence after the
  tag exists. Dispatch it from `release/YYYY.M.PATCH` (or `main` when publishing a
-  main-reachable tag), pass the release tag, successful OpenClaw npm
+  main-reachable tag), pass the release tag and successful OpenClaw npm
-  `preflight_run_id`, and successful `full_release_validation_run_id`, and keep
+  `preflight_run_id`, and keep the default plugin publish scope
-  the default plugin publish scope `all-publishable` unless you are deliberately
+  `all-publishable` unless you are deliberately running a focused repair. The
-  running a focused repair. The workflow serializes plugin npm publish, plugin
+  workflow serializes plugin npm publish, plugin ClawHub publish, and OpenClaw
-  ClawHub publish, and OpenClaw npm publish so the core package is not published
+  npm publish so the core package is not published before its externalized
-  before its externalized plugins.
+  plugins.
- Stable `OpenClaw Release Publish` requires an exact `windows_node_tag` after
+- Run the manual `Windows Node Release` workflow for stable releases after the
-  the matching non-prerelease `openclaw/openclaw-windows-node` release exists.
+  matching `openclaw/openclaw-windows-node` release exists. It downloads the
-  It also requires the candidate-approved `windows_node_installer_digests` map.
+  signed Windows Hub installers from the companion repo, verifies their
-  Before dispatching any publish child, it verifies that source release is
+  Authenticode signatures on a Windows runner, writes a SHA-256 manifest, and
-  published, non-prerelease, contains the required x64/ARM64 installers, and
+  uploads the installers plus manifest onto the canonical OpenClaw GitHub
-  still matches that approved map. It then dispatches `Windows Node Release`
+  release. Website download links should target exact OpenClaw release asset
-  while the OpenClaw release is still a draft, carrying the pinned installer
+  URLs for the current stable release, or `releases/latest/download/...` only
-  digest map unchanged. The child
+  after verifying GitHub's latest redirect points at that same release; do not
-  workflow downloads the signed Windows Hub installers from that exact tag,
+  link only to the companion repo release page.
  matches them against the pinned digests, verifies their Authenticode
  signatures use the expected OpenClaw Foundation signer on a Windows runner,
  writes a SHA-256 manifest, and uploads the installers plus manifest onto the
  canonical OpenClaw GitHub release, then re-downloads the promoted assets and
  verifies the manifest membership and hashes. The parent verifies the current
  x64, ARM64, and checksum asset contract before publication. Direct recovery
  rejects unexpected `OpenClawCompanion-*` asset names before replacing the
  expected contract assets with the pinned source bytes. Manually dispatch
  `Windows Node Release` only for recovery, and always pass an exact tag, never
  `latest`, plus the explicit `expected_installer_digests` JSON map from the
  approved source release. Website download links should target exact OpenClaw
  release asset URLs for the current stable release, or
  `releases/latest/download/...` only after verifying GitHub's latest redirect
  points at that same release; do not link only to the companion repo release
  page.
 - Release checks now run in a separate manual workflow:
  `OpenClaw Release Checks`
 - `OpenClaw Release Checks` also runs the QA Lab mock parity lane plus the fast
@@ -719,12 +697,7 @@ orchestrates the trusted-publisher workflows in the order the release needs:
   `ref=<release-sha>`.
 5. Dispatch `Plugin ClawHub Release` with the same scope and SHA.
 6. Dispatch `OpenClaw NPM Release` with the release tag, npm dist-tag, and
-   saved `preflight_run_id` after verifying the saved
+   saved `preflight_run_id`.
   `full_release_validation_run_id`.
 7. For stable releases, create or update the GitHub release as a draft, dispatch
   `Windows Node Release` with the explicit `windows_node_tag` and
   candidate-approved `windows_node_installer_digests`, and verify the canonical
   installer/checksum assets before publishing the draft.
 Beta publish example:
@@ -733,7 +706,6 @@ gh workflow run openclaw-release-publish.yml \
  --ref release/YYYY.M.PATCH \
  -f tag=vYYYY.M.PATCH-beta.N \
  -f preflight_run_id=<successful-openclaw-npm-preflight-run-id> \
  -f full_release_validation_run_id=<successful-full-release-validation-run-id> \
  -f npm_dist_tag=beta
 ```
@@ -743,10 +715,7 @@ Stable publish to the default beta dist-tag:
 gh workflow run openclaw-release-publish.yml \
  --ref release/YYYY.M.PATCH \
  -f tag=vYYYY.M.PATCH \
  -f windows_node_tag=vX.Y.Z \
  -f windows_node_installer_digests='{"OpenClawCompanion-Setup-x64.exe":"sha256:<approved-x64-sha256>","OpenClawCompanion-Setup-arm64.exe":"sha256:<approved-arm64-sha256>"}' \
  -f preflight_run_id=<successful-openclaw-npm-preflight-run-id> \
  -f full_release_validation_run_id=<successful-full-release-validation-run-id> \
  -f npm_dist_tag=beta
 ```
@@ -756,10 +725,7 @@ Stable promotion directly to `latest` is explicit:
 gh workflow run openclaw-release-publish.yml \
  --ref release/YYYY.M.PATCH \
  -f tag=vYYYY.M.PATCH \
  -f windows_node_tag=vX.Y.Z \
  -f windows_node_installer_digests='{"OpenClawCompanion-Setup-x64.exe":"sha256:<approved-x64-sha256>","OpenClawCompanion-Setup-arm64.exe":"sha256:<approved-arm64-sha256>"}' \
  -f preflight_run_id=<successful-openclaw-npm-preflight-run-id> \
  -f full_release_validation_run_id=<successful-full-release-validation-run-id> \
  -f npm_dist_tag=latest
 ```
@@ -789,13 +755,6 @@ package cannot ship without every publishable official plugin, including
 - `tag`: required release tag; must already exist
 - `preflight_run_id`: successful `OpenClaw NPM Release` preflight run id;
  required when `publish_openclaw_npm=true`
 - `full_release_validation_run_id`: successful `Full Release Validation` run
  id; required when `publish_openclaw_npm=true`
 - `windows_node_tag`: exact non-prerelease `openclaw/openclaw-windows-node`
  release tag; required for stable OpenClaw publish
 - `windows_node_installer_digests`: candidate-approved compact JSON map of the
  current Windows installer names to their pinned `sha256:` digests; required
  for stable OpenClaw publish
 - `npm_dist_tag`: npm target tag for the OpenClaw package
 - `plugin_publish_scope`: defaults to `all-publishable`; use `selected` only
  for focused plugin-only repair work with `publish_openclaw_npm=false`
@@ -841,21 +800,14 @@ When cutting a stable npm release:
   Matrix, and Telegram coverage from one manual workflow
 4. If you intentionally only need the deterministic normal test graph, run the
   manual `CI` workflow on the release ref instead
-5. Select the exact non-prerelease `openclaw/openclaw-windows-node` release tag
+5. Save the successful `preflight_run_id`
-   whose signed x64 and ARM64 installers should ship. Save it as
+6. Run `OpenClaw Release Publish` with the same `tag`, the same `npm_dist_tag`,
-   `windows_node_tag`, and save their validated digest map as
+   and the saved `preflight_run_id`; it publishes externalized plugins to npm
-   `windows_node_installer_digests`. The release-candidate helper records both
+   and ClawHub before promoting the OpenClaw npm package
-   and includes them in its generated publish command.
+7. If the release landed on `beta`, use the
 6. Save the successful `preflight_run_id` and `full_release_validation_run_id`
 7. Run `OpenClaw Release Publish` with the same `tag`, the same `npm_dist_tag`,
   the selected `windows_node_tag`, its saved `windows_node_installer_digests`,
   the saved `preflight_run_id`, and the saved `full_release_validation_run_id`;
   it publishes externalized plugins to npm and ClawHub before promoting the
   OpenClaw npm package
 8. If the release landed on `beta`, use the
   `openclaw/releases/.github/workflows/openclaw-npm-dist-tags.yml`
   workflow to promote that stable version from `beta` to `latest`
-9. If the release intentionally published directly to `latest` and `beta`
+8. If the release intentionally published directly to `latest` and `beta`
   should follow the same stable build immediately, use that same release
   workflow to point both dist-tags at the stable version, or let its scheduled
   self-healing sync move `beta` later
--- a/docs/reference/transcript-hygiene.md
+++ b/docs/reference/transcript-hygiene.md
@@ -20,7 +20,6 @@ Scope includes:
 - Thinking signature cleanup
 - Image payload sanitization
 - Blank text-block cleanup before provider replay
 - Incomplete reasoning-only length-turn cleanup before provider replay
 - User-input provenance tagging (for inter-session routed prompts)
 - Empty assistant error-turn repair for Bedrock Converse replay
@@ -92,21 +91,6 @@ Implementation:
 ---
 ## Global rule: incomplete reasoning-only turns
 Assistant turns that hit the provider output limit with only thinking or
 redacted-thinking content are omitted from the in-memory replay copy. Such turns
 contain incomplete provider state and may carry a partial thinking signature.
 Empty length turns remain unchanged, as do length turns with visible text, tool
 calls, or unknown content blocks. Stored transcripts are not rewritten.
 Implementation:
 - `normalizeAssistantReplayContent` in `src/agents/embedded-agent-runner/replay-history.ts`
 ---
 ## Global rule: inter-session input provenance
 When an agent sends a prompt into another session via `sessions_send` (including
--- a/docs/tools/acp-agents.md
+++ b/docs/tools/acp-agents.md
@@ -336,7 +336,6 @@ top-level `bindings[]` entries.
 - **Discord channel/thread:** `match.channel="discord"` + `match.peer.id="<channelOrThreadId>"`
 - **Slack channel/DM:** `match.channel="slack"` + `match.peer.id="<channelId|channel:<channelId>|#<channelId>|userId|user:<userId>|slack:<userId>|<@userId>>"`. Prefer stable Slack ids; channel bindings also match replies inside that channel's threads.
 - **Telegram forum topic:** `match.channel="telegram"` + `match.peer.id="<chatId>:topic:<topicId>"`
 - **WhatsApp DM/group:** `match.channel="whatsapp"` + `match.peer.id="<E.164|group JID>"`. Use E.164 numbers such as `+15555550123` for direct chats and WhatsApp group JIDs such as `120363424282127706@g.us` for groups.
 - **iMessage DM/group:** `match.channel="imessage"` + `match.peer.id="<handle|chat_id:*|chat_guid:*|chat_identifier:*>"`. Prefer `chat_id:*` for stable group bindings.
 </ParamField>
@@ -454,9 +453,8 @@ Use `agents.list[].runtime` to define ACP defaults once per agent:
 ### Behavior
- OpenClaw ensures the configured ACP session exists after channel-specific admission and before use.
+- OpenClaw ensures the configured ACP session exists before use.
- Messages in that channel, topic, or chat route to the configured ACP session.
+- Messages in that channel or topic route to the configured ACP session.
 - Configured ACP bindings own their session route. Channel broadcast fan-out does not replace the configured ACP session for a matched binding.
 - In bound conversations, `/new` and `/reset` reset the same ACP session key in place.
 - Temporary runtime bindings (for example created by thread-focus flows) still apply where present.
 - For cross-agent ACP spawns without an explicit `cwd`, OpenClaw inherits the target agent workspace from agent config.
--- a/docs/tools/browser-control.md
+++ b/docs/tools/browser-control.md
@@ -13,12 +13,7 @@ CLI, and scripting patterns (snapshots, refs, waits, debug flows).
 ## Control API (optional)
-For local integrations only, the Gateway exposes a small loopback HTTP API.
+For local integrations only, the Gateway exposes a small loopback HTTP API:
 This standalone server is opt-in — set the environment variable
 `OPENCLAW_EAGER_BROWSER_CONTROL_SERVER=1` in the gateway service environment
 and restart the gateway before the HTTP endpoints become available. Without
 this variable the browser control runtime still works through the CLI and
 agent tools, but nothing listens on the loopback control port.
 - Status/start/stop: `GET /`, `POST /start`, `POST /stop`
 - Tabs: `GET /tabs`, `POST /tabs/open`, `POST /tabs/focus`, `DELETE /tabs/:targetId`
@@ -263,14 +258,7 @@ Snapshot flags at a glance:
 - `--format aria`: accessibility tree with `axN` refs. When Playwright is available, OpenClaw binds refs with backend DOM ids to the live page so follow-up actions can use them; otherwise treat the output as inspection-only.
 - `--efficient` (or `--mode efficient`): compact role snapshot preset. Set `browser.snapshotDefaults.mode: "efficient"` to make this the default (see [Gateway configuration](/gateway/configuration-reference#browser)).
 - `--interactive`, `--compact`, `--depth`, `--selector` force a role snapshot with `ref=e12` refs. `--frame "<iframe>"` scopes role snapshots to an iframe.
- With Playwright, `--labels` adds a screenshot with overlayed ref labels
+- `--labels` adds a viewport-only screenshot with overlayed ref labels and prints the saved path.
  (prints `MEDIA:<path>`) plus an `annotations` array with each ref's bounding
  box. On `screenshot`, Playwright-backed labels work with `--full-page`,
  `--ref`, and `--element`; on `snapshot`, the accompanying screenshot remains
  viewport-only. Existing-session/chrome-mcp profiles render overlay labels on
  page screenshots but do not return `annotations` or use the Playwright
  full-page/ref/element projection helper. Without Playwright or chrome-mcp,
  labeled screenshots are not available.
 - `--urls` appends discovered link destinations to AI snapshots.
 ## Snapshots and refs
@@ -286,9 +274,7 @@ OpenClaw supports two "snapshot" styles:
  - Output: a role-based list/tree with `[ref=e12]` (and optional `[nth=1]`).
  - Actions: `openclaw browser click e12`, `openclaw browser highlight e12`.
  - Internally, the ref is resolved via `getByRole(...)` (plus `nth()` for duplicates).
-  - Add `--labels` to include a screenshot with overlayed `e12` labels. On
+  - Add `--labels` to include a viewport screenshot with overlayed `e12` labels.
    Playwright-backed profiles this also returns per-ref bounding-box metadata
    (`annotations[]`).
  - Add `--urls` when link text is ambiguous and the agent needs concrete
    navigation targets.
--- a/docs/tools/btw.md
+++ b/docs/tools/btw.md
@@ -42,14 +42,8 @@ app-server thread as an ephemeral side thread. That keeps Codex OAuth and native
 thread behavior intact while still isolating the side answer from the parent
 transcript. Like Codex `/side`, the side thread keeps the current Codex
 permissions and native tool surface, with guardrails that tell the model not to
-treat inherited parent-thread work as active instructions.
+treat inherited parent-thread work as active instructions. Non-Codex runtimes
-
+keep the older direct one-shot path.
 For CLI runtime aliases, BTW uses the owning CLI backend in side-question mode
 instead of falling back to a direct provider call. OpenClaw seeds sanitized
 conversation context into a fresh one-shot CLI invocation, disables OpenClaw MCP
 tool bundling and reusable CLI session state for that invocation, and lets the
 backend add any CLI-native no-resume or no-tools flags it supports. Direct
 non-CLI runtimes keep the direct one-shot path.
 ## What it does not do
--- a/docs/tools/plugin.md
+++ b/docs/tools/plugin.md
@@ -147,12 +147,10 @@ such as `@beta` stay pinned to the selected package and fail when incompatible.
 Configure `security.installPolicy` to run a trusted local policy command before
 plugin install or update proceeds. The policy receives metadata plus the staged
-source path and can allow or block the install. It covers CLI and Gateway-backed
+source path and can allow or block the install. It runs before plugin
-plugin install/update paths. Plugin `before_install` hooks run later only in
+`before_install` hooks. The deprecated `--dangerously-force-unsafe-install`
-OpenClaw processes where plugin hooks are loaded, so use `security.installPolicy`
+flag is accepted for compatibility but does not bypass install policy, hooks, or
-for operator-owned install decisions. The deprecated
+OpenClaw's built-in plugin dependency denylist.
 `--dangerously-force-unsafe-install` flag is accepted for compatibility but does
 not bypass install policy or OpenClaw's built-in plugin dependency denylist.
 See [Skills config](/tools/skills-config#operator-install-policy-securityinstallpolicy)
 for the shared `security.installPolicy` exec schema used by both skills and
--- a/docs/tools/tool-search.md
+++ b/docs/tools/tool-search.md
@@ -16,9 +16,9 @@ search or dynamic-tools surface. Codex-native code mode, tool search, deferred
 dynamic tools, and nested tool calls are stable Codex harness surfaces and do
 not depend on `tools.toolSearch`.
-When enabled for OpenClaw runs, the model receives one `tool_search_code` tool
+When enabled for OpenClaw runs, the model receives one `tool_search_code` tool by default.
-by default. That tool runs a short JavaScript body in an isolated Node
+That tool runs a short JavaScript body in an isolated Node subprocess with an
-subprocess with an `openclaw.tools` bridge:
+`openclaw.tools` bridge:
 ```js
 const hits = await openclaw.tools.search("create a GitHub issue");
@@ -49,8 +49,8 @@ run:
 3. List eligible MCP tools through the session MCP runtime.
 4. Add eligible client tools supplied for the current run.
 5. Index compact descriptors for search.
-6. Expose the OpenClaw code bridge, the structured fallback tools, or the
+6. Expose either the OpenClaw code bridge or the structured fallback tools to the
-   compact directory surface to the model.
+   model.
 At execution time every real tool call returns to OpenClaw. The isolated Node
 runtime does not hold plugin implementations, MCP client objects, or secrets.
@@ -59,26 +59,18 @@ normal policy, approval, hook, logging, and result handling still apply.
 ## Modes
-`tools.toolSearch` has three model-facing modes:
+`tools.toolSearch` has two model-facing modes:
 - `code`: exposes `tool_search_code`, the default compact JavaScript bridge.
 - `tools`: exposes `tool_search`, `tool_describe`, and `tool_call` as plain
  structured tools for providers that should not receive code.
 - `directory`: exposes `tool_search`, `tool_describe`, and `tool_call` plus a
  bounded prompt directory of available tool names and descriptions for
  providers that should see tool names without every full schema. OpenClaw can
  also expose a small bounded set of likely or required tool schemas directly
  for the current turn.
-All modes use the same policy-filtered catalog and normal OpenClaw execution
+Both modes use the same catalog and execution path. The only difference is the
-path. If the current runtime cannot launch the isolated Node code-mode child
+shape the model sees. If the current runtime cannot launch the isolated Node
-process, the default `code` mode falls back to `tools` before catalog
+code-mode child process, the default `code` mode falls back to `tools` before
-compaction. In `directory` mode, client-provided tools stay directly visible
+catalog compaction.
 for the current run while OpenClaw tools, plugin tools, and MCP tools can be
 compacted behind the directory catalog. A direct call to an exact hidden
 directory name is hydrated from that same authorized catalog before execution.
-All modes are experimental. Prefer direct tool exposure for small OpenClaw tool
+Both modes are experimental. Prefer direct tool exposure for small OpenClaw tool
 catalogs, and prefer the Codex-native stable surfaces for Codex harness runs.
 There is no separate source-selection config. When Tool Search is enabled, the
@@ -98,10 +90,7 @@ Tool Search changes the shape:
  contract
 - Tool Search tools mode: the model sees three compact structured fallback
  tools
- Tool Search directory mode: the model sees a bounded directory plus
+- during the turn: the model loads only the tool schemas it actually needs
  search/describe/call controls and a small bounded set of likely or required
  schemas
 - during the turn: the model can load remaining schemas as needed
 Direct tool exposure is still the right default for small catalogs. Tool Search
 is best when one run can see many tools, especially from MCP servers or
@@ -143,20 +132,6 @@ The structured fallback mode exposes the same operations as tools:
 - `tool_describe`
 - `tool_call`
 Directory mode exposes:
 - `tool_search`
 - `tool_describe`
 - `tool_call`
 It also keeps client-provided tools directly visible and may expose a small
 bounded set of likely or required catalog tool schemas directly for the current
 turn. If the bounded directory omits entries, use `tool_search` to find them. If
 the model requests an exact hidden directory tool name directly, OpenClaw
 hydrates it from the authorized catalog before normal execution.
 Directory-mode client tool names must not collide with OpenClaw, plugin, or MCP
 tool names because exact deferred dispatch uses those names.
 ## Runtime boundary
 The code bridge runs in a short-lived Node subprocess. The subprocess starts
@@ -211,18 +186,6 @@ Use the structured fallback tools instead for OpenClaw runs:
 }
 ```
 Use the compact directory surface instead for OpenClaw runs:
 ```json5
 {
  tools: {
    toolSearch: {
      mode: "directory",
    },
  },
 }
 ```
 Tune code-mode timeout and search result limits:
 ```json5
--- a/extensions/active-memory/index.test.ts
+++ b/extensions/active-memory/index.test.ts
--- a/extensions/active-memory/index.ts
+++ b/extensions/active-memory/index.ts
--- a/extensions/active-memory/openclaw.plugin.json
+++ b/extensions/active-memory/openclaw.plugin.json
@@ -123,12 +123,11 @@
      "help": "Optional explicit denylist of chat/user IDs. Sessions whose resolved conversation id matches the list are skipped even when the chat type is allowed. Applied after allowedChatIds."
    },
    "timeoutMs": {
-      "label": "Timeout (ms)",
+      "label": "Timeout (ms)"
      "help": "Recall work budget on the main lane. Before recall, the hook allows up to 1500 ms for session/config preflight. After recall starts, it reserves another fixed 1500 ms only for abort settlement and transcript recovery."
    },
    "setupGraceTimeoutMs": {
      "label": "Setup Grace Timeout (ms)",
-      "help": "Advanced: extra recall-work budget for cold embedded-run setup. Defaults to 0. The separate 1500 ms preflight cap and 1500 ms post-recall completion allowance still apply."
+      "help": "Advanced: extra blocking budget for cold embedded-run setup before the recall timeout is considered exhausted. Defaults to 0 so timeoutMs remains the main-lane hook budget unless you opt in."
    },
    "queryMode": {
      "label": "Query Mode",
--- a/extensions/anthropic/cli-backend.ts
+++ b/extensions/anthropic/cli-backend.ts
@@ -34,7 +34,6 @@ export function buildAnthropicCliBackend(): CliBackendPlugin {
    bundleMcp: true,
    bundleMcpMode: "claude-config-file",
    nativeToolMode: "always-on",
    sideQuestionToolMode: "disabled",
    ownsNativeCompaction: true,
    config: {
      command: "claude",
--- a/extensions/anthropic/cli-shared.test.ts
+++ b/extensions/anthropic/cli-shared.test.ts
@@ -150,61 +150,6 @@ describe("resolveClaudeCliExecutionArgs", () => {
      }),
    ).toEqual(["-p", "--effort", "max"]);
  });
  it("forces isolated no-tool one-shot args for side-question execution", () => {
    expect(
      resolveClaudeCliExecutionArgs({
        workspaceDir: "/tmp",
        provider: "claude-cli",
        modelId: "claude-opus-4-7",
        thinkingLevel: "max",
        useResume: true,
        executionMode: "side-question",
        baseArgs: [
          "-p",
          "--output-format",
          "stream-json",
          "--allowedTools=mcp__openclaw__*",
          "--allowedTools",
          "Read",
          "Grep",
          "--permission-mode",
          "bypassPermissions",
          "--session-id=abc",
          "--resume",
          "old-session",
          "--resume-session-at",
          "old-message",
          "--resume-session-at=old-message-equals",
          "--mcp-config",
          "/tmp/side-question-mcp.json",
          "--bare",
          "--safe-mode",
          "--strict-mcp-config",
          "--no-session-persistence",
          "--max-turns",
          "4",
          "--effort",
          "high",
        ],
      }),
    ).toEqual([
      "-p",
      "--output-format",
      "stream-json",
      "--safe-mode",
      "--tools",
      "",
      "--disallowedTools",
      "mcp__*",
      "--strict-mcp-config",
      "--no-session-persistence",
      "--max-turns",
      "1",
      "--permission-mode",
      "default",
    ]);
  });
 });
 describe("normalizeClaudeBackendConfig", () => {
--- a/extensions/anthropic/cli-shared.ts
+++ b/extensions/anthropic/cli-shared.ts
@@ -67,26 +67,8 @@ const CLAUDE_LEGACY_SKIP_PERMISSIONS_ARG = "--dangerously-skip-permissions";
 const CLAUDE_PERMISSION_MODE_ARG = "--permission-mode";
 const CLAUDE_SETTING_SOURCES_ARG = "--setting-sources";
 const CLAUDE_EFFORT_ARG = "--effort";
 const CLAUDE_BARE_ARG = "--bare";
 const CLAUDE_SAFE_MODE_ARG = "--safe-mode";
 const CLAUDE_TOOLS_ARG = "--tools";
 const CLAUDE_DISALLOWED_TOOLS_ARG = "--disallowedTools";
 const CLAUDE_MCP_CONFIG_ARG = "--mcp-config";
 const CLAUDE_STRICT_MCP_CONFIG_ARG = "--strict-mcp-config";
 const CLAUDE_NO_SESSION_PERSISTENCE_ARG = "--no-session-persistence";
 const CLAUDE_MAX_TURNS_ARG = "--max-turns";
 const CLAUDE_SESSION_ID_ARG = "--session-id";
 const CLAUDE_RESUME_ARG = "--resume";
 const CLAUDE_RESUME_SESSION_AT_ARG = "--resume-session-at";
 const CLAUDE_RESUME_SHORT_ARG = "-r";
 const CLAUDE_CONTINUE_ARG = "--continue";
 const CLAUDE_CONTINUE_SHORT_ARG = "-c";
 const CLAUDE_FORK_SESSION_ARG = "--fork-session";
 const CLAUDE_SAFE_SETTING_SOURCES = "user";
 const CLAUDE_BYPASS_PERMISSION_MODE = "bypassPermissions";
 const CLAUDE_DEFAULT_PERMISSION_MODE = "default";
 const CLAUDE_NO_TOOLS_VALUE = "";
 const CLAUDE_DENY_MCP_TOOLS_VALUE = "mcp__*";
 type ClaudeCliEffort = "low" | "medium" | "high" | "xhigh" | "max";
@@ -250,89 +232,10 @@ function stripClaudeEffortArgs(args: readonly string[]): string[] {
  return normalized;
 }
 const CLAUDE_SIDE_QUESTION_VARIADIC_VALUE_ARGS = new Set([
  "--allowedTools",
  "--allowed-tools",
  CLAUDE_DISALLOWED_TOOLS_ARG,
  "--disallowed-tools",
  CLAUDE_TOOLS_ARG,
  CLAUDE_MCP_CONFIG_ARG,
 ]);
 const CLAUDE_SIDE_QUESTION_VALUE_ARGS = new Set([
  CLAUDE_PERMISSION_MODE_ARG,
  CLAUDE_SESSION_ID_ARG,
  CLAUDE_RESUME_ARG,
  CLAUDE_RESUME_SESSION_AT_ARG,
  CLAUDE_RESUME_SHORT_ARG,
  CLAUDE_MAX_TURNS_ARG,
 ]);
 const CLAUDE_SIDE_QUESTION_BARE_ARGS = new Set([
  CLAUDE_CONTINUE_ARG,
  CLAUDE_CONTINUE_SHORT_ARG,
  CLAUDE_FORK_SESSION_ARG,
  CLAUDE_BARE_ARG,
  CLAUDE_SAFE_MODE_ARG,
  CLAUDE_STRICT_MCP_CONFIG_ARG,
  CLAUDE_NO_SESSION_PERSISTENCE_ARG,
 ]);
 function stripClaudeSideQuestionConflictingArgs(args: readonly string[]): string[] {
  const normalized: string[] = [];
  for (let i = 0; i < args.length; i += 1) {
    const arg = args[i] ?? "";
    const equalsIndex = arg.indexOf("=");
    const argName = equalsIndex > 0 ? arg.slice(0, equalsIndex) : arg;
    if (CLAUDE_SIDE_QUESTION_BARE_ARGS.has(argName)) {
      continue;
    }
    if (CLAUDE_SIDE_QUESTION_VARIADIC_VALUE_ARGS.has(argName)) {
      if (equalsIndex < 0) {
        while (typeof args[i + 1] === "string" && !args[i + 1]?.startsWith("-")) {
          i += 1;
        }
      }
      continue;
    }
    if (CLAUDE_SIDE_QUESTION_VALUE_ARGS.has(argName)) {
      if (equalsIndex < 0) {
        const maybeValue = args[i + 1];
        if (typeof maybeValue === "string" && !maybeValue.startsWith("-")) {
          i += 1;
        }
      }
      continue;
    }
    normalized.push(arg);
  }
  return normalized;
 }
 function resolveClaudeCliSideQuestionExecutionArgs(baseArgs: readonly string[]): string[] {
  return [
    ...stripClaudeSideQuestionConflictingArgs(stripClaudeEffortArgs(baseArgs)),
    CLAUDE_SAFE_MODE_ARG,
    CLAUDE_TOOLS_ARG,
    CLAUDE_NO_TOOLS_VALUE,
    CLAUDE_DISALLOWED_TOOLS_ARG,
    CLAUDE_DENY_MCP_TOOLS_VALUE,
    CLAUDE_STRICT_MCP_CONFIG_ARG,
    CLAUDE_NO_SESSION_PERSISTENCE_ARG,
    CLAUDE_MAX_TURNS_ARG,
    "1",
    CLAUDE_PERMISSION_MODE_ARG,
    CLAUDE_DEFAULT_PERMISSION_MODE,
  ];
 }
 /** Resolve final Claude CLI execution args for one backend invocation. */
 export function resolveClaudeCliExecutionArgs(
  context: CliBackendResolveExecutionArgsContext,
 ): string[] {
  if (context.executionMode === "side-question") {
    return resolveClaudeCliSideQuestionExecutionArgs(context.baseArgs);
  }
  const effort = mapClaudeCliThinkingLevelToEffort(context.thinkingLevel);
  if (!effort) {
    return [...context.baseArgs];
--- a/extensions/browser/skills/browser-automation/SKILL.md
+++ b/extensions/browser/skills/browser-automation/SKILL.md
@@ -25,7 +25,7 @@ Use this skill when you need the `browser` tool for anything beyond a single pag
   - Use the same `targetId` for follow-up actions so refs stay on the same tab.
   - For durable Playwright refs, request `refs="aria"` when supported. If you receive `axN` refs from `snapshotFormat="aria"`, use them only after that same snapshot call; stale or unbound `axN` refs fail fast and need a fresh snapshot.
   - Use `urls=true` when link text is ambiguous or a direct navigation target would avoid brittle clicks.
-   - Use `labels=true` on snapshot or screenshot when visual position matters. On Playwright-backed profiles, the response includes an `annotations` array (`{ref, number, role, name?, box}`) with each ref's bounding box in the captured image's coordinate space, so you can reason about position without re-snapshotting; screenshot labels can also combine with `fullPage=true` (CLI: `--full-page`) to label the whole document, or `ref` / `element` to clip to one element. `profile="user"` and other existing-session (chrome-mcp) profiles render an overlay into page screenshots but do not attach `annotations` or use the Playwright full-page/ref/element projection helper, so read positions from the labeled image itself on those profiles. The raw-CDP fallback (no Playwright) does not support labeled screenshots at all and returns a 501, so only request `labels` when Playwright is available.
+   - Use `labels=true` on snapshot or screenshot when visual position matters.
 4. Act narrowly:
   - Prefer `action="act"` with a ref from the latest snapshot.
   - After navigation, modal changes, or form submission, snapshot again before the next action.
--- a/extensions/browser/src/browser-tool.actions.ts
+++ b/extensions/browser/src/browser-tool.actions.ts
@@ -486,7 +486,6 @@ export async function executeSnapshotAction(params: {
      labels: snapshot.labels,
      labelsCount: snapshot.labelsCount,
      labelsSkipped: snapshot.labelsSkipped,
      annotations: snapshot.annotations,
      imagePath: snapshot.imagePath,
      imageType: snapshot.imageType,
      refsFallback,
--- a/extensions/browser/src/browser/client-actions-types.ts
+++ b/extensions/browser/src/browser/client-actions-types.ts
@@ -1,8 +1,6 @@
 /**
 * Shared result types for browser client action helpers.
 */
 import type { AnnotationItem } from "./screenshot-annotate.js";
 /** Generic success result for action endpoints. */
 export type BrowserActionOk = { ok: true };
@@ -22,10 +20,4 @@ export type BrowserActionPathResult = {
  labels?: boolean;
  labelsCount?: number;
  labelsSkipped?: number;
  /**
   * Per-ref bounding boxes when labels=true. Coordinates are in the
   * captured image's space (viewport / fullpage / element-relative).
   * Omitted when empty.
   */
  annotations?: AnnotationItem[];
 };
--- a/extensions/browser/src/browser/client.ts
+++ b/extensions/browser/src/browser/client.ts
@@ -18,7 +18,6 @@ import type {
 } from "./client.types.js";
 import { DEFAULT_BROWSER_SNAPSHOT_TIMEOUT_MS } from "./constants.js";
 import type { BrowserDoctorReport } from "./doctor.js";
 import type { AnnotationItem } from "./screenshot-annotate.js";
 export type { BrowserStatus, BrowserTab, BrowserTransport } from "./client.types.js";
 export type { BrowserDoctorCheck, BrowserDoctorReport } from "./doctor.js";
@@ -125,11 +124,6 @@ export type SnapshotResult =
      labels?: boolean;
      labelsCount?: number;
      labelsSkipped?: number;
      /**
       * Per-ref bounding boxes when labels=true. Coordinates are in the
       * captured image's space. Omitted when empty.
       */
      annotations?: AnnotationItem[];
      imagePath?: string;
      imageType?: "png" | "jpeg";
      blockedByDialog?: boolean;
--- a/extensions/browser/src/browser/pw-tools-core.annotate.test.ts
+++ b/extensions/browser/src/browser/pw-tools-core.annotate.test.ts
@@ -1,205 +0,0 @@
 import { beforeEach, describe, expect, it, vi } from "vitest";
 import {
  installPwToolsCoreTestHooks,
  setPwToolsCoreCurrentPage,
  setPwToolsCoreCurrentRefLocator,
 } from "./pw-tools-core.test-harness.js";
 installPwToolsCoreTestHooks();
 const mod = await import("./pw-tools-core.js");
 type EvaluateArg = unknown;
 function evaluateMockReturning(view: { x: number; y: number; width?: number; height?: number }) {
  // Caller reads { x, y, width, height } in one evaluate; default to a normal
  // desktop viewport so refs near the top stay in-viewport unless a test puts
  // them out of range explicitly.
  const result = { width: 1280, height: 720, ...view };
  return vi.fn(async (arg: EvaluateArg) => {
    if (typeof arg === "function") {
      return result;
    }
    return true;
  });
 }
 describe("screenshotWithLabelsViaPlaywright (viewport)", () => {
  beforeEach(() => {
    vi.clearAllMocks();
  });
  it("calls page.screenshot without fullPage and returns annotations", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 100 });
    const screenshot = vi.fn(async () => Buffer.from("PNG"));
    setPwToolsCoreCurrentPage({ evaluate, screenshot, url: () => "https://example.com" });
    setPwToolsCoreCurrentRefLocator({
      boundingBox: async () => ({ x: 10, y: 200, width: 50, height: 20 }),
    });
    const result = await mod.screenshotWithLabelsViaPlaywright({
      cdpUrl: "http://127.0.0.1:18792",
      targetId: "T1",
      refs: { e1: { role: "button", name: "Submit" } },
      type: "png",
    });
    expect(screenshot).toHaveBeenCalledWith(expect.objectContaining({ type: "png" }));
    expect(screenshot).not.toHaveBeenCalledWith(expect.objectContaining({ fullPage: true }));
    expect(result.annotations).toHaveLength(1);
    expect(result.annotations[0]).toMatchObject({
      ref: "e1",
      number: 1,
      role: "button",
      name: "Submit",
    });
    // viewport-mode box = doc(box.x + scroll.x, box.y + scroll.y) - scroll = bbox
    expect(result.annotations[0]?.box).toEqual({ x: 10, y: 200, width: 50, height: 20 });
    expect(result.skipped).toBe(0);
  });
  it("runs the clear script even when screenshot throws", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 0 });
    const screenshot = vi.fn(async () => {
      throw new Error("boom");
    });
    setPwToolsCoreCurrentPage({ evaluate, screenshot });
    setPwToolsCoreCurrentRefLocator({
      boundingBox: async () => ({ x: 0, y: 0, width: 1, height: 1 }),
    });
    await expect(
      mod.screenshotWithLabelsViaPlaywright({
        cdpUrl: "http://127.0.0.1:18792",
        targetId: "T1",
        refs: { e1: { role: "button" } },
      }),
    ).rejects.toThrow(/boom/);
    // The clear script must have run (string evaluate calls include the overlay attr)
    const clearCalls = evaluate.mock.calls.filter(
      ([arg]) => typeof arg === "string" && arg.includes("data-openclaw-labels"),
    );
    // inject + clear = at least 2 string evaluations
    expect(clearCalls.length).toBeGreaterThanOrEqual(2);
  });
  it("counts off-viewport refs as skipped but still surfaces them in annotations", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 0, width: 1280, height: 720 });
    const screenshot = vi.fn(async () => Buffer.from("PNG"));
    setPwToolsCoreCurrentPage({ evaluate, screenshot });
    // bbox is far below the viewport (y: 5000): not drawn, but still reported
    // so callers keep the position and a non-zero skipped count.
    setPwToolsCoreCurrentRefLocator({
      boundingBox: async () => ({ x: 0, y: 5000, width: 50, height: 20 }),
    });
    const result = await mod.screenshotWithLabelsViaPlaywright({
      cdpUrl: "http://127.0.0.1:18792",
      targetId: "T1",
      refs: { e1: { role: "button" } },
    });
    expect(result.skipped).toBe(1);
    expect(result.labels).toBe(0);
    expect(result.annotations).toHaveLength(1);
    expect(result.annotations[0]?.ref).toBe("e1");
  });
 });
 describe("screenshotWithLabelsViaPlaywright (fullpage)", () => {
  beforeEach(() => vi.clearAllMocks());
  it("forwards fullPage:true to page.screenshot and uses doc-space annotations", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 1000 });
    const screenshot = vi.fn(async () => Buffer.from("FULL"));
    setPwToolsCoreCurrentPage({ evaluate, screenshot });
    setPwToolsCoreCurrentRefLocator({
      boundingBox: async () => ({ x: 10, y: 200, width: 50, height: 20 }),
    });
    const result = await mod.screenshotWithLabelsViaPlaywright({
      cdpUrl: "http://127.0.0.1:18792",
      targetId: "T1",
      refs: { e1: { role: "button" } },
      fullPage: true,
    });
    expect(screenshot).toHaveBeenCalledWith(expect.objectContaining({ fullPage: true }));
    // doc-space: scroll y=1000 + bbox y=200 = 1200
    expect(result.annotations[0]?.box.y).toBe(1200);
    expect(result.annotations[0]?.box.x).toBe(10);
  });
 });
 describe("screenshotWithLabelsViaPlaywright (element/ref)", () => {
  beforeEach(() => vi.clearAllMocks());
  it("uses refLocator.screenshot for ref mode and projects relative to element", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 0 });
    // First call resolves the element rect (container), second resolves e1 annotation bbox.
    const boundingBox = vi
      .fn<() => Promise<{ x: number; y: number; width: number; height: number } | null>>()
      .mockResolvedValueOnce({ x: 50, y: 100, width: 200, height: 300 })
      .mockResolvedValueOnce({ x: 60, y: 110, width: 30, height: 20 });
    const elementScreenshot = vi.fn(async () => Buffer.from("ELEM"));
    setPwToolsCoreCurrentPage({ evaluate, screenshot: vi.fn() });
    setPwToolsCoreCurrentRefLocator({ boundingBox, screenshot: elementScreenshot });
    const result = await mod.screenshotWithLabelsViaPlaywright({
      cdpUrl: "http://127.0.0.1:18792",
      targetId: "T1",
      refs: { e1: { role: "button" } },
      ref: "container",
    });
    expect(elementScreenshot).toHaveBeenCalledTimes(1);
    // Element-relative: doc(60,110) - elementRect(50,100) = (10,10)
    expect(result.annotations).toHaveLength(1);
    expect(result.annotations[0]?.box).toEqual({ x: 10, y: 10, width: 30, height: 20 });
  });
  it("throws when ref/element cannot be resolved", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 0 });
    setPwToolsCoreCurrentPage({ evaluate, screenshot: vi.fn() });
    setPwToolsCoreCurrentRefLocator({
      boundingBox: async () => null,
      screenshot: vi.fn(),
    });
    await expect(
      mod.screenshotWithLabelsViaPlaywright({
        cdpUrl: "http://127.0.0.1:18792",
        targetId: "T1",
        refs: { e1: { role: "button" } },
        ref: "missing",
      }),
    ).rejects.toThrow(/element not found/i);
  });
 });
 describe("screenshotWithLabelsViaPlaywright (skipped accounting)", () => {
  beforeEach(() => vi.clearAllMocks());
  it("counts refs whose boundingBox is null toward skipped", async () => {
    const evaluate = evaluateMockReturning({ x: 0, y: 0 });
    const screenshot = vi.fn(async () => Buffer.from("PNG"));
    setPwToolsCoreCurrentPage({ evaluate, screenshot });
    // Two refs: first returns a box, second returns null (e.g. element detached).
    const boundingBox = vi
      .fn<() => Promise<{ x: number; y: number; width: number; height: number } | null>>()
      .mockResolvedValueOnce({ x: 10, y: 20, width: 30, height: 40 })
      .mockResolvedValueOnce(null);
    setPwToolsCoreCurrentRefLocator({ boundingBox });
    const result = await mod.screenshotWithLabelsViaPlaywright({
      cdpUrl: "http://127.0.0.1:18792",
      targetId: "T1",
      refs: { e1: { role: "button" }, e2: { role: "link" } },
    });
    expect(result.annotations).toHaveLength(1);
    expect(result.annotations[0]?.ref).toBe("e1");
    expect(result.skipped).toBe(1);
  });
 });
--- a/extensions/browser/src/browser/pw-tools-core.interactions.ts
+++ b/extensions/browser/src/browser/pw-tools-core.interactions.ts
@@ -41,15 +41,6 @@ import {
  toAIFriendlyError,
 } from "./pw-tools-core.shared.js";
 import { closePageViaPlaywright, resizeViewportViaPlaywright } from "./pw-tools-core.snapshot.js";
 import {
  ANNOTATION_MAX_LABELS_DEFAULT,
  type AnnotationItem,
  buildOverlayClearScript,
  buildOverlayInjectionScript,
  type CoordinateSpace,
  planAnnotations,
  type RawAnnotationInput,
 } from "./screenshot-annotate.js";
 type TargetOpts = {
  cdpUrl: string;
@@ -1296,15 +1287,7 @@ export async function screenshotWithLabelsViaPlaywright(opts: {
  maxLabels?: number;
  type?: "png" | "jpeg";
  timeoutMs?: number;
-  fullPage?: boolean;
+}): Promise<{ buffer: Buffer; labels: number; skipped: number }> {
  ref?: string;
  element?: string;
 }): Promise<{
  buffer: Buffer;
  labels: number;
  skipped: number;
  annotations: AnnotationItem[];
 }> {
  const page = await getPageForTargetId(opts);
  ensurePageState(page);
  restoreRoleRefsForTarget({ cdpUrl: opts.cdpUrl, targetId: opts.targetId, page });
@@ -1312,151 +1295,119 @@ export async function screenshotWithLabelsViaPlaywright(opts: {
  const maxLabels =
    typeof opts.maxLabels === "number" && Number.isFinite(opts.maxLabels)
      ? Math.max(1, Math.floor(opts.maxLabels))
-      : ANNOTATION_MAX_LABELS_DEFAULT;
+      : 150;
-  const refKey = normalizeOptionalString(opts.ref) ?? undefined;
+  const viewport = await page.evaluate(() => ({
-  const elementSelector = normalizeOptionalString(opts.element) ?? undefined;
+    scrollX: window.scrollX || 0,
-  const space: CoordinateSpace = opts.fullPage
+    scrollY: window.scrollY || 0,
    ? "fullpage"
    : refKey || elementSelector
      ? "element"
      : "viewport";
  // Read scroll + viewport size. Scroll converts Playwright's viewport-space
  // boundingBoxes into document-space inputs; the viewport size lets the helper
  // restore the shipped `labelsSkipped` semantics by counting off-viewport refs
  // as skipped (in viewport capture mode).
  const view = await page.evaluate(() => ({
    x: window.scrollX || 0,
    y: window.scrollY || 0,
    width: window.innerWidth || 0,
    height: window.innerHeight || 0,
  }));
  const scroll = { x: view.x, y: view.y };
-  let elementRect: { x: number; y: number; width: number; height: number } | undefined;
+  const refs = Object.keys(opts.refs ?? {});
-  if (space === "element") {
+  const boxes: Array<{ ref: string; x: number; y: number; w: number; h: number }> = [];
-    const box = await resolveElementBoundingBoxForLabels(page, refKey, elementSelector);
+  let skipped = 0;
    if (!box) {
      throw new Error(
        `screenshotWithLabelsViaPlaywright: element not found for ${
          refKey ? `ref="${refKey}"` : `selector="${elementSelector ?? ""}"`
        }`,
      );
    }
    // Convert viewport-space bbox to document space.
    elementRect = {
      x: box.x + scroll.x,
      y: box.y + scroll.y,
      width: box.width,
      height: box.height,
    };
  }
-  const refKeys = Object.keys(opts.refs ?? {});
+  for (const ref of refs) {
-  const inputs: RawAnnotationInput[] = [];
+    if (boxes.length >= maxLabels) {
-  let bboxFailures = 0;
+      skipped += 1;
  for (const ref of refKeys) {
    const box = await refLocator(page, ref)
      .boundingBox()
      .catch(() => null);
    if (!box) {
      bboxFailures += 1;
      continue;
    }
-    inputs.push({
+    try {
-      ref,
+      const box = await refLocator(page, ref).boundingBox();
-      role: opts.refs[ref].role,
+      if (!box) {
-      name: opts.refs[ref].name,
+        skipped += 1;
-      doc: {
+        continue;
-        x: box.x + scroll.x,
+      }
-        y: box.y + scroll.y,
+      const x0 = box.x;
-        width: box.width,
+      const y0 = box.y;
-        height: box.height,
+      const x1 = box.x + box.width;
-      },
+      const y1 = box.y + box.height;
-    });
+      const vx0 = viewport.scrollX;
      const vy0 = viewport.scrollY;
      const vx1 = viewport.scrollX + viewport.width;
      const vy1 = viewport.scrollY + viewport.height;
      if (x1 < vx0 || x0 > vx1 || y1 < vy0 || y0 > vy1) {
        skipped += 1;
        continue;
      }
      boxes.push({
        ref,
        x: x0 - viewport.scrollX,
        y: y0 - viewport.scrollY,
        w: Math.max(1, box.width),
        h: Math.max(1, box.height),
      });
    } catch {
      skipped += 1;
    }
  }
  const plan = planAnnotations({
    inputs,
    space,
    scroll,
    viewport: { width: view.width, height: view.height },
    elementRect,
    maxLabels,
  });
  try {
-    if (plan.overlayItems.length > 0) {
+    if (boxes.length > 0) {
-      const captureY = space === "element" ? elementRect?.y : space === "viewport" ? scroll.y : 0;
+      await page.evaluate((labels) => {
-      await page.evaluate(buildOverlayInjectionScript({ items: plan.overlayItems, captureY }));
+        const existing = document.querySelectorAll("[data-openclaw-labels]");
        existing.forEach((el) => el.remove());
        const root = document.createElement("div");
        root.setAttribute("data-openclaw-labels", "1");
        root.style.position = "fixed";
        root.style.left = "0";
        root.style.top = "0";
        root.style.zIndex = "2147483647";
        root.style.pointerEvents = "none";
        root.style.fontFamily =
          '"SF Mono","SFMono-Regular",Menlo,Monaco,Consolas,"Liberation Mono","Courier New",monospace';
        const clamp = (value: number, min: number, max: number) =>
          Math.min(max, Math.max(min, value));
        for (const label of labels) {
          const box = document.createElement("div");
          box.setAttribute("data-openclaw-labels", "1");
          box.style.position = "absolute";
          box.style.left = `${label.x}px`;
          box.style.top = `${label.y}px`;
          box.style.width = `${label.w}px`;
          box.style.height = `${label.h}px`;
          box.style.border = "2px solid #ffb020";
          box.style.boxSizing = "border-box";
          const tag = document.createElement("div");
          tag.setAttribute("data-openclaw-labels", "1");
          tag.textContent = label.ref;
          tag.style.position = "absolute";
          tag.style.left = `${label.x}px`;
          tag.style.top = `${clamp(label.y - 18, 0, 20000)}px`;
          tag.style.background = "#ffb020";
          tag.style.color = "#1a1a1a";
          tag.style.fontSize = "12px";
          tag.style.lineHeight = "14px";
          tag.style.padding = "1px 4px";
          tag.style.borderRadius = "3px";
          tag.style.boxShadow = "0 1px 2px rgba(0,0,0,0.35)";
          tag.style.whiteSpace = "nowrap";
          root.appendChild(box);
          root.appendChild(tag);
        }
        document.documentElement.appendChild(root);
      }, boxes);
    }
-    const buffer =
+
-      space === "element"
+    const buffer = await page.screenshot({ type, timeout: opts.timeoutMs });
-        ? await captureElementScreenshotForLabels(
+    return { buffer, labels: boxes.length, skipped };
            page,
            refKey,
            elementSelector,
            type,
            opts.timeoutMs,
          )
        : await page.screenshot({
            type,
            fullPage: Boolean(opts.fullPage),
            timeout: opts.timeoutMs,
          });
    return {
      // `labels` reports overlay boxes actually drawn on the captured image
      // (in-viewport, within budget); off-viewport refs are surfaced via
      // `annotations` but not drawn, and are reflected in `skipped`.
      buffer,
      labels: plan.overlayItems.length,
      skipped: plan.skipped + bboxFailures,
      annotations: plan.annotations,
    };
  } finally {
-    await page.evaluate(buildOverlayClearScript()).catch(() => {});
+    await page
      .evaluate(() => {
        const existing = document.querySelectorAll("[data-openclaw-labels]");
        existing.forEach((el) => el.remove());
      })
      .catch(() => {});
  }
 }
 async function resolveElementBoundingBoxForLabels(
  page: Page,
  refKey: string | undefined,
  cssSelector: string | undefined,
 ): Promise<{ x: number; y: number; width: number; height: number } | null> {
  if (refKey) {
    try {
      return await refLocator(page, refKey).boundingBox();
    } catch {
      return null;
    }
  }
  if (cssSelector) {
    try {
      return await page.locator(cssSelector).first().boundingBox();
    } catch {
      return null;
    }
  }
  return null;
 }
 async function captureElementScreenshotForLabels(
  page: Page,
  refKey: string | undefined,
  cssSelector: string | undefined,
  type: "png" | "jpeg",
  timeoutMs: number | undefined,
 ): Promise<Buffer> {
  if (refKey) {
    return await refLocator(page, refKey).screenshot({ type, timeout: timeoutMs });
  }
  if (cssSelector) {
    return await page.locator(cssSelector).first().screenshot({ type, timeout: timeoutMs });
  }
  throw new Error("captureElementScreenshotForLabels: requires refKey or cssSelector");
 }
 /** Sets file inputs for a role ref or selector with strict existing-path checks. */
 export async function setInputFilesViaPlaywright(opts: {
  cdpUrl: string;
--- a/extensions/browser/src/browser/routes/agent.snapshot.ts
+++ b/extensions/browser/src/browser/routes/agent.snapshot.ts
@@ -5,7 +5,6 @@
 * navigation policy checks, media storage, and screenshot normalization.
 */
 import path from "node:path";
 import { getImageMetadata } from "../../media/media-services.js";
 import { ensureMediaDir, saveMediaBuffer } from "../../media/store.js";
 import { captureScreenshot, snapshotAria, snapshotRoleViaCdp } from "../cdp.js";
 import {
@@ -25,8 +24,6 @@ import {
  assertBrowserNavigationResultAllowed,
 } from "../navigation-guard.js";
 import { getBrowserProfileCapabilities } from "../profile-capabilities.js";
 import type { AnnotationItem } from "../screenshot-annotate.js";
 import { scaleAnnotations } from "../screenshot-annotate.js";
 import {
  DEFAULT_BROWSER_SCREENSHOT_MAX_BYTES,
  DEFAULT_BROWSER_SCREENSHOT_MAX_SIDE,
@@ -195,24 +192,11 @@ async function saveNormalizedScreenshotResponse(params: {
  labels?: boolean;
  labelsCount?: number;
  labelsSkipped?: number;
  annotations?: AnnotationItem[];
 }) {
  // Measure original dimensions BEFORE normalization so we can rescale
  // annotation coordinates if the response pipeline shrinks the image
  // (longest-side or byte-budget cap). Annotation boxes are in the captured
  // image's pixel space, so they would otherwise drift from the saved media.
  const originalMeta = params.annotations?.length
    ? ((await getImageMetadata(params.buffer)) ?? undefined)
    : undefined;
  const normalized = await normalizeBrowserScreenshot(params.buffer, {
    maxSide: DEFAULT_BROWSER_SCREENSHOT_MAX_SIDE,
    maxBytes: DEFAULT_BROWSER_SCREENSHOT_MAX_BYTES,
  });
  const annotations = await rescaleAnnotationsForNormalization({
    annotations: params.annotations,
    originalMeta,
    normalizedBuffer: normalized.buffer,
  });
  await saveBrowserMediaResponse({
    res: params.res,
    buffer: normalized.buffer,
@@ -223,39 +207,9 @@ async function saveNormalizedScreenshotResponse(params: {
    labels: params.labels,
    labelsCount: params.labelsCount,
    labelsSkipped: params.labelsSkipped,
    annotations,
  });
 }
 /**
 * Keep annotation coordinates aligned with the saved media after
 * normalizeBrowserScreenshot. Returns the original annotations unchanged
 * when normalization did not change the image dimensions, or when image
 * metadata is unavailable (best-effort: better to ship pre-resize coords
 * than to drop the field entirely).
 */
 async function rescaleAnnotationsForNormalization(params: {
  annotations?: AnnotationItem[];
  originalMeta?: { width?: number; height?: number };
  normalizedBuffer: Buffer;
 }): Promise<AnnotationItem[] | undefined> {
  if (!params.annotations || params.annotations.length === 0) {
    return params.annotations;
  }
  const orig = params.originalMeta;
  if (!orig?.width || !orig?.height) {
    return params.annotations;
  }
  const next = await getImageMetadata(params.normalizedBuffer);
  if (!next?.width || !next?.height) {
    return params.annotations;
  }
  if (next.width === orig.width && next.height === orig.height) {
    return params.annotations;
  }
  return scaleAnnotations(params.annotations, next.width / orig.width, next.height / orig.height);
 }
 async function saveBrowserMediaResponse(params: {
  res: BrowserResponse;
  buffer: Buffer;
@@ -266,7 +220,6 @@ async function saveBrowserMediaResponse(params: {
  labels?: boolean;
  labelsCount?: number;
  labelsSkipped?: number;
  annotations?: AnnotationItem[];
 }) {
  await ensureMediaDir();
  const saved = await saveMediaBuffer(
@@ -283,9 +236,6 @@ async function saveBrowserMediaResponse(params: {
    ...(params.labels ? { labels: true } : {}),
    ...(typeof params.labelsCount === "number" ? { labelsCount: params.labelsCount } : {}),
    ...(typeof params.labelsSkipped === "number" ? { labelsSkipped: params.labelsSkipped } : {}),
    ...(params.annotations && params.annotations.length > 0
      ? { annotations: params.annotations }
      : {}),
  });
 }
@@ -528,9 +478,6 @@ export function registerBrowserAgentSnapshotRoutes(
                refs: snap.refs,
                type,
                timeoutMs,
                fullPage,
                ref,
                element,
              });
              await saveNormalizedScreenshotResponse({
                res,
@@ -541,7 +488,6 @@ export function registerBrowserAgentSnapshotRoutes(
                labels: true,
                labelsCount: labeled.labels,
                labelsSkipped: labeled.skipped,
                annotations: labeled.annotations,
              });
              return;
            }
@@ -797,18 +743,10 @@ export function registerBrowserAgentSnapshotRoutes(
              type: "png",
              timeoutMs: plan.timeoutMs,
            });
            const originalMeta = labeled.annotations.length
              ? ((await getImageMetadata(labeled.buffer)) ?? undefined)
              : undefined;
            const normalized = await normalizeBrowserScreenshot(labeled.buffer, {
              maxSide: DEFAULT_BROWSER_SCREENSHOT_MAX_SIDE,
              maxBytes: DEFAULT_BROWSER_SCREENSHOT_MAX_BYTES,
            });
            const scaledAnnotations = await rescaleAnnotationsForNormalization({
              annotations: labeled.annotations,
              originalMeta,
              normalizedBuffer: normalized.buffer,
            });
            await ensureMediaDir();
            const saved = await saveMediaBuffer(
              normalized.buffer,
@@ -826,9 +764,6 @@ export function registerBrowserAgentSnapshotRoutes(
              labels: true,
              labelsCount: labeled.labels,
              labelsSkipped: labeled.skipped,
              ...(scaledAnnotations && scaledAnnotations.length > 0
                ? { annotations: scaledAnnotations }
                : {}),
              imagePath: path.resolve(saved.path),
              imageType,
              ...snap,
--- a/extensions/browser/src/browser/screenshot-annotate.test.ts
+++ b/extensions/browser/src/browser/screenshot-annotate.test.ts
@@ -1,345 +0,0 @@
 import { describe, expect, it } from "vitest";
 import {
  ANNOTATION_OVERLAY_ATTR,
  type AnnotationItem,
  buildOverlayClearScript,
  buildOverlayInjectionScript,
  planAnnotations,
  type RawAnnotationInput,
  refToNumber,
  scaleAnnotations,
 } from "./screenshot-annotate.js";
 const sampleInputs: RawAnnotationInput[] = [
  {
    ref: "e1",
    role: "button",
    name: "Submit",
    doc: { x: 100, y: 200, width: 50, height: 20 },
  },
  {
    ref: "e2",
    role: "link",
    doc: { x: 300, y: 1500, width: 80, height: 18 },
  },
 ];
 describe("refToNumber", () => {
  it("extracts number from `e<N>` form", () => {
    expect(refToNumber("e12")).toBe(12);
    expect(refToNumber("e0")).toBe(0);
  });
  it("extracts number from `ax<N>` form", () => {
    expect(refToNumber("ax12")).toBe(12);
  });
  it("extracts number from bare numeric form", () => {
    expect(refToNumber("12")).toBe(12);
  });
  it("returns 0 for non-numeric refs", () => {
    expect(refToNumber("foo")).toBe(0);
    expect(refToNumber("")).toBe(0);
  });
 });
 describe("planAnnotations - viewport mode", () => {
  it("subtracts scroll from doc coords", () => {
    const plan = planAnnotations({
      inputs: sampleInputs,
      space: "viewport",
      scroll: { x: 0, y: 1000 },
    });
    expect(plan.annotations).toHaveLength(2);
    expect(plan.annotations[0]).toEqual({
      ref: "e1",
      number: 1,
      role: "button",
      name: "Submit",
      box: { x: 100, y: -800, width: 50, height: 20 },
    });
    expect(plan.annotations[1]).toEqual({
      ref: "e2",
      number: 2,
      role: "link",
      box: { x: 300, y: 500, width: 80, height: 18 },
    });
    expect(plan.skipped).toBe(0);
  });
  it("keeps overlay items in document space regardless of mode", () => {
    const plan = planAnnotations({
      inputs: sampleInputs,
      space: "viewport",
      scroll: { x: 0, y: 1000 },
    });
    expect(plan.overlayItems).toEqual([
      { ref: "e1", x: 100, y: 200, w: 50, h: 20 },
      { ref: "e2", x: 300, y: 1500, w: 80, h: 18 },
    ]);
  });
  it("omits empty name field", () => {
    const plan = planAnnotations({
      inputs: [{ ref: "e1", role: "button", name: "", doc: { x: 0, y: 0, width: 1, height: 1 } }],
      space: "viewport",
      scroll: { x: 0, y: 0 },
    });
    expect(plan.annotations[0]).not.toHaveProperty("name");
  });
  it("throws when scroll missing in viewport mode", () => {
    expect(() => planAnnotations({ inputs: sampleInputs, space: "viewport" })).toThrow(/scroll/);
  });
 });
 describe("planAnnotations - viewport off-screen accounting", () => {
  it("counts off-viewport refs as skipped but keeps them in annotations when viewport size is given", () => {
    const plan = planAnnotations({
      inputs: [
        { ref: "e1", role: "button", doc: { x: 10, y: 50, width: 40, height: 20 } }, // in viewport
        { ref: "e2", role: "link", doc: { x: 10, y: 5000, width: 40, height: 20 } }, // below viewport
      ],
      space: "viewport",
      scroll: { x: 0, y: 0 },
      viewport: { width: 1280, height: 720 },
    });
    // Only the in-viewport ref is drawn.
    expect(plan.overlayItems.map((o) => o.ref)).toEqual(["e1"]);
    // Both refs are surfaced for callers (off-viewport box can be out of image).
    expect(plan.annotations.map((a) => a.ref)).toEqual(["e1", "e2"]);
    // The off-viewport ref raises skipped, preserving the shipped contract.
    expect(plan.skipped).toBe(1);
  });
  it("does not count off-viewport refs when viewport size is omitted", () => {
    const plan = planAnnotations({
      inputs: [{ ref: "e2", role: "link", doc: { x: 10, y: 5000, width: 40, height: 20 } }],
      space: "viewport",
      scroll: { x: 0, y: 0 },
    });
    expect(plan.skipped).toBe(0);
    expect(plan.overlayItems).toHaveLength(1);
    expect(plan.annotations).toHaveLength(1);
  });
 });
 describe("planAnnotations - fullpage mode", () => {
  it("returns box equal to doc (document coordinates)", () => {
    const plan = planAnnotations({ inputs: sampleInputs, space: "fullpage" });
    expect(plan.annotations[0].box).toEqual({ x: 100, y: 200, width: 50, height: 20 });
    expect(plan.annotations[1].box).toEqual({ x: 300, y: 1500, width: 80, height: 18 });
  });
  it("does not require scroll", () => {
    expect(() => planAnnotations({ inputs: sampleInputs, space: "fullpage" })).not.toThrow();
  });
 });
 describe("planAnnotations - element mode", () => {
  const elementRect = { x: 50, y: 100, width: 200, height: 300 };
  it("projects box relative to element top-left", () => {
    const plan = planAnnotations({
      inputs: [{ ref: "e1", role: "button", doc: { x: 60, y: 110, width: 40, height: 20 } }],
      space: "element",
      elementRect,
    });
    expect(plan.annotations[0].box).toEqual({ x: 10, y: 10, width: 40, height: 20 });
  });
  it("filters out inputs that do not overlap element rect", () => {
    const plan = planAnnotations({
      inputs: [
        { ref: "e1", role: "button", doc: { x: 60, y: 110, width: 40, height: 20 } }, // inside
        { ref: "e2", role: "link", doc: { x: 500, y: 500, width: 40, height: 20 } }, // outside
      ],
      space: "element",
      elementRect,
    });
    expect(plan.annotations).toHaveLength(1);
    expect(plan.annotations[0].ref).toBe("e1");
    expect(plan.overlayItems).toHaveLength(1);
  });
  it("throws when elementRect missing", () => {
    expect(() => planAnnotations({ inputs: [], space: "element" })).toThrow(/elementRect/);
  });
 });
 describe("planAnnotations - maxLabels", () => {
  it("truncates to maxLabels and reports skipped", () => {
    const inputs = Array.from({ length: 5 }, (_, i) => ({
      ref: `e${i + 1}`,
      role: "button",
      doc: { x: 0, y: i * 10, width: 5, height: 5 },
    }));
    const plan = planAnnotations({ inputs, space: "fullpage", maxLabels: 2 });
    expect(plan.annotations).toHaveLength(2);
    expect(plan.overlayItems).toHaveLength(2);
    expect(plan.skipped).toBe(3);
  });
  it("uses ANNOTATION_MAX_LABELS_DEFAULT when not specified", () => {
    const inputs = Array.from({ length: 200 }, (_, i) => ({
      ref: `e${i + 1}`,
      role: "button",
      doc: { x: 0, y: i, width: 5, height: 5 },
    }));
    const plan = planAnnotations({ inputs, space: "fullpage" });
    expect(plan.annotations).toHaveLength(150);
    expect(plan.skipped).toBe(50);
  });
 });
 describe("buildOverlayInjectionScript", () => {
  it("returns a self-contained IIFE", () => {
    const script = buildOverlayInjectionScript({
      items: [{ ref: "e1", x: 100, y: 200, w: 50, h: 20 }],
    });
    expect(script).toMatch(/^\(\s*\(\s*\)\s*=>\s*\{/);
    expect(script).toMatch(/\}\s*\)\s*\(\s*\)\s*;?\s*$/);
  });
  it("embeds the overlay attr", () => {
    const script = buildOverlayInjectionScript({ items: [] });
    expect(script).toContain(ANNOTATION_OVERLAY_ATTR);
  });
  it("embeds each item's ref text and coordinates", () => {
    const script = buildOverlayInjectionScript({
      items: [
        { ref: "e1", x: 100, y: 200, w: 50, h: 20 },
        { ref: "ax42", x: 999, y: 1500, w: 80, h: 18 },
      ],
    });
    expect(script).toMatch(/"ref":\s*"e1"/);
    expect(script).toMatch(/"ref":\s*"ax42"/);
    expect(script).toMatch(/"x":\s*100/);
    expect(script).toMatch(/"x":\s*999/);
  });
  it("handles empty items without throwing", () => {
    expect(() => buildOverlayInjectionScript({ items: [] })).not.toThrow();
  });
  it("rounds coordinates to integers", () => {
    const script = buildOverlayInjectionScript({
      items: [{ ref: "e1", x: 100.7, y: 200.4, w: 50.6, h: 20.1 }],
    });
    expect(script).toMatch(/"x":\s*101/); // 100.7 -> 101
    expect(script).toMatch(/"y":\s*200/); // 200.4 -> 200
  });
  it("clamps zero/negative-size boxes to 1px so they remain visible", () => {
    const script = buildOverlayInjectionScript({
      items: [{ ref: "e1", x: 10, y: 10, w: 0, h: 0 }],
    });
    expect(script).toMatch(/"w":\s*1/);
    expect(script).toMatch(/"h":\s*1/);
  });
  it("escapes hostile ref characters via JSON.stringify (no breakout)", () => {
    const hostile = 'e1");alert(1);//';
    const script = buildOverlayInjectionScript({
      items: [{ ref: hostile, x: 0, y: 0, w: 1, h: 1 }],
    });
    // The hostile `"` MUST be escaped as `\"` inside the JSON literal.
    expect(script).toContain('"e1\\");alert(1);//"');
    // The unescaped breakout MUST NOT appear anywhere in the script as a
    // bare statement that would terminate the JSON literal early.
    expect(script).not.toContain('e1");alert(1);');
  });
  it("flips label below the box when y < 14 (no headroom)", () => {
    const script = buildOverlayInjectionScript({
      items: [{ ref: "e1", x: 0, y: 5, w: 10, h: 10 }],
    });
    // labelTop = relativeY < 14 ? it.y + 2 : it.y - 14
    // The expression literal `relativeY < 14 ? (it.y + 2) : (it.y - 14)` is in the script.
    expect(script).toContain("relativeY < 14 ? (it.y + 2) : (it.y - 14)");
  });
  it("uses capture-relative y when deciding whether to flip labels below boxes", () => {
    const script = buildOverlayInjectionScript({
      items: [{ ref: "e1", x: 0, y: 1005, w: 10, h: 10 }],
      captureY: 1000,
    });
    expect(script).toContain("var captureY = 1000;");
    expect(script).toContain("var relativeY = it.y - captureY;");
    expect(script).toContain("relativeY < 14 ? (it.y + 2) : (it.y - 14)");
  });
 });
 describe("buildOverlayClearScript", () => {
  it("returns an IIFE selecting overlay attr", () => {
    const script = buildOverlayClearScript();
    expect(script).toContain(`[${ANNOTATION_OVERLAY_ATTR}]`);
    expect(script).toMatch(/^\(\s*\(\s*\)\s*=>\s*\{/);
  });
 });
 describe("scaleAnnotations", () => {
  const sample: AnnotationItem[] = [
    {
      ref: "e1",
      number: 1,
      role: "button",
      name: "Submit",
      box: { x: 100, y: 200, width: 50, height: 20 },
    },
  ];
  it("returns identity (structural copy) when both factors are 1", () => {
    const out = scaleAnnotations(sample, 1, 1);
    expect(out[0]).toEqual(sample[0]);
    expect(out[0]).not.toBe(sample[0]);
    expect(out[0]?.box).not.toBe(sample[0]?.box);
  });
  it("scales box dimensions by independent x/y factors", () => {
    const out = scaleAnnotations(sample, 0.5, 0.485);
    expect(out[0]?.box).toEqual({
      x: 50,
      y: 97,
      width: 25,
      height: 10,
    });
  });
  it("clamps width/height to a minimum of 1 to avoid disappearing labels", () => {
    const tiny: AnnotationItem[] = [
      {
        ref: "e1",
        number: 1,
        role: "button",
        box: { x: 0, y: 0, width: 1, height: 1 },
      },
    ];
    const out = scaleAnnotations(tiny, 0.1, 0.1);
    expect(out[0]?.box.width).toBeGreaterThanOrEqual(1);
    expect(out[0]?.box.height).toBeGreaterThanOrEqual(1);
  });
  it("returns identity (structural copy) for invalid factors", () => {
    const out = scaleAnnotations(sample, Number.NaN, 0.5);
    expect(out[0]?.box).toEqual(sample[0]?.box);
    const out2 = scaleAnnotations(sample, 0, 0.5);
    expect(out2[0]?.box).toEqual(sample[0]?.box);
    const out3 = scaleAnnotations(sample, -1, 1);
    expect(out3[0]?.box).toEqual(sample[0]?.box);
  });
  it("preserves ref/number/role/name fields verbatim", () => {
    const out = scaleAnnotations(sample, 0.5, 0.5);
    expect(out[0]?.ref).toBe("e1");
    expect(out[0]?.number).toBe(1);
    expect(out[0]?.role).toBe("button");
    expect(out[0]?.name).toBe("Submit");
  });
 });
--- a/extensions/browser/src/browser/screenshot-annotate.ts
+++ b/extensions/browser/src/browser/screenshot-annotate.ts
@@ -1,282 +0,0 @@
 // extensions/browser/src/browser/screenshot-annotate.ts
 //
 // Pure helper module for screenshot label annotations.
 // Has no Playwright / CDP / page dependency: takes document-space inputs,
 // returns coordinate-projected annotations + IIFE strings the caller can
 // hand to page.evaluate / Runtime.evaluate.
 //
 // Used by:
 //   - pw-tools-core.interactions.ts (Playwright path, M1.2-a)
 //   - planned: raw-CDP path in M1.2-b
 //
 // chrome-mcp path keeps its own inline overlay (renderChromeMcpLabels) for now.
 export const ANNOTATION_OVERLAY_ATTR = "data-openclaw-labels";
 export const ANNOTATION_OVERLAY_ROOT_ID = "__openclaw-annotations__";
 export const ANNOTATION_MAX_LABELS_DEFAULT = 150;
 export type CoordinateSpace = "viewport" | "fullpage" | "element";
 export interface RawAnnotationInput {
  ref: string;
  role: string;
  name?: string;
  /** Bounding box in document coordinates (viewport top-left + scroll). */
  doc: { x: number; y: number; width: number; height: number };
 }
 export interface AnnotationBox {
  x: number;
  y: number;
  width: number;
  height: number;
 }
 export interface AnnotationItem {
  ref: string;
  number: number;
  role: string;
  name?: string;
  box: AnnotationBox;
 }
 export interface OverlayItem {
  ref: string;
  x: number;
  y: number;
  w: number;
  h: number;
 }
 export interface AnnotationPlan {
  /** Always document-space items, fed to buildOverlayInjectionScript. */
  overlayItems: OverlayItem[];
  /** Items projected into the capture mode's image-space coordinates. */
  annotations: AnnotationItem[];
  /** Refs dropped because of maxLabels truncation. */
  skipped: number;
 }
 export interface PlanAnnotationsParams {
  inputs: RawAnnotationInput[];
  space: CoordinateSpace;
  /** Required when space === "viewport". */
  scroll?: { x: number; y: number };
  /**
   * Viewport size (CSS px). Only meaningful when space === "viewport". When
   * provided, refs whose document box falls outside the current viewport rect
   * (`scroll` + this size) are counted as skipped instead of drawn, preserving
   * the shipped `labelsSkipped` contract. Omit it to disable that accounting.
   */
  viewport?: { width: number; height: number };
  /** Required when space === "element". */
  elementRect?: { x: number; y: number; width: number; height: number };
  maxLabels?: number;
 }
 export function refToNumber(ref: string): number {
  const match = ref.match(/(\d+)/);
  if (!match) {
    return 0;
  }
  const n = Number(match[1]);
  return Number.isFinite(n) ? n : 0;
 }
 export function planAnnotations(params: PlanAnnotationsParams): AnnotationPlan {
  const maxLabels = params.maxLabels ?? ANNOTATION_MAX_LABELS_DEFAULT;
  if (params.space === "viewport" && !params.scroll) {
    throw new Error("planAnnotations: scroll is required when space is 'viewport'");
  }
  if (params.space === "element" && !params.elementRect) {
    throw new Error("planAnnotations: elementRect is required when space is 'element'");
  }
  // Element-mode filter: discard inputs that do not overlap the element rect.
  let kept = params.inputs;
  if (params.space === "element" && params.elementRect) {
    const er = params.elementRect;
    kept = params.inputs.filter((input) => rectsOverlap(input.doc, er));
  }
  // Viewport capture only shows refs inside the current viewport rect. An
  // off-viewport ref is still surfaced in `annotations` (with its real,
  // possibly out-of-image box) so callers can locate it, but it is not drawn
  // and is counted as skipped. This keeps the shipped `labelsSkipped` meaning
  // ("refs not present in the captured viewport image") instead of silently
  // narrowing it. Only applied when the caller supplies the viewport size;
  // without it we cannot decide off-screen state and skip nothing.
  const viewportRect =
    params.space === "viewport" && params.scroll && params.viewport
      ? {
          x: params.scroll.x,
          y: params.scroll.y,
          width: params.viewport.width,
          height: params.viewport.height,
        }
      : undefined;
  const overlayItems: OverlayItem[] = [];
  const annotations: AnnotationItem[] = [];
  let skipped = 0;
  for (const input of kept) {
    if (viewportRect && !rectsOverlap(input.doc, viewportRect)) {
      // Outside the captured viewport: count as skipped (compat) but still
      // report the annotation; do not draw it or consume the label budget.
      skipped += 1;
      annotations.push(toAnnotation(input, params));
      continue;
    }
    if (overlayItems.length >= maxLabels) {
      skipped += 1;
      continue;
    }
    overlayItems.push({
      ref: input.ref,
      x: input.doc.x,
      y: input.doc.y,
      w: input.doc.width,
      h: input.doc.height,
    });
    annotations.push(toAnnotation(input, params));
  }
  return { overlayItems, annotations, skipped };
 }
 function toAnnotation(input: RawAnnotationInput, params: PlanAnnotationsParams): AnnotationItem {
  return {
    ref: input.ref,
    number: refToNumber(input.ref),
    role: input.role,
    ...(input.name ? { name: input.name } : {}),
    box: projectBox(input.doc, params),
  };
 }
 function projectBox(
  doc: { x: number; y: number; width: number; height: number },
  params: PlanAnnotationsParams,
 ): AnnotationBox {
  if (params.space === "viewport") {
    const scroll = params.scroll!;
    return {
      x: doc.x - scroll.x,
      y: doc.y - scroll.y,
      width: doc.width,
      height: doc.height,
    };
  }
  if (params.space === "element") {
    const er = params.elementRect!;
    // NOTE: width/height pass through unchanged even when the input rect
    // partially extends past the element. The capture backend (e.g.
    // locator.screenshot) is responsible for clipping; the box may have
    // negative x/y or extend past elementRect width/height for partial overlaps.
    return {
      x: doc.x - er.x,
      y: doc.y - er.y,
      width: doc.width,
      height: doc.height,
    };
  }
  // fullpage: document coordinates as-is
  return { x: doc.x, y: doc.y, width: doc.width, height: doc.height };
 }
 function rectsOverlap(
  a: { x: number; y: number; width: number; height: number },
  b: { x: number; y: number; width: number; height: number },
 ): boolean {
  return a.x < b.x + b.width && a.x + a.width > b.x && a.y < b.y + b.height && a.y + a.height > b.y;
 }
 export function buildOverlayInjectionScript(params: {
  items: OverlayItem[];
  captureY?: number;
 }): string {
  const itemsJson = JSON.stringify(
    params.items.map((it) => ({
      ref: it.ref,
      x: round(it.x),
      y: round(it.y),
      w: Math.max(1, round(it.w)),
      h: Math.max(1, round(it.h)),
    })),
  );
  const attr = ANNOTATION_OVERLAY_ATTR;
  const rootId = ANNOTATION_OVERLAY_ROOT_ID;
  const captureY = Number.isFinite(params.captureY) ? round(params.captureY ?? 0) : 0;
  return `(() => {
  var items = ${itemsJson};
  var captureY = ${captureY};
  var existing = document.querySelectorAll("[${attr}]");
  for (var k = 0; k < existing.length; k++) existing[k].remove();
  var root = document.createElement("div");
  root.id = ${JSON.stringify(rootId)};
  root.setAttribute("${attr}", "1");
  root.style.cssText = "position:absolute;top:0;left:0;width:0;height:0;pointer-events:none;z-index:2147483647;font-family:'SF Mono','SFMono-Regular',Menlo,Monaco,Consolas,'Liberation Mono','Courier New',monospace;";
  for (var i = 0; i < items.length; i++) {
    var it = items[i];
    var box = document.createElement("div");
    box.setAttribute("${attr}", "1");
    box.style.cssText = "position:absolute;left:" + it.x + "px;top:" + it.y + "px;width:" + it.w + "px;height:" + it.h + "px;border:2px solid #ffb020;box-sizing:border-box;pointer-events:none;";
    var tag = document.createElement("div");
    tag.setAttribute("${attr}", "1");
    tag.textContent = String(it.ref);
    var relativeY = it.y - captureY;
    var labelTop = relativeY < 14 ? (it.y + 2) : (it.y - 14);
    tag.style.cssText = "position:absolute;left:" + it.x + "px;top:" + labelTop + "px;background:#ffb020;color:#1a1a1a;font:bold 11px/14px monospace;padding:0 4px;border-radius:2px;white-space:nowrap;pointer-events:none;";
    root.appendChild(box);
    root.appendChild(tag);
  }
  document.documentElement.appendChild(root);
  return true;
 })();`;
 }
 export function buildOverlayClearScript(): string {
  const attr = ANNOTATION_OVERLAY_ATTR;
  return `(() => {
  var existing = document.querySelectorAll("[${attr}]");
  for (var k = 0; k < existing.length; k++) existing[k].remove();
  return true;
 })();`;
 }
 /**
 * Scale annotation boxes by independent x/y factors. Used to keep annotation
 * coordinates aligned with the saved image after the response pipeline
 * resizes the screenshot (e.g. via normalizeBrowserScreenshot capping the
 * longest side or the byte budget). Returns a new array; inputs are not
 * mutated. When both factors are 1 the boxes are returned unchanged (modulo
 * structural copy) so callers can share the same code path for resized and
 * non-resized captures.
 */
 export function scaleAnnotations(
  items: AnnotationItem[],
  scaleX: number,
  scaleY: number,
 ): AnnotationItem[] {
  if (!Number.isFinite(scaleX) || !Number.isFinite(scaleY) || scaleX <= 0 || scaleY <= 0) {
    return items.map((it) => ({ ...it, box: { ...it.box } }));
  }
  if (scaleX === 1 && scaleY === 1) {
    return items.map((it) => ({ ...it, box: { ...it.box } }));
  }
  return items.map((it) => ({
    ...it,
    box: {
      x: round(it.box.x * scaleX),
      y: round(it.box.y * scaleY),
      width: Math.max(1, round(it.box.width * scaleX)),
      height: Math.max(1, round(it.box.height * scaleY)),
    },
  }));
 }
 function round(v: number): number {
  return Math.round(v);
 }
--- a/extensions/browser/src/cli/browser-cli-actions-input/register.navigation.test.ts
+++ b/extensions/browser/src/cli/browser-cli-actions-input/register.navigation.test.ts
@@ -61,18 +61,4 @@ describe("browser navigation commands", () => {
    expect(capture.runtimeErrors.join("\n")).toContain("Invalid width: maximum is 8192");
    expect(mocks.runBrowserResizeWithOutput).not.toHaveBeenCalled();
  });
  it("navigate and resize commands are registered after removing dead import (#83878)", async () => {
    const program = createNavigationProgram();
    const browserCmd = program.commands.find((c) => c.name() === "browser");
    expect(browserCmd).toBeDefined();
    const cmds = browserCmd!.commands.map((c) => c.name());
    expect(cmds).toContain("resize");
    expect(cmds).toContain("navigate");
    // Verify the shared module still exports requireRef (used by other modules)
    const shared = await import("./shared.js");
    expect(typeof shared.requireRef).toBe("function");
  });
 });
--- a/extensions/browser/src/cli/browser-cli-actions-input/register.navigation.ts
+++ b/extensions/browser/src/cli/browser-cli-actions-input/register.navigation.ts
@@ -12,7 +12,7 @@ import {
  type BrowserParentOpts,
 } from "../browser-cli-shared.js";
 import { danger, defaultRuntime } from "../core-api.js";
-import { resolveBrowserActionContext } from "./shared.js";
+import { requireRef, resolveBrowserActionContext } from "./shared.js";
 /** Registers Browser navigate and resize commands. */
 export function registerBrowserNavigationCommands(
@@ -94,4 +94,7 @@ export function registerBrowserNavigationCommands(
        defaultRuntime.exit(1);
      }
    });
  // Keep `requireRef` reachable; shared utilities are intended for other modules too.
  void requireRef;
 }
--- a/extensions/browser/src/cli/browser-cli-inspect.ts
+++ b/extensions/browser/src/cli/browser-cli-inspect.ts
@@ -51,11 +51,7 @@ export function registerBrowserInspectCommands(
    .option("--full-page", "Capture full scrollable page", false)
    .option("--ref <ref>", "ARIA ref from ai snapshot")
    .option("--element <selector>", "CSS selector for element screenshot")
-    .option(
+    .option("--labels", "Overlay role refs on the screenshot", false)
      "--labels",
      "Overlay role refs on the screenshot (works with --full-page, --ref, and --element)",
      false,
    )
    .option("--type <png|jpeg>", "Output type (default: png)", "png")
    .action(async (targetId: string | undefined, opts, cmd) => {
      const parent = parentOpts(cmd);
@@ -102,7 +98,7 @@ export function registerBrowserInspectCommands(
    .option("--depth <n>", "Role snapshot: max depth")
    .option("--selector <sel>", "Role snapshot: scope to CSS selector")
    .option("--frame <sel>", "Role snapshot: scope to an iframe selector")
-    .option("--labels", "Include label overlay screenshot with annotations", false)
+    .option("--labels", "Include viewport label overlay screenshot", false)
    .option("--urls", "Append discovered link URLs to AI snapshots", false)
    .option("--out <path>", "Write snapshot to a file")
    .action(async (opts, cmd) => {
--- a/extensions/canvas/src/cli.test.ts
+++ b/extensions/canvas/src/cli.test.ts
@@ -1,11 +1,7 @@
 // Canvas tests cover cli plugin behavior.
 import { Command } from "commander";
 import { describe, expect, it, vi } from "vitest";
-import {
+import { registerNodesCanvasCommands, type CanvasCliDependencies } from "./cli.js";
  createDefaultCanvasCliDependencies,
  registerNodesCanvasCommands,
  type CanvasCliDependencies,
 } from "./cli.js";
 function createCanvasCliDeps() {
  const writtenFiles: Array<{ filePath: string; base64: string }> = [];
@@ -51,26 +47,6 @@ function createCanvasCliDeps() {
  return { deps, runtime, writtenFiles };
 }
 function createCanvasCliDepsWithDefaultParsers() {
  const baseDeps = createDefaultCanvasCliDependencies();
  const harness = createCanvasCliDeps();
  return {
    ...harness,
    deps: {
      ...baseDeps,
      defaultRuntime: harness.runtime,
      nodesCallOpts: harness.deps.nodesCallOpts,
      runNodesCommand: harness.deps.runNodesCommand,
      getNodesTheme: harness.deps.getNodesTheme,
      resolveNodeId: harness.deps.resolveNodeId,
      buildNodeInvokeParams: harness.deps.buildNodeInvokeParams,
      callGatewayCli: harness.deps.callGatewayCli,
      writeBase64ToFile: harness.deps.writeBase64ToFile,
      shortenHomePath: harness.deps.shortenHomePath,
    },
  };
 }
 describe("canvas CLI", () => {
  it("registers under nodes and captures a snapshot media path", async () => {
    const program = new Command();
@@ -159,8 +135,6 @@ describe("canvas CLI", () => {
  it.each([
    ["--max-width", "640px", "--max-width must be a positive integer."],
    ["--quality", "0.8x", "--quality must be a number."],
    ["--quality", "-0.1", "--quality must be between 0 and 1."],
    ["--quality", "5", "--quality must be between 0 and 1."],
  ])("rejects partial numeric snapshot %s values", async (flag, value, message) => {
    const program = new Command();
    program.exitOverride();
@@ -177,62 +151,6 @@ describe("canvas CLI", () => {
    expect(deps.callGatewayCli).not.toHaveBeenCalled();
  });
  it.each(["0", "1"])("accepts snapshot --quality boundary value %s", async (quality) => {
    const program = new Command();
    program.exitOverride();
    const nodes = program.command("nodes");
    const { deps } = createCanvasCliDeps();
    registerNodesCanvasCommands(nodes, deps);
    await program.parseAsync(
      ["nodes", "canvas", "snapshot", "--node", "ios-node", "--quality", quality],
      {
        from: "user",
      },
    );
    expect(deps.callGatewayCli).toHaveBeenCalledWith(
      "node.invoke",
      expect.any(Object),
      expect.objectContaining({
        params: expect.objectContaining({
          quality: Number(quality),
        }),
      }),
    );
  });
  it.each([
    ["snapshot"],
    ["present"],
    ["hide"],
    ["navigate", "https://example.com"],
    ["eval", "1 + 1"],
    ["a2ui", "push", "--text", "hello"],
    ["a2ui", "reset"],
  ])("rejects invalid %s invoke timeouts before invoking the node", async (...args) => {
    const program = new Command();
    program.exitOverride();
    const nodes = program.command("nodes");
    const { deps } = createCanvasCliDepsWithDefaultParsers();
    deps.resolveNodeId = vi.fn(async () => {
      throw new Error("resolveNodeId should not be called");
    });
    registerNodesCanvasCommands(nodes, deps);
    await expect(
      program.parseAsync(
        ["nodes", "canvas", ...args, "--node", "ios-node", "--invoke-timeout", "20ms"],
        {
          from: "user",
        },
      ),
    ).rejects.toThrow("--invoke-timeout must be a positive integer.");
    expect(deps.resolveNodeId).not.toHaveBeenCalled();
    expect(deps.callGatewayCli).not.toHaveBeenCalled();
  });
  it.each([
    ["--x", "1x"],
    ["--y", "2px"],
--- a/extensions/canvas/src/cli.ts
+++ b/extensions/canvas/src/cli.ts
@@ -97,11 +97,7 @@ function parseTimeoutMs(raw: unknown): number | undefined {
  if (raw === undefined || raw === null) {
    return undefined;
  }
-  const parsed = parseStrictPositiveInteger(raw);
+  return parseStrictPositiveInteger(raw);
  if (parsed === undefined) {
    throw new Error("--invoke-timeout must be a positive integer.");
  }
  return parsed;
 }
 function parseCanvasPositiveIntOption(raw: string | undefined, flag: string): number | undefined {
@@ -126,14 +122,6 @@ function parseCanvasFiniteNumberOption(raw: string | undefined, flag: string): n
  return parsed;
 }
 function parseCanvasSnapshotQualityOption(raw: string | undefined): number | undefined {
  const parsed = parseCanvasFiniteNumberOption(raw, "--quality");
  if (parsed !== undefined && (parsed < 0 || parsed > 1)) {
    throw new Error("--quality must be between 0 and 1.");
  }
  return parsed;
 }
 function parseNodeCandidates(raw: unknown): CanvasNodeCandidate[] {
  const payload =
    raw && typeof raw === "object" ? (raw as { nodes?: unknown; paired?: unknown }) : {};
@@ -257,8 +245,8 @@ async function invokeCanvas(
  command: string,
  params?: Record<string, unknown>,
 ) {
  const timeoutMs = deps.parseTimeoutMs(opts.invokeTimeout);
  const nodeId = await deps.resolveNodeId(opts, normalizeOptionalString(opts.node) ?? "");
  const timeoutMs = deps.parseTimeoutMs(opts.invokeTimeout);
  return await deps.callGatewayCli(
    "node.invoke",
    opts,
@@ -290,7 +278,7 @@ export function registerNodesCanvasCommands(nodes: Command, deps: CanvasCliDepen
        await deps.runNodesCommand("canvas snapshot", async () => {
          const format = parseCanvasSnapshotRequestFormat(opts.format);
          const maxWidth = parseCanvasPositiveIntOption(opts.maxWidth, "--max-width");
-          const quality = parseCanvasSnapshotQualityOption(opts.quality);
+          const quality = parseCanvasFiniteNumberOption(opts.quality, "--quality");
          const raw = await invokeCanvas(deps, opts, "canvas.snapshot", {
            format,
            maxWidth: Number.isFinite(maxWidth) ? maxWidth : undefined,
--- a/extensions/codex/doctor-contract-api.test.ts
+++ b/extensions/codex/doctor-contract-api.test.ts
@@ -1,44 +1,6 @@
 // Codex tests cover doctor contract api plugin behavior.
-import fs from "node:fs/promises";
+import { describe, expect, it } from "vitest";
-import os from "node:os";
+import { legacyConfigRules, normalizeCompatibilityConfig } from "./doctor-contract-api.js";
 import path from "node:path";
 import type { PluginStateKeyedStore } from "openclaw/plugin-sdk/plugin-state-runtime";
 import {
  createPluginStateKeyedStoreForTests,
  resetPluginStateStoreForTests,
 } from "openclaw/plugin-sdk/plugin-state-test-runtime";
 import type {
  OpenKeyedStoreOptions,
  PluginDoctorStateMigrationContext,
 } from "openclaw/plugin-sdk/runtime-doctor";
 import { afterEach, describe, expect, it } from "vitest";
 import {
  legacyConfigRules,
  normalizeCompatibilityConfig,
  stateMigrations,
 } from "./doctor-contract-api.js";
 import {
  bindingStoreKey,
  CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
  CODEX_APP_SERVER_BINDING_NAMESPACE,
  type StoredCodexAppServerBinding,
 } from "./src/app-server/session-binding.js";
 import { legacyCodexConversationBindingId } from "./src/conversation-binding-data.js";
 function createDoctorContext(env: NodeJS.ProcessEnv): PluginDoctorStateMigrationContext {
  return {
    openPluginStateKeyedStore<T>(options: OpenKeyedStoreOptions) {
      return createPluginStateKeyedStoreForTests<T>("codex", {
        ...options,
        env: options.env ?? env,
      });
    },
  };
 }
 afterEach(() => {
  resetPluginStateStoreForTests();
 });
 describe("codex doctor contract", () => {
  it("reports the retired dynamic tools profile config key", () => {
@@ -80,856 +42,4 @@ describe("codex doctor contract", () => {
    });
    expect(original.plugins.entries.codex.config).toHaveProperty("codexDynamicToolsProfile");
  });
  it("imports shipped binding sidecars under session and legacy conversation identities", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "agents", "main", "sessions");
    const transcriptPath = path.join(sessionsDir, "session-current.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    const legacyBinding = {
      schemaVersion: 1,
      threadId: "thread-1",
      sessionFile: transcriptPath,
      updatedAt: "2026-01-01T00:00:00.000Z",
    };
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"session-current"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:session-1": {
          sessionId: "session-current",
          sessionFile: "session-current.jsonl",
          totalTokens: 42_000,
          totalTokensFresh: true,
          contextTokens: 258_400,
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(sidecarPath, JSON.stringify(legacyBinding), "utf8");
    const params = {
      config: {},
      env,
      stateDir,
      oauthDir: path.join(stateDir, "oauth"),
      context: createDoctorContext(env),
    };
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(migration.detectLegacyState(params)).resolves.toMatchObject({
      preview: [expect.stringContaining("legacy sidecar")],
    });
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-current",
          sessionKey: "agent:main:session-1",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-current",
      binding: { threadId: "thread-1" },
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(transcriptPath),
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      binding: {
        threadId: "thread-1",
        cwd: "",
        historyCoveredThrough: expect.any(String),
      },
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(transcriptPath),
        }),
      ),
    ).resolves.not.toHaveProperty("binding.nativeContextUsage");
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await expect(
      fs.readFile(path.join(sessionsDir, "sessions.json"), "utf8").then(JSON.parse),
    ).resolves.toMatchObject({
      "agent:main:session-1": { sessionId: "session-current", agentHarnessId: "codex" },
    });
    await fs.rm(`${sidecarPath}.migrated`);
    await fs.writeFile(sidecarPath, JSON.stringify(legacyBinding), "utf8");
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    const resetTranscript = path.join(sessionsDir, "session-before-reset.jsonl");
    const resetSidecar = `${resetTranscript}.codex-app-server.json`;
    await fs.writeFile(resetTranscript, '{"type":"session","id":"session-before-reset"}\n', "utf8");
    await fs.writeFile(
      resetSidecar,
      JSON.stringify({ schemaVersion: 1, threadId: "thread-before-reset" }),
      "utf8",
    );
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1 safe")],
      warnings: [expect.stringContaining("session owner could not be resolved")],
    });
    await expect(fs.access(resetSidecar)).resolves.toBeUndefined();
    await fs.rm(resetSidecar);
    const conflictingTranscript = path.join(sessionsDir, "session-2.jsonl");
    const conflictingSidecar = `${conflictingTranscript}.codex-app-server.json`;
    await fs.writeFile(conflictingTranscript, '{"type":"session","id":"session-2"}\n', "utf8");
    await fs.writeFile(
      conflictingSidecar,
      JSON.stringify({ schemaVersion: 1, threadId: "legacy-thread" }),
      "utf8",
    );
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:session-1": {
          sessionId: "session-1",
          sessionFile: "session-1.jsonl",
          updatedAt: Date.now(),
        },
        "agent:main:session-2": {
          sessionId: "session-2",
          sessionFile: "session-2.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    const conflictingSessionKey = bindingStoreKey({
      kind: "session",
      agentId: "main",
      sessionId: "session-2",
      sessionKey: "agent:main:session-2",
    });
    await store.register(conflictingSessionKey, {
      version: 1,
      state: "active",
      binding: {
        threadId: "legacy-thread",
        cwd: "/repo",
        historyCoveredThrough: "2026-01-01T00:00:00.000Z",
      },
    });
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({
      changes: [],
      warnings: [
        expect.stringContaining(`canonical plugin state changed at ${conflictingSessionKey}`),
      ],
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(conflictingTranscript),
        }),
      ),
    ).resolves.toBeUndefined();
    await expect(fs.access(conflictingSidecar)).resolves.toBeUndefined();
    await fs.rm(conflictingSidecar);
    const inverseTranscript = path.join(sessionsDir, "session-3.jsonl");
    const inverseSidecar = `${inverseTranscript}.codex-app-server.json`;
    const inverseConversationKey = bindingStoreKey({
      kind: "conversation",
      bindingId: legacyCodexConversationBindingId(inverseTranscript),
    });
    await fs.writeFile(inverseTranscript, '{"type":"session","id":"session-3"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:session-3": {
          sessionId: "session-3",
          sessionFile: "session-3.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      inverseSidecar,
      JSON.stringify({ schemaVersion: 1, threadId: "session-thread" }),
      "utf8",
    );
    await store.register(inverseConversationKey, {
      version: 1,
      state: "active",
      binding: { threadId: "conversation-thread", cwd: "/repo" },
    });
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-3",
          sessionKey: "agent:main:session-3",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-3",
      binding: { threadId: "conversation-thread" },
    });
    await expect(store.lookup(inverseConversationKey)).resolves.toMatchObject({
      state: "active",
      binding: { threadId: "conversation-thread" },
    });
    await expect(fs.access(`${inverseSidecar}.migrated`)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("does not publish Codex session ownership before every binding row persists", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-order-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "agents", "main", "sessions");
    const transcriptPath = path.join(sessionsDir, "session-order.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    const storePath = path.join(sessionsDir, "sessions.json");
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"session-order"}\n', "utf8");
    await fs.writeFile(
      storePath,
      JSON.stringify({
        "agent:main:order": {
          sessionId: "session-order",
          sessionFile: "session-order.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({ schemaVersion: 1, threadId: "thread-order" }),
      "utf8",
    );
    const store = createPluginStateKeyedStoreForTests<StoredCodexAppServerBinding>("codex", {
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
      env,
    });
    const registerIfAbsent = store.registerIfAbsent.bind(store);
    let registerCalls = 0;
    const failingStore: PluginStateKeyedStore<StoredCodexAppServerBinding> = {
      ...store,
      async registerIfAbsent(key, value, opts) {
        registerCalls++;
        if (registerCalls === 2) {
          throw new Error("injected session binding write failure");
        }
        return await registerIfAbsent(key, value, opts);
      },
    };
    const failingContext: PluginDoctorStateMigrationContext = {
      openPluginStateKeyedStore<T>() {
        return failingStore as unknown as PluginStateKeyedStore<T>;
      },
    };
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(
      migration.migrateLegacyState({
        config: {},
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: failingContext,
      }),
    ).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1 safe")],
      warnings: [expect.stringContaining("injected session binding write failure")],
    });
    await expect(fs.readFile(storePath, "utf8").then(JSON.parse)).resolves.toMatchObject({
      "agent:main:order": { sessionId: "session-order" },
    });
    expect(
      (JSON.parse(await fs.readFile(storePath, "utf8")) as Record<string, Record<string, unknown>>)[
        "agent:main:order"
      ],
    ).not.toHaveProperty("agentHarnessId");
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-order",
          sessionKey: "agent:main:order",
        }),
      ),
    ).resolves.toBeUndefined();
    await expect(fs.access(sidecarPath)).resolves.toBeUndefined();
    await expect(
      migration.migrateLegacyState({
        config: {},
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    await expect(fs.readFile(storePath, "utf8").then(JSON.parse)).resolves.toMatchObject({
      "agent:main:order": {
        sessionId: "session-order",
        agentHarnessId: "codex",
      },
    });
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("retains a shipped binding when its session now belongs to another harness", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-owner-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "agents", "main", "sessions");
    const transcriptPath = path.join(sessionsDir, "session-foreign.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"session-foreign"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:foreign": {
          sessionId: "session-foreign",
          sessionFile: "session-foreign.jsonl",
          agentHarnessId: "openclaw",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({
        schemaVersion: 1,
        threadId: "thread-foreign",
        sessionFile: transcriptPath,
      }),
      "utf8",
    );
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(
      migration.migrateLegacyState({
        config: {},
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toMatchObject({
      changes: [],
      warnings: [expect.stringContaining("owned by agent harness openclaw")],
    });
    await expect(fs.access(sidecarPath)).resolves.toBeUndefined();
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-foreign",
          sessionKey: "agent:main:foreign",
        }),
      ),
    ).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("imports sidecars from the pre-agent session directory before core moves it", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-legacy-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "sessions");
    const transcriptPath = path.join(sessionsDir, "legacy-session.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"legacy-session"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:legacy": {
          sessionId: "legacy-session",
          sessionFile: "legacy-session.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({
        schemaVersion: 1,
        threadId: "legacy-thread",
        sessionFile: transcriptPath,
      }),
      "utf8",
    );
    const params = {
      config: {},
      env,
      stateDir,
      oauthDir: path.join(stateDir, "oauth"),
      context: createDoctorContext(env),
    };
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(migration.migrateLegacyState(params)).resolves.toMatchObject({ warnings: [] });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "legacy-session",
          sessionKey: "agent:main:legacy",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "legacy-session",
      binding: { threadId: "legacy-thread" },
    });
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await expect(
      fs.readFile(path.join(sessionsDir, "sessions.json"), "utf8").then(JSON.parse),
    ).resolves.toMatchObject({
      "agent:main:legacy": { sessionId: "legacy-session", agentHarnessId: "codex" },
    });
  });
  it("uses the session index when a shipped sidecar transcript is missing", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "agents", "main", "sessions");
    const transcriptPath = path.join(sessionsDir, "missing.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:missing": {
          sessionId: "session-missing",
          sessionFile: "missing.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({
        schemaVersion: 1,
        threadId: "thread-legacy-conversation",
        sessionFile: transcriptPath,
      }),
      "utf8",
    );
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(
      migration.migrateLegacyState({
        config: {},
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(transcriptPath),
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      binding: { threadId: "thread-legacy-conversation" },
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-missing",
          sessionKey: "agent:main:missing",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-missing",
      binding: { threadId: "thread-legacy-conversation" },
    });
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("imports a binding without crawling Codex rollout files", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-"));
    const env = { ...process.env, OPENCLAW_STATE_DIR: stateDir };
    const sessionsDir = path.join(stateDir, "agents", "main", "sessions");
    const transcriptPath = path.join(sessionsDir, "session-fresh.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"session-fresh"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:main:fresh": {
          sessionId: "session-fresh",
          sessionFile: "session-fresh.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({ schemaVersion: 1, threadId: "thread-without-rollout" }),
      "utf8",
    );
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(
      migration.migrateLegacyState({
        config: {},
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toEqual({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    const targetKey = bindingStoreKey({
      kind: "conversation",
      bindingId: legacyCodexConversationBindingId(transcriptPath),
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "main",
          sessionId: "session-fresh",
          sessionKey: "agent:main:fresh",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-fresh",
      binding: { threadId: "thread-without-rollout" },
    });
    await expect(store.lookup(targetKey)).resolves.toMatchObject({
      state: "active",
      binding: { threadId: "thread-without-rollout" },
    });
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("retains an ambiguous sidecar and converges after its owner resolves", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-"));
    const env = { ...process.env, HOME: stateDir, OPENCLAW_STATE_DIR: stateDir };
    const config = {
      agents: { list: [{ id: "alpha" }, { id: "beta" }] },
      session: { store: "~/shared/sessions.json" },
    };
    const sessionsDir = path.join(stateDir, "shared");
    const transcriptPath = path.join(sessionsDir, "ambiguous.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(sessionsDir, { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"message"}\n', "utf8");
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({
        schemaVersion: 1,
        threadId: "thread-ambiguous",
        sessionFile: transcriptPath,
      }),
      "utf8",
    );
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await expect(
      migration.migrateLegacyState({
        config,
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1 safe")],
      warnings: [expect.stringContaining("session owner could not be resolved")],
    });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(transcriptPath),
        }),
      ),
    ).resolves.toMatchObject({ state: "active", binding: { threadId: "thread-ambiguous" } });
    await expect(fs.access(sidecarPath)).resolves.toBeUndefined();
    const conversationKey = bindingStoreKey({
      kind: "conversation",
      bindingId: legacyCodexConversationBindingId(transcriptPath),
    });
    const imported = await store.lookup(conversationKey);
    if (imported?.state !== "active") {
      throw new Error("missing imported Codex conversation binding");
    }
    await store.register(conversationKey, {
      ...imported,
      binding: { ...imported.binding, threadId: "thread-recovered" },
    });
    await expect(
      migration.migrateLegacyState({
        config,
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toEqual({
      changes: [],
      warnings: [expect.stringContaining("session owner could not be resolved")],
    });
    await expect(store.lookup(conversationKey)).resolves.toMatchObject({
      state: "active",
      binding: { threadId: "thread-recovered" },
    });
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:alpha:ambiguous": {
          sessionId: "session-ambiguous",
          sessionFile: "ambiguous.jsonl",
          totalTokens: 12_345,
          totalTokensFresh: true,
          contextTokens: 128_000,
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await expect(
      migration.migrateLegacyState({
        config,
        env,
        stateDir,
        oauthDir: path.join(stateDir, "oauth"),
        context: createDoctorContext(env),
      }),
    ).resolves.toMatchObject({
      changes: [expect.stringContaining("Migrated 1")],
      warnings: [],
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "alpha",
          sessionId: "session-ambiguous",
          sessionKey: "agent:alpha:ambiguous",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-ambiguous",
      binding: { threadId: "thread-recovered" },
    });
    await expect(store.lookup(conversationKey)).resolves.toMatchObject({
      state: "active",
      binding: {
        threadId: "thread-recovered",
      },
    });
    await expect(store.lookup(conversationKey)).resolves.not.toHaveProperty(
      "binding.nativeContextUsage",
    );
    await expect(fs.access(`${sidecarPath}.migrated`)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
  });
  it("uses canonical custom-store, agent, and nested transcript path resolution", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-doctor-"));
    const customStoreRoot = await fs.mkdtemp(
      path.join(os.tmpdir(), "openclaw-codex-custom-store-"),
    );
    const env = { ...process.env, HOME: stateDir, OPENCLAW_STATE_DIR: stateDir };
    const config = {
      agents: { list: [{ id: "alpha" }] },
      session: { store: path.join(customStoreRoot, "{agentId}", "sessions.json") },
    };
    const sessionsDir = path.join(customStoreRoot, "alpha");
    const transcriptPath = path.join(sessionsDir, "nested", "session-custom.jsonl");
    const sidecarPath = `${transcriptPath}.codex-app-server.json`;
    await fs.mkdir(path.dirname(transcriptPath), { recursive: true });
    await fs.writeFile(transcriptPath, '{"type":"session","id":"session-custom"}\n', "utf8");
    await fs.writeFile(
      path.join(sessionsDir, "sessions.json"),
      JSON.stringify({
        "agent:alpha:custom": {
          sessionId: "session-custom",
          sessionFile: "nested/session-custom.jsonl",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    await fs.writeFile(
      sidecarPath,
      JSON.stringify({ schemaVersion: 1, threadId: "thread-custom" }),
      "utf8",
    );
    const unrelatedSidecar = path.join(
      customStoreRoot,
      "unrelated",
      `not-a-session.jsonl.codex-app-server.json`,
    );
    await fs.mkdir(path.dirname(unrelatedSidecar), { recursive: true });
    await fs.writeFile(
      unrelatedSidecar,
      JSON.stringify({ schemaVersion: 1, threadId: "unrelated-thread" }),
      "utf8",
    );
    const migration = stateMigrations[0];
    if (!migration) {
      throw new Error("missing Codex binding migration");
    }
    await migration.migrateLegacyState({
      config,
      env,
      stateDir,
      oauthDir: path.join(stateDir, "oauth"),
      context: createDoctorContext(env),
    });
    const store = createDoctorContext(env).openPluginStateKeyedStore<StoredCodexAppServerBinding>({
      namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
      maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
      overflowPolicy: "reject-new",
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "session",
          agentId: "alpha",
          sessionId: "session-custom",
          sessionKey: "agent:alpha:custom",
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      sessionId: "session-custom",
      binding: { threadId: "thread-custom" },
    });
    await expect(
      store.lookup(
        bindingStoreKey({
          kind: "conversation",
          bindingId: legacyCodexConversationBindingId(transcriptPath),
        }),
      ),
    ).resolves.toMatchObject({
      state: "active",
      binding: { threadId: "thread-custom" },
    });
    await expect(fs.access(unrelatedSidecar)).resolves.toBeUndefined();
    await fs.rm(stateDir, { recursive: true, force: true });
    await fs.rm(customStoreRoot, { recursive: true, force: true });
  });
 });
--- a/extensions/codex/doctor-contract-api.ts
+++ b/extensions/codex/doctor-contract-api.ts
@@ -1,4 +1,7 @@
-/** Doctor contract hooks for Codex config, state migration, and route ownership. */
+/**
 * Doctor contract hooks for Codex plugin config migrations and session-route
 * ownership warnings.
 */
 import type { OpenClawConfig } from "openclaw/plugin-sdk/config-contracts";
 import type { DoctorSessionRouteStateOwner } from "openclaw/plugin-sdk/runtime-doctor";
@@ -28,7 +31,9 @@ export const legacyConfigRules: LegacyConfigRule[] = [
  },
 ];
-/** Removes retired Codex plugin config keys while preserving unrelated config. */
+/**
 * Removes retired Codex plugin config keys while preserving unrelated config.
 */
 export function normalizeCompatibilityConfig({ cfg }: { cfg: OpenClawConfig }): {
  config: OpenClawConfig;
  changes: string[];
@@ -42,9 +47,10 @@ export function normalizeCompatibilityConfig({ cfg }: { cfg: OpenClawConfig }):
  const nextConfig = structuredClone(cfg) as OpenClawConfig & {
    plugins?: Record<string, unknown>;
  };
-  const nextPluginConfig = asRecord(
+  const nextPlugins = asRecord(nextConfig.plugins);
-    asRecord(asRecord(asRecord(nextConfig.plugins)?.entries)?.codex)?.config,
+  const nextEntries = asRecord(nextPlugins?.entries);
-  );
+  const nextEntry = asRecord(nextEntries?.codex);
  const nextPluginConfig = asRecord(nextEntry?.config);
  if (!nextPluginConfig) {
    return { config: cfg, changes: [] };
  }
@@ -69,5 +75,3 @@ export const sessionRouteStateOwners: DoctorSessionRouteStateOwner[] = [
    authProfilePrefixes: ["codex:", "codex-cli:", "openai-codex:"],
  },
 ];
 export { stateMigrations } from "./src/migration/session-binding-sidecars.js";
--- a/extensions/codex/harness.test.ts
+++ b/extensions/codex/harness.test.ts
@@ -1,18 +1,9 @@
 // Codex tests cover harness plugin behavior.
 import fs from "node:fs/promises";
 import os from "node:os";
 import path from "node:path";
 import { describe, expect, it } from "vitest";
 import { createCodexAppServerAgentHarness } from "./harness.js";
 import {
  createCodexTestBindingStore,
  testCodexAppServerBindingStore,
 } from "./src/app-server/session-binding.test-helpers.js";
 describe("Codex agent harness supports()", () => {
-  const harness = createCodexAppServerAgentHarness({
+  const harness = createCodexAppServerAgentHarness();
    bindingStore: testCodexAppServerBindingStore,
  });
  it("supports the canonical codex virtual provider", () => {
    expect(harness.supports({ provider: "codex", requestedRuntime: "codex" })).toEqual({
@@ -49,149 +40,8 @@ describe("Codex agent harness supports()", () => {
  });
  it("honors explicit provider id overrides", () => {
-    const narrowHarness = createCodexAppServerAgentHarness({
+    const narrowHarness = createCodexAppServerAgentHarness({ providerIds: ["codex"] });
      bindingStore: testCodexAppServerBindingStore,
      providerIds: ["codex"],
    });
    const result = narrowHarness.supports({ provider: "openai", requestedRuntime: "codex" });
    expect(result.supported).toBe(false);
  });
 });
 describe("Codex agent harness reset", () => {
  it("uses the host agent for global session keys", async () => {
    const bindingStore = createCodexTestBindingStore();
    const harness = createCodexAppServerAgentHarness({ bindingStore });
    const identity = {
      kind: "session" as const,
      agentId: "work",
      sessionId: "session-1",
      sessionKey: "global",
    };
    await bindingStore.mutate(identity, {
      kind: "set",
      binding: { threadId: "thread-work", cwd: "/repo" },
    });
    await harness.reset?.({
      agentId: "work",
      sessionId: "session-1",
      sessionKey: "global",
      reason: "reset",
    });
    await expect(bindingStore.read(identity)).resolves.toBeUndefined();
    await expect(
      bindingStore.mutate(identity, {
        kind: "set",
        binding: { threadId: "thread-stale", cwd: "/stale" },
      }),
    ).resolves.toBe(false);
    const nextIdentity = { ...identity, sessionId: "session-2" };
    await expect(
      bindingStore.mutate(nextIdentity, {
        kind: "set",
        binding: { threadId: "thread-next", cwd: "/next" },
      }),
    ).resolves.toBe(false);
    await expect(
      bindingStore.mutate(nextIdentity, {
        kind: "reclaim-generation",
        expectedPreviousSessionId: identity.sessionId,
      }),
    ).resolves.toBe(true);
    await expect(
      bindingStore.mutate(nextIdentity, {
        kind: "set",
        binding: { threadId: "thread-next", cwd: "/next" },
      }),
    ).resolves.toBe(true);
    await expect(bindingStore.read(nextIdentity)).resolves.toMatchObject({
      threadId: "thread-next",
    });
  });
  it("accepts an absent binding but rejects a mismatched reset generation", async () => {
    const bindingStore = createCodexTestBindingStore();
    const harness = createCodexAppServerAgentHarness({ bindingStore });
    const current = {
      kind: "session" as const,
      agentId: "main",
      sessionId: "session-1",
      sessionKey: "agent:main:main",
    };
    await expect(
      harness.reset?.({
        agentId: "main",
        sessionId: "missing-session",
        sessionKey: "agent:main:missing",
        reason: "reset",
      }),
    ).resolves.toBeUndefined();
    await bindingStore.mutate(current, {
      kind: "set",
      binding: { threadId: "thread-1", cwd: "/repo" },
    });
    await expect(
      harness.reset?.({
        agentId: "main",
        sessionId: "session-2",
        sessionKey: current.sessionKey,
        reason: "reset",
      }),
    ).rejects.toThrow("binding generation changed");
    await expect(bindingStore.read(current)).resolves.toMatchObject({ threadId: "thread-1" });
  });
  it("reclaims a stale generation left while the Codex plugin was unavailable", async () => {
    const stateDir = await fs.mkdtemp(path.join(os.tmpdir(), "openclaw-codex-reset-"));
    const storePath = path.join(stateDir, "sessions.json");
    const sessionKey = "agent:main:main";
    await fs.writeFile(
      storePath,
      JSON.stringify({
        [sessionKey]: {
          sessionId: "session-2",
          updatedAt: Date.now(),
        },
      }),
      "utf8",
    );
    const bindingStore = createCodexTestBindingStore();
    const harness = createCodexAppServerAgentHarness({
      bindingStore,
      resolveConfig: () => ({ session: { store: storePath } }),
    });
    const stale = {
      kind: "session" as const,
      agentId: "main",
      sessionId: "session-1",
      sessionKey,
    };
    await bindingStore.mutate(stale, {
      kind: "set",
      binding: { threadId: "thread-stale", cwd: "/repo" },
    });
    await expect(
      harness.reset?.({
        agentId: "main",
        sessionId: "session-2",
        sessionKey,
        reason: "reset",
      }),
    ).resolves.toBeUndefined();
    const current = { ...stale, sessionId: "session-2" };
    await expect(bindingStore.read(current)).resolves.toBeUndefined();
    await expect(
      bindingStore.mutate(current, {
        kind: "set",
        binding: { threadId: "thread-delayed", cwd: "/repo" },
      }),
    ).resolves.toBe(false);
    await fs.rm(stateDir, { recursive: true, force: true });
  });
 });
--- a/extensions/codex/harness.ts
+++ b/extensions/codex/harness.ts
@@ -7,13 +7,11 @@ import type {
  AgentHarnessCompactResult,
  ContextEngineHostCapability,
 } from "openclaw/plugin-sdk/agent-harness-runtime";
 import type { OpenClawConfig } from "openclaw/plugin-sdk/config-contracts";
 import type {
  CodexAppServerListModelsOptions,
  CodexAppServerModel,
  CodexAppServerModelListResult,
 } from "./src/app-server/models.js";
 import type { CodexAppServerBindingStore } from "./src/app-server/session-binding.js";
 const DEFAULT_CODEX_HARNESS_PROVIDER_IDS = new Set(["codex", "openai"]);
 const CODEX_APP_SERVER_CONTEXT_ENGINE_HOST_CAPABILITIES = [
@@ -39,14 +37,12 @@ type CodexAppServerAgentHarness = AgentHarness & {
 * Creates the Codex app-server harness used for attempts, side questions,
 * compaction, reset, and disposal.
 */
-export function createCodexAppServerAgentHarness(options: {
+export function createCodexAppServerAgentHarness(options?: {
  id?: string;
  label?: string;
  providerIds?: Iterable<string>;
  pluginConfig?: unknown;
  resolvePluginConfig?: () => unknown;
  resolveConfig?: () => OpenClawConfig | undefined;
  bindingStore: CodexAppServerBindingStore;
 }): AgentHarness {
  const providerIds = new Set(
    [...(options?.providerIds ?? DEFAULT_CODEX_HARNESS_PROVIDER_IDS)].map((id) =>
@@ -75,7 +71,6 @@ export function createCodexAppServerAgentHarness(options: {
      // cold provider catalog reads do not pull in the whole Codex runtime.
      const { runCodexAppServerAttempt } = await import("./src/app-server/run-attempt.js");
      return runCodexAppServerAttempt(params, {
        bindingStore: options.bindingStore,
        pluginConfig: options?.resolvePluginConfig?.() ?? options?.pluginConfig,
        nativeHookRelay: { enabled: true },
      });
@@ -83,7 +78,6 @@ export function createCodexAppServerAgentHarness(options: {
    runSideQuestion: async (params) => {
      const { runCodexAppServerSideQuestion } = await import("./src/app-server/side-question.js");
      return runCodexAppServerSideQuestion(params, {
        bindingStore: options.bindingStore,
        pluginConfig: options?.resolvePluginConfig?.() ?? options?.pluginConfig,
        nativeHookRelay: { enabled: true },
      });
@@ -91,43 +85,20 @@ export function createCodexAppServerAgentHarness(options: {
    compact: async (params) => {
      const { maybeCompactCodexAppServerSession } = await import("./src/app-server/compact.js");
      return maybeCompactCodexAppServerSession(params, {
        bindingStore: options.bindingStore,
        pluginConfig: options?.resolvePluginConfig?.() ?? options?.pluginConfig,
      });
    },
    compactAfterContextEngine: async (params) => {
      const { maybeCompactCodexAppServerSession } = await import("./src/app-server/compact.js");
      return maybeCompactCodexAppServerSession(params, {
        bindingStore: options.bindingStore,
        pluginConfig: options?.resolvePluginConfig?.() ?? options?.pluginConfig,
        allowNonManualNativeRequest: true,
      });
    },
    reset: async (params) => {
-      if (params.sessionId) {
+      if (params.sessionFile) {
-        const { reclaimCurrentCodexSessionGeneration, sessionBindingIdentity } =
+        const { clearCodexAppServerBinding } = await import("./src/app-server/session-binding.js");
-          await import("./src/app-server/session-binding.js");
+        await clearCodexAppServerBinding(params.sessionFile);
        const identity = sessionBindingIdentity({
          agentId: params.agentId,
          sessionId: params.sessionId,
          sessionKey: params.sessionKey,
        });
        let retired = await options.bindingStore.retireSessionGeneration(identity);
        if (retired === "conflict") {
          const reclaimed = await reclaimCurrentCodexSessionGeneration({
            bindingStore: options.bindingStore,
            identity,
            config: options.resolveConfig?.(),
          });
          if (reclaimed) {
            retired = await options.bindingStore.retireSessionGeneration(identity);
          }
        }
        if (retired === "conflict") {
          throw new Error(
            `Codex binding generation changed before session ${params.sessionId} could reset`,
          );
        }
      }
    },
    dispose: async () => {
--- a/extensions/codex/index.test.ts
+++ b/extensions/codex/index.test.ts
@@ -4,30 +4,10 @@ import { createTestPluginApi } from "openclaw/plugin-sdk/plugin-test-api";
 import { describe, expect, it, vi } from "vitest";
 import { createCodexAppServerAgentHarness } from "./harness.js";
 import plugin from "./index.js";
 import {
  createCodexAppServerBindingStore,
  sessionBindingIdentity,
 } from "./src/app-server/session-binding.js";
 import {
  createCodexTestBindingStateStore,
  testCodexAppServerBindingStore,
 } from "./src/app-server/session-binding.test-helpers.js";
 const runCodexAppServerAttemptMock = vi.hoisted(() => vi.fn());
 const runCodexAppServerSideQuestionMock = vi.hoisted(() => vi.fn());
 function createCodexTestRuntime(
  current?: () => unknown,
  stateStore = createCodexTestBindingStateStore(),
 ) {
  return {
    ...(current ? { config: { current } } : {}),
    state: {
      openSyncKeyedStore: () => stateStore,
    },
  } as never;
 }
 vi.mock("./src/app-server/run-attempt.js", () => ({
  runCodexAppServerAttempt: runCodexAppServerAttemptMock,
 }));
@@ -59,6 +39,7 @@ describe("codex plugin", () => {
    const registerMigrationProvider = vi.fn();
    const registerProvider = vi.fn();
    const on = vi.fn();
    const onConversationBindingResolved = vi.fn();
    plugin.register(
      createTestPluginApi({
@@ -67,13 +48,14 @@ describe("codex plugin", () => {
        source: "test",
        config: {},
        pluginConfig: {},
-        runtime: createCodexTestRuntime(),
+        runtime: {} as never,
        registerAgentHarness,
        registerCommand,
        registerMediaUnderstandingProvider,
        registerMigrationProvider,
        registerProvider,
        on,
        onConversationBindingResolved,
      }),
    );
@@ -83,6 +65,9 @@ describe("codex plugin", () => {
      | Record<string, unknown>
      | undefined;
    const inboundClaimRegistration = mockCall(on) as [unknown, unknown] | undefined;
    const bindingResolvedRegistration = mockCall(onConversationBindingResolved) as
      | [unknown]
      | undefined;
    expect(providerRegistration.id).toBe("codex");
    expect(providerRegistration.label).toBe("Codex");
@@ -109,12 +94,33 @@ describe("codex plugin", () => {
    expect(migrationRegistration?.label).toBe("Codex");
    expect(inboundClaimRegistration?.[0]).toBe("inbound_claim");
    expect(typeof inboundClaimRegistration?.[1]).toBe("function");
    expect(typeof bindingResolvedRegistration?.[0]).toBe("function");
  });
  it("registers with capture APIs that do not expose conversation binding hooks yet", () => {
    const registerProvider = vi.fn();
    const api = createTestPluginApi({
      id: "codex",
      name: "Codex",
      source: "test",
      config: {},
      pluginConfig: {},
      runtime: {} as never,
      registerAgentHarness: vi.fn(),
      registerCommand: vi.fn(),
      registerMediaUnderstandingProvider: vi.fn(),
      registerProvider,
      on: vi.fn(),
    });
    delete (api as { onConversationBindingResolved?: unknown }).onConversationBindingResolved;
    plugin.register(api);
    expect(registerProvider).toHaveBeenCalledTimes(1);
    expect((mockCallArg(registerProvider) as { id?: string } | undefined)?.id).toBe("codex");
  });
  it("claims the Codex routing providers by default", () => {
-    const harness = createCodexAppServerAgentHarness({
+    const harness = createCodexAppServerAgentHarness();
      bindingStore: testCodexAppServerBindingStore,
    });
    expect(harness.deliveryDefaults?.sourceVisibleReplies).toBe("message_tool");
    expect(
@@ -135,196 +141,8 @@ describe("codex plugin", () => {
    expect(unsupported.supported).toBe(false);
  });
  it("clears only ended session binding rows in the owning agent scope", async () => {
    const stateStore = createCodexTestBindingStateStore();
    const bindingStore = createCodexAppServerBindingStore(stateStore);
    const on = vi.fn();
    plugin.register(
      createTestPluginApi({
        id: "codex",
        name: "Codex",
        source: "test",
        config: {},
        pluginConfig: {},
        runtime: createCodexTestRuntime(undefined, stateStore),
        registerAgentHarness: vi.fn(),
        registerCommand: vi.fn(),
        registerMediaUnderstandingProvider: vi.fn(),
        registerMigrationProvider: vi.fn(),
        registerProvider: vi.fn(),
        on,
      }),
    );
    const sessionEnd = on.mock.calls.find(([name]) => name === "session_end")?.[1] as
      | ((
          event: { sessionId: string; sessionKey?: string; reason?: string },
          ctx: { agentId?: string; sessionId: string; sessionKey?: string },
        ) => Promise<void>)
      | undefined;
    if (!sessionEnd) {
      throw new Error("missing Codex session_end hook");
    }
    const identity = sessionBindingIdentity({
      agentId: "worker",
      sessionId: "session-1",
      sessionKey: "agent:worker:session-1",
    });
    const setBinding = () =>
      bindingStore.mutate(identity, {
        kind: "set",
        binding: { threadId: "thread-1", cwd: "/repo" },
      });
    for (const reason of ["shutdown", "restart", "compaction", "unknown"] as const) {
      await setBinding();
      await sessionEnd(
        { sessionId: "session-1", sessionKey: "agent:worker:session-1", reason },
        { agentId: "worker", sessionId: "session-1" },
      );
      await expect(bindingStore.read(identity)).resolves.toMatchObject({
        threadId: "thread-1",
      });
    }
    for (const reason of ["new", "reset", "idle", "daily", "deleted"] as const) {
      await setBinding();
      await sessionEnd(
        { sessionId: "session-1", sessionKey: "agent:worker:session-1", reason },
        { agentId: "worker", sessionId: "session-1" },
      );
      await expect(bindingStore.read(identity)).resolves.toBeUndefined();
    }
  });
  it("adopts compaction successors before delayed lifecycle cleanup", async () => {
    const stateStore = createCodexTestBindingStateStore();
    const bindingStore = createCodexAppServerBindingStore(stateStore);
    const on = vi.fn();
    plugin.register(
      createTestPluginApi({
        id: "codex",
        name: "Codex",
        source: "test",
        config: {},
        pluginConfig: {},
        runtime: createCodexTestRuntime(undefined, stateStore),
        registerAgentHarness: vi.fn(),
        registerCommand: vi.fn(),
        registerMediaUnderstandingProvider: vi.fn(),
        registerMigrationProvider: vi.fn(),
        registerProvider: vi.fn(),
        on,
      }),
    );
    const afterCompaction = on.mock.calls.find(([name]) => name === "after_compaction")?.[1] as
      | ((
          event: {
            messageCount: number;
            compactedCount: number;
            previousSessionId?: string;
          },
          ctx: { agentId?: string; sessionId?: string; sessionKey?: string },
        ) => Promise<void>)
      | undefined;
    const sessionEnd = on.mock.calls.find(([name]) => name === "session_end")?.[1] as
      | ((
          event: { sessionId: string; sessionKey?: string; reason?: string },
          ctx: { agentId?: string; sessionId: string; sessionKey?: string },
        ) => Promise<void>)
      | undefined;
    if (!afterCompaction || !sessionEnd) {
      throw new Error("missing Codex compaction lifecycle hooks");
    }
    const sessionKey = "agent:worker:telegram:chat-1";
    const previous = sessionBindingIdentity({
      agentId: "worker",
      sessionId: "session-1",
      sessionKey,
    });
    const successor = sessionBindingIdentity({
      agentId: "worker",
      sessionId: "session-2",
      sessionKey,
    });
    const newest = sessionBindingIdentity({
      agentId: "worker",
      sessionId: "session-3",
      sessionKey,
    });
    await bindingStore.mutate(previous, {
      kind: "set",
      binding: { threadId: "thread-1", cwd: "/repo" },
    });
    await afterCompaction(
      { messageCount: 1, compactedCount: 1, previousSessionId: "session-1" },
      { agentId: "worker", sessionId: "session-2", sessionKey },
    );
    await expect(bindingStore.read(previous)).resolves.toBeUndefined();
    await expect(bindingStore.read(successor)).resolves.toMatchObject({ threadId: "thread-1" });
    await afterCompaction(
      { messageCount: 1, compactedCount: 1, previousSessionId: "session-2" },
      { agentId: "worker", sessionId: "session-3", sessionKey },
    );
    await afterCompaction(
      { messageCount: 1, compactedCount: 1, previousSessionId: "session-1" },
      { agentId: "worker", sessionId: "session-2", sessionKey },
    );
    await expect(bindingStore.read(successor)).resolves.toBeUndefined();
    await expect(bindingStore.read(newest)).resolves.toMatchObject({ threadId: "thread-1" });
    await sessionEnd(
      { sessionId: "session-1", sessionKey, reason: "reset" },
      { agentId: "worker", sessionId: "session-1", sessionKey },
    );
    await sessionEnd(
      { sessionId: "session-2", sessionKey, reason: "compaction" },
      { agentId: "worker", sessionId: "session-2", sessionKey },
    );
    await expect(bindingStore.read(newest)).resolves.toMatchObject({ threadId: "thread-1" });
    expect(stateStore.entries()).toHaveLength(1);
  });
  it("ignores compaction for a session without a Codex binding", async () => {
    const warn = vi.fn();
    const on = vi.fn();
    plugin.register(
      createTestPluginApi({
        id: "codex",
        name: "Codex",
        source: "test",
        config: {},
        pluginConfig: {},
        logger: { debug: vi.fn(), info: vi.fn(), warn, error: vi.fn() },
        runtime: createCodexTestRuntime(),
        registerAgentHarness: vi.fn(),
        registerCommand: vi.fn(),
        registerMediaUnderstandingProvider: vi.fn(),
        registerMigrationProvider: vi.fn(),
        registerProvider: vi.fn(),
        on,
      }),
    );
    const afterCompaction = on.mock.calls.find(([name]) => name === "after_compaction")?.[1] as
      | ((event: object, ctx: { sessionId?: string; sessionKey?: string }) => Promise<void>)
      | undefined;
    if (!afterCompaction) {
      throw new Error("missing Codex after_compaction hook");
    }
    await afterCompaction(
      { previousSessionId: "session-1" },
      { sessionId: "session-2", sessionKey: "agent:main:main" },
    );
    expect(warn).not.toHaveBeenCalled();
  });
  it("enables the native hook relay for public Codex app-server attempts", async () => {
-    const harness = createCodexAppServerAgentHarness({
+    const harness = createCodexAppServerAgentHarness({ pluginConfig: { appServer: {} } });
      bindingStore: testCodexAppServerBindingStore,
      pluginConfig: { appServer: {} },
    });
    const result = { success: true };
    runCodexAppServerAttemptMock.mockResolvedValueOnce(result);
@@ -333,7 +151,6 @@ describe("codex plugin", () => {
    expect(runCodexAppServerAttemptMock).toHaveBeenCalledWith(
      { prompt: "hello" },
      {
        bindingStore: testCodexAppServerBindingStore,
        pluginConfig: { appServer: {} },
        nativeHookRelay: { enabled: true },
      },
@@ -368,7 +185,11 @@ describe("codex plugin", () => {
        source: "test",
        config: {},
        pluginConfig: { codexPlugins: { enabled: false } },
-        runtime: createCodexTestRuntime(() => liveConfig),
+        runtime: {
          config: {
            current: () => liveConfig,
          },
        } as never,
        registerAgentHarness,
        registerCommand: vi.fn(),
        registerMediaUnderstandingProvider: vi.fn(),
@@ -388,49 +209,14 @@ describe("codex plugin", () => {
    expect(runCodexAppServerAttemptMock).toHaveBeenCalledWith(
      { prompt: "calendar" },
      {
        bindingStore: expect.any(Object),
        pluginConfig: liveConfig.plugins.entries.codex.config,
        nativeHookRelay: { enabled: true },
      },
    );
  });
  it("does not resurrect startup Codex config after the live entry is removed", async () => {
    const registerAgentHarness = vi.fn();
    plugin.register(
      createTestPluginApi({
        id: "codex",
        name: "Codex",
        source: "test",
        config: {},
        pluginConfig: { appServer: { mode: "yolo" } },
        runtime: createCodexTestRuntime(() => ({ plugins: { entries: {} } })),
        registerAgentHarness,
        registerCommand: vi.fn(),
        registerMediaUnderstandingProvider: vi.fn(),
        registerMigrationProvider: vi.fn(),
        registerProvider: vi.fn(),
        on: vi.fn(),
      }),
    );
    const harness = mockCallArg(registerAgentHarness) as ReturnType<
      typeof createCodexAppServerAgentHarness
    >;
    runCodexAppServerAttemptMock.mockResolvedValueOnce({ success: true });
    await harness.runAttempt({ prompt: "default policy" } as never);
    expect(runCodexAppServerAttemptMock).toHaveBeenCalledWith(
      { prompt: "default policy" },
      expect.objectContaining({ pluginConfig: undefined }),
    );
  });
  it("enables the native hook relay for public Codex side questions", async () => {
-    const harness = createCodexAppServerAgentHarness({
+    const harness = createCodexAppServerAgentHarness({ pluginConfig: { appServer: {} } });
      bindingStore: testCodexAppServerBindingStore,
      pluginConfig: { appServer: {} },
    });
    const runSideQuestion = harness["runSideQuestion"];
    const result = { text: "ok" };
    runCodexAppServerSideQuestionMock.mockResolvedValueOnce(result);
@@ -443,7 +229,6 @@ describe("codex plugin", () => {
    expect(runCodexAppServerSideQuestionMock).toHaveBeenCalledWith(
      { question: "btw" },
      {
        bindingStore: testCodexAppServerBindingStore,
        pluginConfig: { appServer: {} },
        nativeHookRelay: { enabled: true },
      },
--- a/extensions/codex/index.ts
+++ b/extensions/codex/index.ts
@@ -4,71 +4,47 @@
 */
 import type { OpenClawConfig } from "openclaw/plugin-sdk/config-contracts";
 import { mutateConfigFile } from "openclaw/plugin-sdk/config-mutation";
-import {
+import { resolveLivePluginConfigObject } from "openclaw/plugin-sdk/plugin-config-runtime";
  resolveLivePluginConfigObject,
  resolvePluginConfigObject,
 } from "openclaw/plugin-sdk/plugin-config-runtime";
 import { definePluginEntry } from "openclaw/plugin-sdk/plugin-entry";
 import { createCodexAppServerAgentHarness } from "./harness.js";
 import { buildCodexMediaUnderstandingProvider } from "./media-understanding-provider.js";
 import { buildCodexProvider } from "./provider.js";
 import {
  CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
  CODEX_APP_SERVER_BINDING_NAMESPACE,
  createLazyCodexAppServerBindingStore,
  type StoredCodexAppServerBinding,
 } from "./src/app-server/session-binding-store.js";
 import type { CodexPluginsConfigBlock } from "./src/command-plugins-management.js";
 import { createCodexCommand } from "./src/commands.js";
 import {
  handleCodexConversationBindingResolved,
  handleCodexConversationInboundClaim,
 } from "./src/conversation-binding.js";
 import { buildCodexMigrationProvider } from "./src/migration/provider.js";
 import {
  createCodexCliSessionNodeHostCommands,
  createCodexCliSessionNodeInvokePolicies,
-} from "./src/node-cli-session-registration.js";
+  listCodexCliSessionsOnNode,
-
+  resumeCodexCliSessionOnNode,
-const ENDED_SESSION_REASONS: ReadonlySet<string> = new Set([
+  resolveCodexCliSessionForBindingOnNode,
-  "new",
+} from "./src/node-cli-sessions.js";
  "reset",
  "idle",
  "daily",
  "deleted",
 ]);
 export default definePluginEntry({
  id: "codex",
  name: "Codex",
  description: "Codex app-server harness and Codex-managed GPT model catalog.",
  register(api) {
-    const runtimeConfigLoader = api.runtime.config?.current
+    const resolveCurrentConfig = () =>
-      ? () => api.runtime.config?.current() as OpenClawConfig
+      api.runtime.config?.current ? (api.runtime.config.current() as OpenClawConfig) : undefined;
      : undefined;
    const resolveCurrentConfig = () => runtimeConfigLoader?.();
    const loadNodeCliSessions = () => import("./src/node-cli-sessions.js");
    const resolveCurrentPluginConfig = () =>
      // Codex plugin config can change at runtime; resolve from live config for
      // harness attempts and binding claims instead of keeping startup values.
      resolveLivePluginConfigObject(
-        runtimeConfigLoader,
+        resolveCurrentConfig,
        "codex",
        api.pluginConfig as Record<string, unknown>,
-      );
+      ) ?? api.pluginConfig;
    const bindingStore = createLazyCodexAppServerBindingStore(
      api.runtime.state.openSyncKeyedStore<StoredCodexAppServerBinding>({
        namespace: CODEX_APP_SERVER_BINDING_NAMESPACE,
        maxEntries: CODEX_APP_SERVER_BINDING_MAX_ENTRIES,
        overflowPolicy: "reject-new",
      }),
    );
    api.registerAgentHarness(
-      createCodexAppServerAgentHarness({
+      createCodexAppServerAgentHarness({ resolvePluginConfig: resolveCurrentPluginConfig }),
        bindingStore,
        resolveConfig: resolveCurrentConfig,
        resolvePluginConfig: resolveCurrentPluginConfig,
      }),
    );
    api.registerProvider(buildCodexProvider({ pluginConfig: api.pluginConfig }));
    api.registerMediaUnderstandingProvider(
-      buildCodexMediaUnderstandingProvider({ resolvePluginConfig: resolveCurrentPluginConfig }),
+      buildCodexMediaUnderstandingProvider({ pluginConfig: api.pluginConfig }),
    );
    api.registerMigrationProvider(buildCodexMigrationProvider({ runtime: api.runtime }));
    for (const command of createCodexCliSessionNodeHostCommands()) {
@@ -79,43 +55,43 @@ export default definePluginEntry({
    }
    api.registerCommand(
      createCodexCommand({
-        resolvePluginConfig: resolveCurrentPluginConfig,
+        pluginConfig: api.pluginConfig,
        deps: {
-          bindingStore,
+          listCodexCliSessionsOnNode: (params) =>
-          listCodexCliSessionsOnNode: async (params) =>
+            listCodexCliSessionsOnNode({ runtime: api.runtime, ...params }),
-            await (
+          resolveCodexCliSessionForBindingOnNode: (params) =>
-              await loadNodeCliSessions()
+            resolveCodexCliSessionForBindingOnNode({ runtime: api.runtime, ...params }),
            ).listCodexCliSessionsOnNode({
              runtime: api.runtime,
              ...params,
            }),
          resolveCodexCliSessionForBindingOnNode: async (params) =>
            await (
              await loadNodeCliSessions()
            ).resolveCodexCliSessionForBindingOnNode({
              runtime: api.runtime,
              ...params,
            }),
          codexPluginsManagementIo: {
            readConfig: () => {
              const current = (api.runtime.config?.current?.() ?? {}) as OpenClawConfig;
-              const codexPlugins = resolvePluginConfigObject(current, "codex")?.codexPlugins;
+              const plugins = (current as Record<string, unknown>).plugins;
-              if (
+              if (!plugins || typeof plugins !== "object") {
                !codexPlugins ||
                typeof codexPlugins !== "object" ||
                Array.isArray(codexPlugins)
              ) {
                return Promise.resolve({});
              }
-              const block = codexPlugins as Record<string, unknown>;
+              const entries = (plugins as Record<string, unknown>).entries;
-              const declared = block.plugins;
+              if (!entries || typeof entries !== "object") {
                return Promise.resolve({});
              }
              const codexEntry = (entries as Record<string, unknown>).codex;
              if (!codexEntry || typeof codexEntry !== "object") {
                return Promise.resolve({});
              }
              const config = (codexEntry as Record<string, unknown>).config;
              if (!config || typeof config !== "object") {
                return Promise.resolve({});
              }
              const codexPlugins = (config as Record<string, unknown>).codexPlugins;
              if (!codexPlugins || typeof codexPlugins !== "object") {
                return Promise.resolve({});
              }
              const declared = (codexPlugins as Record<string, unknown>).plugins;
              if (!declared || typeof declared !== "object") {
                return Promise.resolve({
-                  enabled: block.enabled === true,
+                  enabled: (codexPlugins as Record<string, unknown>).enabled === true,
                });
              }
              return Promise.resolve({
-                enabled: block.enabled === true,
+                enabled: (codexPlugins as Record<string, unknown>).enabled === true,
                plugins: declared as Record<string, never>,
              });
            },
@@ -125,12 +101,17 @@ export default definePluginEntry({
                  // Create the nested plugin config path on demand so codex
                  // plugin commands can enable/update Codex-managed plugins.
                  const root = draft as Record<string, unknown>;
-                  const pluginsBlock = (root.plugins ??= {}) as Record<string, unknown>;
+                  root.plugins = (root.plugins ?? {}) as Record<string, unknown>;
-                  const entries = (pluginsBlock.entries ??= {}) as Record<string, unknown>;
+                  const pluginsBlock = root.plugins as Record<string, unknown>;
-                  const codexEntry = (entries.codex ??= {}) as Record<string, unknown>;
+                  pluginsBlock.entries = (pluginsBlock.entries ?? {}) as Record<string, unknown>;
-                  const config = (codexEntry.config ??= {}) as Record<string, unknown>;
+                  const entries = pluginsBlock.entries as Record<string, unknown>;
-                  const codexPlugins = (config.codexPlugins ??= {}) as Record<string, unknown>;
+                  entries.codex = (entries.codex ?? {}) as Record<string, unknown>;
-                  codexPlugins.plugins ??= {};
+                  const codexEntry = entries.codex as Record<string, unknown>;
                  codexEntry.config = (codexEntry.config ?? {}) as Record<string, unknown>;
                  const config = codexEntry.config as Record<string, unknown>;
                  config.codexPlugins = (config.codexPlugins ?? {}) as Record<string, unknown>;
                  const codexPlugins = config.codexPlugins as Record<string, unknown>;
                  codexPlugins.plugins = (codexPlugins.plugins ?? {}) as Record<string, unknown>;
                  update(codexPlugins as CodexPluginsConfigBlock);
                },
              });
@@ -139,58 +120,14 @@ export default definePluginEntry({
        },
      }),
    );
-    api.on("inbound_claim", async (event, ctx) => {
+    api.on("inbound_claim", (event, ctx) =>
-      const { handleCodexConversationInboundClaim } = await import("./src/conversation-binding.js");
+      handleCodexConversationInboundClaim(event, ctx, {
      return await handleCodexConversationInboundClaim(event, ctx, {
        bindingStore,
        pluginConfig: resolveCurrentPluginConfig(),
        config: resolveCurrentConfig(),
-        resumeCodexCliSessionOnNode: async (params) =>
+        resumeCodexCliSessionOnNode: (params) =>
-          await (
+          resumeCodexCliSessionOnNode({ runtime: api.runtime, ...params }),
-            await loadNodeCliSessions()
+      }),
-          ).resumeCodexCliSessionOnNode({
+    );
-            runtime: api.runtime,
+    api.onConversationBindingResolved?.(handleCodexConversationBindingResolved);
            ...params,
          }),
      });
    });
    api.on("after_compaction", async (event, ctx) => {
      const previousSessionId = event.previousSessionId?.trim();
      const sessionId = ctx.sessionId?.trim();
      if (!previousSessionId || !sessionId || previousSessionId === sessionId) {
        return;
      }
      const config = resolveCurrentConfig();
      const sessionKey = ctx.sessionKey?.trim();
      const { sessionBindingIdentity } = await import("./src/app-server/session-binding.js");
      const identity = sessionBindingIdentity({
        sessionId,
        ...(sessionKey ? { sessionKey } : {}),
        ...(ctx.agentId ? { agentId: ctx.agentId } : {}),
        ...(config ? { config } : {}),
      });
      const adopted = await bindingStore.adoptSessionGeneration(identity, previousSessionId);
      if (adopted === "conflict") {
        api.logger.warn?.(
          `codex: could not adopt compacted session generation ${sessionId} (${adopted}); secondary native compaction will skip`,
        );
      }
    });
    api.on("session_end", async (event, ctx) => {
      if (!event.reason || !ENDED_SESSION_REASONS.has(event.reason)) {
        return;
      }
      const sessionKey = event.sessionKey ?? ctx.sessionKey;
      const config = resolveCurrentConfig();
      const { sessionBindingIdentity } = await import("./src/app-server/session-binding.js");
      await bindingStore.retireSessionGeneration(
        sessionBindingIdentity({
          sessionId: event.sessionId,
          ...(sessionKey ? { sessionKey } : {}),
          ...(ctx.agentId ? { agentId: ctx.agentId } : {}),
          ...(config ? { config } : {}),
        }),
      );
    });
  },
 });
--- a/extensions/codex/media-understanding-provider.test.ts
+++ b/extensions/codex/media-understanding-provider.test.ts
@@ -2,33 +2,8 @@
 import { MAX_TIMER_TIMEOUT_MS } from "openclaw/plugin-sdk/number-runtime";
 import { afterEach, describe, expect, it, vi } from "vitest";
 import { buildCodexMediaUnderstandingProvider } from "./media-understanding-provider.js";
-import { CodexAppServerRpcError, type CodexAppServerClient } from "./src/app-server/client.js";
+import type { CodexAppServerClient } from "./src/app-server/client.js";
 import type { CodexServerNotification, JsonValue } from "./src/app-server/protocol.js";
 import { adaptCodexTestClientFactory } from "./src/app-server/test-support.js";
 const EXPECTED_MEDIA_THREAD_CONFIG = {
  project_doc_max_bytes: 0,
  web_search: "disabled",
  "tools.experimental_request_user_input.enabled": false,
  "features.hooks": false,
  "features.multi_agent": false,
  "features.apps": false,
  "features.plugins": false,
  "features.image_generation": false,
  "features.skill_mcp_dependency_install": false,
  "features.memories": false,
  "features.goals": false,
  "features.code_mode": false,
  "features.code_mode_only": false,
 };
 const sharedClientMocks = vi.hoisted(() => ({
  createIsolatedCodexAppServerClient: vi.fn(),
 }));
 vi.mock("./src/app-server/shared-client.js", () => ({
  createIsolatedCodexAppServerClient: sharedClientMocks.createIsolatedCodexAppServerClient,
 }));
 function codexModel(inputModalities: string[] = ["text", "image"]) {
  return {
@@ -102,15 +77,13 @@ function createFakeClient(options?: {
  inputModalities?: string[];
  completeWithItems?: boolean;
  notifyError?: string;
  approvalRequestMethod?: string;
  responseText?: string;
  turnStartError?: Error;
  preBindNotificationCount?: number;
  interruptError?: Error;
  unsubscribeError?: Error;
 }) {
  const notifications = new Set<(notification: CodexServerNotification) => void>();
-  const closeHandlers = new Set<() => void>();
+  const requestHandlers = new Set<(request: { method: string }) => JsonValue | undefined>();
  const requests: Array<{ method: string; params?: JsonValue }> = [];
  const approvalResponses: JsonValue[] = [];
  const request = vi.fn(async (method: string, params?: JsonValue) => {
    requests.push({ method, params });
    if (method === "model/list") {
@@ -123,60 +96,51 @@ function createFakeClient(options?: {
      return threadStartResult();
    }
    if (method === "turn/start") {
-      if (options?.turnStartError) {
+      if (options?.approvalRequestMethod) {
-        throw options.turnStartError;
+        for (const handler of requestHandlers) {
-      }
+          const response = handler({ method: options.approvalRequestMethod });
-      if (options?.preBindNotificationCount) {
+          if (response !== undefined) {
-        for (let index = 0; index < options.preBindNotificationCount; index += 1) {
+            approvalResponses.push(response);
          for (const notify of notifications) {
            notify({
              method: "item/started",
              params: { threadId: "thread-1", turnId: "turn-1" },
            });
          }
        }
        return turnStartResult();
      }
-      const emitTurnNotifications = () => {
+      if (options?.notifyError) {
-        if (options?.notifyError) {
+        for (const notify of notifications) {
-          for (const notify of notifications) {
+          notify({
-            notify({
+            method: "error",
-              method: "error",
+            params: {
-              params: {
+              threadId: "thread-1",
-                threadId: "thread-1",
+              turnId: "turn-1",
-                turnId: "turn-1",
+              error: {
-                error: {
+                message: options.notifyError,
-                  message: options.notifyError,
+                codexErrorInfo: null,
-                  codexErrorInfo: null,
+                additionalDetails: null,
                  additionalDetails: null,
                },
                willRetry: false,
              },
-            });
+              willRetry: false,
-          }
+            },
-        } else if (!options?.completeWithItems) {
+          });
          for (const notify of notifications) {
            notify({
              method: "item/agentMessage/delta",
              params: {
                threadId: "thread-1",
                turnId: "turn-1",
                itemId: "msg-1",
                delta: options?.responseText ?? "A red square.",
              },
            });
            notify({
              method: "turn/completed",
              params: {
                threadId: "thread-1",
                turnId: "turn-1",
                turn: turnStartResult("completed").turn,
              },
            });
          }
        }
-      };
+      } else if (!options?.completeWithItems) {
-      emitTurnNotifications();
+        for (const notify of notifications) {
          notify({
            method: "item/agentMessage/delta",
            params: {
              threadId: "thread-1",
              turnId: "turn-1",
              itemId: "msg-1",
              delta: options?.responseText ?? "A red square.",
            },
          });
          notify({
            method: "turn/completed",
            params: {
              threadId: "thread-1",
              turnId: "turn-1",
              turn: turnStartResult("completed").turn,
            },
          });
        }
      }
      return turnStartResult(
        options?.completeWithItems ? "completed" : "inProgress",
        options?.completeWithItems
@@ -192,12 +156,6 @@ function createFakeClient(options?: {
          : [],
      );
    }
    if (method === "turn/interrupt" && options?.interruptError) {
      throw options.interruptError;
    }
    if (method === "thread/unsubscribe" && options?.unsubscribeError) {
      throw options.unsubscribeError;
    }
    return {};
  });
@@ -207,39 +165,26 @@ function createFakeClient(options?: {
      notifications.add(handler);
      return () => notifications.delete(handler);
    },
-    addRequestHandler() {
+    addRequestHandler(handler: (request: { method: string }) => JsonValue | undefined) {
-      return () => undefined;
+      requestHandlers.add(handler);
      return () => requestHandlers.delete(handler);
    },
    addCloseHandler(handler: () => void) {
      closeHandlers.add(handler);
      return () => closeHandlers.delete(handler);
    },
    close: vi.fn(),
  } as unknown as CodexAppServerClient;
-  return { client, requests };
+  return { client, requests, approvalResponses };
 }
 describe("codex media understanding provider", () => {
  afterEach(() => {
    vi.useRealTimers();
    vi.restoreAllMocks();
    sharedClientMocks.createIsolatedCodexAppServerClient.mockReset();
  });
  it("runs image understanding through a bounded Codex app-server turn", async () => {
    const { client, requests } = createFakeClient();
    const clientFactory = vi.fn(async () => client);
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(clientFactory),
+      clientFactory: async () => client,
    });
    const cfg = {
      auth: {
        order: {
          openai: ["openai:work"],
        },
      },
    };
    const result = await provider.describeImage?.({
      buffer: Buffer.from("image-bytes"),
@@ -249,38 +194,34 @@ describe("codex media understanding provider", () => {
      model: "gpt-5.4",
      prompt: "Describe briefly.",
      timeoutMs: 30_000,
-      cfg,
+      cfg: {},
      agentDir: "/tmp/openclaw-agent",
    });
    expect(result).toEqual({ text: "A red square.", model: "gpt-5.4" });
    expect(clientFactory).toHaveBeenCalledWith(
      expect.any(Object),
      undefined,
      "/tmp/openclaw-agent",
      cfg,
      expect.objectContaining({ timeoutMs: 30_000 }),
    );
    expect(requests.map((entry) => entry.method)).toEqual([
      "model/list",
      "thread/start",
      "turn/start",
      "thread/unsubscribe",
    ]);
    expect(requests[0]?.params).toEqual({ limit: 100, cursor: null, includeHidden: true });
    expect(requests[1]?.params).toEqual({
      model: "gpt-5.4",
      modelProvider: "openai",
-      cwd: "/tmp/openclaw-agent/codex-media-home",
+      cwd: "/tmp/openclaw-agent",
-      approvalPolicy: "never",
+      approvalPolicy: "on-request",
      sandbox: "read-only",
      serviceName: "OpenClaw",
      personality: "none",
      developerInstructions:
        "You are OpenClaw's bounded image-understanding worker. Describe only the provided image content. Do not call tools, edit files, or ask follow-up questions.",
-      config: EXPECTED_MEDIA_THREAD_CONFIG,
+      config: {
        "features.code_mode": false,
        "features.code_mode_only": false,
      },
      environments: [],
      dynamicTools: [],
      experimentalRawEvents: true,
      ephemeral: true,
      persistExtendedHistory: false,
    });
    expect(requests[2]?.params).toEqual({
      threadId: "thread-1",
@@ -288,83 +229,19 @@ describe("codex media understanding provider", () => {
        { type: "text", text: "Describe briefly.", text_elements: [] },
        { type: "image", url: "data:image/png;base64,aW1hZ2UtYnl0ZXM=" },
      ],
      cwd: "/tmp/openclaw-agent",
      approvalPolicy: "on-request",
      model: "gpt-5.4",
      effort: "low",
    });
  });
  it("treats a blank agent directory as absent when starting the app-server", async () => {
    const { client, requests } = createFakeClient();
    const clientFactory = vi.fn(async () => client);
    const provider = buildCodexMediaUnderstandingProvider({
      clientLeaseFactory: adaptCodexTestClientFactory(clientFactory),
    });
    const cfg = {
      agents: { list: [{ id: "main", agentDir: "/tmp/openclaw-default-agent" }] },
    };
    await provider.describeImage?.({
      buffer: Buffer.from("image-bytes"),
      fileName: "image.png",
      mime: "image/png",
      provider: "codex",
      model: "gpt-5.4",
      timeoutMs: 30_000,
      cfg,
      agentDir: " ",
    });
    expect(clientFactory).toHaveBeenCalledWith(
      expect.any(Object),
      undefined,
      "/tmp/openclaw-default-agent",
      cfg,
      expect.any(Object),
    );
    expect(requests[1]?.params).toEqual(
      expect.objectContaining({ cwd: "/tmp/openclaw-default-agent/codex-media-home" }),
    );
  });
  it("passes the scoped auth store into isolated app-server startup", async () => {
    const { client } = createFakeClient();
    sharedClientMocks.createIsolatedCodexAppServerClient.mockResolvedValue(client);
    const provider = buildCodexMediaUnderstandingProvider();
    const authStore = {
      version: 1,
      profiles: {
        "openai:scoped": {
          type: "oauth" as const,
          provider: "openai",
          access: "scoped-access",
          refresh: "scoped-refresh",
          expires: Date.now() + 60_000,
        },
      },
    };
    await provider.describeImage?.({
      buffer: Buffer.from("image-bytes"),
      fileName: "image.png",
      mime: "image/png",
      provider: "codex",
      model: "gpt-5.4",
      timeoutMs: 30_000,
      cfg: {},
      authStore,
      agentDir: "/tmp/openclaw-agent",
    });
    expect(sharedClientMocks.createIsolatedCodexAppServerClient).toHaveBeenCalledWith(
      expect.objectContaining({ authProfileStore: authStore }),
    );
  });
  it("clamps oversized image understanding turn timeouts", async () => {
    const setTimeoutSpy = vi.spyOn(globalThis, "setTimeout");
    try {
      const { client } = createFakeClient();
      const provider = buildCodexMediaUnderstandingProvider({
-        clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+        clientFactory: async () => client,
      });
      const result = await provider.describeImage?.({
@@ -387,97 +264,33 @@ describe("codex media understanding provider", () => {
    }
  });
-  it("starts the media deadline before client acquisition", async () => {
+  it("declines approval requests during image understanding", async () => {
-    vi.useFakeTimers();
+    const { client, approvalResponses } = createFakeClient({
      approvalRequestMethod: "item/permissions/requestApproval",
    });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(
+      clientFactory: async () => client,
        async () => await new Promise<CodexAppServerClient>(() => {}),
      ),
    });
-    const description = provider.describeImage?.({
+
-      buffer: Buffer.from("image-bytes"),
+    await provider.describeImage?.({
      fileName: "image.png",
      mime: "image/png",
      provider: "codex",
      model: "gpt-5.4",
      timeoutMs: 100,
      cfg: {},
      agentDir: "/tmp/openclaw-agent",
    });
    const rejected = expect(description).rejects.toThrow(
      "Codex app-server image understanding timed out",
    );
    await vi.advanceTimersByTimeAsync(100);
    await rejected;
  });
  it("retires a media client lease that resolves after its deadline", async () => {
    let resolveLease!: (lease: {
      client: CodexAppServerClient;
      release: () => void;
      abandon: () => Promise<void>;
    }) => void;
    const pendingLease = new Promise<{
      client: CodexAppServerClient;
      release: () => void;
      abandon: () => Promise<void>;
    }>((resolve) => {
      resolveLease = resolve;
    });
    const clientLeaseFactory = vi.fn(async () => await pendingLease);
    const provider = buildCodexMediaUnderstandingProvider({ clientLeaseFactory });
    const description = provider.describeImage?.({
      buffer: Buffer.from("image-bytes"),
      fileName: "image.png",
      mime: "image/png",
      provider: "codex",
      model: "gpt-5.4",
      timeoutMs: 5,
      cfg: {},
      agentDir: "/tmp/openclaw-agent",
    });
    await expect(description).rejects.toThrow("Codex app-server image understanding timed out");
    const { client } = createFakeClient();
    const release = vi.fn();
    const abandon = vi.fn(async () => undefined);
    resolveLease({ client, release, abandon });
    await vi.waitFor(() => expect(abandon).toHaveBeenCalledOnce());
    expect(release).not.toHaveBeenCalled();
  });
  it("releases the bounded route between isolated media calls", async () => {
    const { client, requests } = createFakeClient();
    const provider = buildCodexMediaUnderstandingProvider({
      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
    });
    const request = {
      buffer: Buffer.from("image-bytes"),
      fileName: "image.png",
      mime: "image/png",
      provider: "codex",
      model: "gpt-5.4",
      prompt: "Describe briefly.",
      timeoutMs: 30_000,
      cfg: {},
      agentDir: "/tmp/openclaw-agent",
-    };
+    });
-    const first = await provider.describeImage?.(request);
+    expect(approvalResponses).toEqual([{ permissions: {}, scope: "turn" }]);
    const second = await provider.describeImage?.(request);
    expect(first?.text).toBe("A red square.");
    expect(second?.text).toBe("A red square.");
    expect(requests.filter((entry) => entry.method === "model/list")).toHaveLength(2);
    expect(requests.filter((entry) => entry.method === "thread/start")).toHaveLength(2);
  });
  it("extracts text from terminal turn items", async () => {
    const { client } = createFakeClient({ completeWithItems: true });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    const result = await provider.describeImages?.({
@@ -496,7 +309,7 @@ describe("codex media understanding provider", () => {
  it("rejects text-only Codex app-server models before starting a turn", async () => {
    const { client, requests } = createFakeClient({ inputModalities: ["text"] });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    await expect(
@@ -517,7 +330,7 @@ describe("codex media understanding provider", () => {
  it("surfaces Codex app-server turn errors", async () => {
    const { client } = createFakeClient({ notifyError: "vision unavailable" });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    await expect(
@@ -534,107 +347,12 @@ describe("codex media understanding provider", () => {
    ).rejects.toThrow("vision unavailable");
  });
  it.each([
    {
      name: "structured rejection",
      error: new CodexAppServerRpcError({ message: "turn rejected" }, "turn/start"),
      abandonCount: 0,
    },
    {
      name: "ambiguous timeout",
      error: new Error("turn/start timed out"),
      abandonCount: 1,
    },
  ])("handles $name with exact media lease ownership", async ({ error, abandonCount }) => {
    const { client } = createFakeClient({ turnStartError: error });
    const release = vi.fn();
    const abandon = vi.fn(async () => undefined);
    const provider = buildCodexMediaUnderstandingProvider({
      clientLeaseFactory: async () => ({ client, release, abandon }),
    });
    await expect(
      provider.describeImage?.({
        buffer: Buffer.from("image-bytes"),
        fileName: "image.png",
        mime: "image/png",
        provider: "codex",
        model: "gpt-5.4",
        timeoutMs: 30_000,
        cfg: {},
        agentDir: "/tmp/openclaw-agent",
      }),
    ).rejects.toBe(error);
    expect(abandon).toHaveBeenCalledTimes(abandonCount);
    expect(release).toHaveBeenCalledTimes(1);
  });
  it("retires the media client when thread cleanup is unconfirmed", async () => {
    const { client } = createFakeClient({ unsubscribeError: new Error("unsubscribe failed") });
    const release = vi.fn();
    const abandon = vi.fn(async () => undefined);
    const provider = buildCodexMediaUnderstandingProvider({
      clientLeaseFactory: async () => ({ client, release, abandon }),
    });
    await expect(
      provider.describeImage?.({
        buffer: Buffer.from("image-bytes"),
        fileName: "image.png",
        mime: "image/png",
        provider: "codex",
        model: "gpt-5.4",
        timeoutMs: 30_000,
        cfg: {},
        agentDir: "/tmp/openclaw-agent",
      }),
    ).resolves.toEqual({ text: "A red square.", model: "gpt-5.4" });
    expect(abandon).toHaveBeenCalledOnce();
    expect(release).not.toHaveBeenCalled();
  });
  it("retires the media client when an accepted turn cannot be interrupted", async () => {
    const { client, requests } = createFakeClient({
      preBindNotificationCount: 257,
      interruptError: new Error("interrupt timeout"),
    });
    const release = vi.fn();
    const abandon = vi.fn(async () => undefined);
    const provider = buildCodexMediaUnderstandingProvider({
      clientLeaseFactory: async () => ({ client, release, abandon }),
    });
    await expect(
      provider.describeImage?.({
        buffer: Buffer.from("image-bytes"),
        fileName: "image.png",
        mime: "image/png",
        provider: "codex",
        model: "gpt-5.4",
        timeoutMs: 30_000,
        cfg: {},
        agentDir: "/tmp/openclaw-agent",
      }),
    ).rejects.toThrow("pre-bind notification buffer exceeded 256 entries");
    expect(requests.map((entry) => entry.method)).toEqual([
      "model/list",
      "thread/start",
      "turn/start",
      "turn/interrupt",
    ]);
    expect(abandon).toHaveBeenCalledOnce();
    expect(release).not.toHaveBeenCalled();
  });
  it("runs structured extraction through the same bounded Codex app-server path", async () => {
    const { client, requests } = createFakeClient({
      responseText: '{"summary":"red square","tags":["shape"]}',
    });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    const result = await provider.extractStructured?.({
@@ -675,21 +393,25 @@ describe("codex media understanding provider", () => {
      "model/list",
      "thread/start",
      "turn/start",
      "thread/unsubscribe",
    ]);
    expect(requests[1]?.params).toEqual({
      model: "gpt-5.4",
      modelProvider: "openai",
-      cwd: "/tmp/openclaw-agent/codex-media-home",
+      cwd: "/tmp/openclaw-agent",
-      approvalPolicy: "never",
+      approvalPolicy: "on-request",
      sandbox: "read-only",
      serviceName: "OpenClaw",
      personality: "none",
      developerInstructions:
        "You are OpenClaw's bounded structured-extraction worker. Return only the requested extraction. Do not call tools, edit files, ask follow-up questions, or include secrets.",
-      config: EXPECTED_MEDIA_THREAD_CONFIG,
+      config: {
        "features.code_mode": false,
        "features.code_mode_only": false,
      },
      environments: [],
      dynamicTools: [],
      experimentalRawEvents: true,
      ephemeral: true,
      persistExtendedHistory: false,
    });
    const turnParams = requests[2]?.params as
      | {
@@ -702,9 +424,9 @@ describe("codex media understanding provider", () => {
        }
      | undefined;
    expect(turnParams?.threadId).toBe("thread-1");
-    expect(turnParams?.approvalPolicy).toBeUndefined();
+    expect(turnParams?.approvalPolicy).toBe("on-request");
-    expect(turnParams?.model).toBeUndefined();
+    expect(turnParams?.model).toBe("gpt-5.4");
-    expect(turnParams?.cwd).toBeUndefined();
+    expect(turnParams?.cwd).toBe("/tmp/openclaw-agent");
    expect(turnParams?.effort).toBe("low");
    expect(turnParams?.input).toHaveLength(3);
    expect(turnParams?.input?.[0]?.type).toBe("text");
@@ -727,7 +449,7 @@ describe("codex media understanding provider", () => {
      responseText: '{"summary":"only text"}',
    });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    await expect(
@@ -747,7 +469,7 @@ describe("codex media understanding provider", () => {
  it("returns a controlled error when structured JSON parsing fails", async () => {
    const { client } = createFakeClient({ responseText: "not json" });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    await expect(
@@ -776,7 +498,7 @@ describe("codex media understanding provider", () => {
      responseText: '{"summary":123,"tags":["shape"]}',
    });
    const provider = buildCodexMediaUnderstandingProvider({
-      clientLeaseFactory: adaptCodexTestClientFactory(async () => client),
+      clientFactory: async () => client,
    });
    await expect(
--- a/extensions/codex/media-understanding-provider.ts
+++ b/extensions/codex/media-understanding-provider.ts
@@ -1,35 +1,538 @@
-/** Lazy registration facade for Codex-backed media understanding. */
+/**
-import type { MediaUnderstandingProvider } from "openclaw/plugin-sdk/media-understanding";
+ * Codex-backed media understanding provider for bounded image description and
 * structured extraction turns.
 */
 import {
  type JsonSchemaObject,
  validateJsonSchemaValue,
 } from "openclaw/plugin-sdk/json-schema-runtime";
 import type {
  ImagesDescriptionRequest,
  ImagesDescriptionResult,
  MediaUnderstandingProvider,
  StructuredExtractionRequest,
  StructuredExtractionResult,
 } from "openclaw/plugin-sdk/media-understanding";
 import { resolveTimerTimeoutMs } from "openclaw/plugin-sdk/number-runtime";
 import { CODEX_PROVIDER_ID, FALLBACK_CODEX_MODELS } from "./provider-catalog.js";
-import type { CodexAppServerClientLeaseFactory } from "./src/app-server/shared-client.js";
+import type { CodexAppServerClientFactory } from "./src/app-server/client-factory.js";
 import type { CodexAppServerClient } from "./src/app-server/client.js";
 import { resolveCodexAppServerRuntimeOptions } from "./src/app-server/config.js";
 import { readModelListResult } from "./src/app-server/models.js";
 import {
  assertCodexThreadStartResponse,
  assertCodexTurnStartResponse,
  readCodexErrorNotification,
  readCodexTurnCompletedNotification,
 } from "./src/app-server/protocol-validators.js";
 import {
  isJsonObject,
  type CodexServerNotification,
  type CodexThreadItem,
  type CodexThreadStartParams,
  type CodexTurn,
  type CodexTurnStartParams,
  type CodexUserInput,
  type JsonObject,
  type JsonValue,
 } from "./src/app-server/protocol.js";
 import { buildCodexRuntimeThreadConfig } from "./src/app-server/thread-lifecycle.js";
 const DEFAULT_CODEX_IMAGE_MODEL =
  FALLBACK_CODEX_MODELS.find((model) => model.inputModalities.includes("image"))?.id ??
  FALLBACK_CODEX_MODELS[0]?.id;
 const DEFAULT_CODEX_IMAGE_PROMPT = "Describe the image.";
 /** Dependencies and plugin config for Codex media-understanding calls. */
 export type CodexMediaUnderstandingProviderOptions = {
  pluginConfig?: unknown;
-  resolvePluginConfig?: () => unknown;
+  clientFactory?: CodexAppServerClientFactory;
  clientLeaseFactory?: CodexAppServerClientLeaseFactory;
 };
-/** Builds a provider whose app-server implementation loads on first use. */
+/**
 * Builds the media-understanding provider that delegates image tasks to an
 * isolated Codex app-server session.
 */
 export function buildCodexMediaUnderstandingProvider(
  options: CodexMediaUnderstandingProviderOptions = {},
 ): MediaUnderstandingProvider {
  let runtime: Promise<typeof import("./src/media-understanding-provider.runtime.js")> | undefined;
  const load = () => (runtime ??= import("./src/media-understanding-provider.runtime.js"));
  return {
    id: CODEX_PROVIDER_ID,
    capabilities: ["image"],
    ...(DEFAULT_CODEX_IMAGE_MODEL ? { defaultModels: { image: DEFAULT_CODEX_IMAGE_MODEL } } : {}),
-    describeImage: async ({ buffer, fileName, mime, ...request }) =>
+    describeImage: async (req) =>
-      await (
+      describeCodexImages(
-        await load()
+        {
-      ).describeCodexImages({ ...request, images: [{ buffer, fileName, mime }] }, options),
+          images: [
-    describeImages: async (request) => await (await load()).describeCodexImages(request, options),
+            {
-    extractStructured: async (request) =>
+              buffer: req.buffer,
-      await (await load()).extractCodexStructured(request, options),
+              fileName: req.fileName,
              mime: req.mime,
            },
          ],
          provider: req.provider,
          model: req.model,
          prompt: req.prompt,
          maxTokens: req.maxTokens,
          timeoutMs: req.timeoutMs,
          profile: req.profile,
          preferredProfile: req.preferredProfile,
          authStore: req.authStore,
          agentDir: req.agentDir,
          cfg: req.cfg,
        },
        options,
      ),
    describeImages: async (req) => describeCodexImages(req, options),
    extractStructured: async (req) => extractCodexStructured(req, options),
  };
 }
 async function describeCodexImages(
  req: ImagesDescriptionRequest,
  options: CodexMediaUnderstandingProviderOptions,
 ): Promise<ImagesDescriptionResult> {
  const model = req.model.trim();
  if (!model) {
    throw new Error("Codex image understanding requires model id.");
  }
  const text = await runBoundedCodexVisionTurn({
    model,
    profile: req.profile,
    timeoutMs: req.timeoutMs,
    agentDir: req.agentDir,
    options,
    taskLabel: "image understanding",
    developerInstructions:
      "You are OpenClaw's bounded image-understanding worker. Describe only the provided image content. Do not call tools, edit files, or ask follow-up questions.",
    input: [
      { type: "text", text: buildCodexImagePrompt(req), text_elements: [] },
      ...req.images.map((image) => ({
        type: "image" as const,
        url: `data:${image.mime ?? "image/png"};base64,${image.buffer.toString("base64")}`,
      })),
    ],
    requiredModalities: ["text", "image"],
  });
  return { text, model };
 }
 type BoundedCodexVisionTurnParams = {
  model: string;
  profile?: string;
  timeoutMs: number;
  agentDir?: string;
  options: CodexMediaUnderstandingProviderOptions;
  taskLabel: string;
  developerInstructions: string;
  input: CodexUserInput[];
  requiredModalities: string[];
 };
 async function runBoundedCodexVisionTurn(params: BoundedCodexVisionTurnParams): Promise<string> {
  const appServer = resolveCodexAppServerRuntimeOptions({
    pluginConfig: params.options.pluginConfig,
  });
  const timeoutMs = resolveTimerTimeoutMs(params.timeoutMs, 100, 100);
  const ownsClient = !params.options.clientFactory;
  // Tests inject a client factory; production creates an isolated app-server
  // client so media tasks cannot reuse the interactive attempt session.
  const client = params.options.clientFactory
    ? await params.options.clientFactory(appServer.start, params.profile)
    : await import("./src/app-server/shared-client.js").then(
        ({ createIsolatedCodexAppServerClient }) =>
          createIsolatedCodexAppServerClient({
            startOptions: appServer.start,
            timeoutMs,
            authProfileId: params.profile,
          }),
      );
  const abortController = new AbortController();
  const timeout = setTimeout(() => abortController.abort("timeout"), timeoutMs);
  timeout.unref?.();
  try {
    await assertCodexModelSupportsInput({
      client,
      model: params.model,
      requiredModalities: params.requiredModalities,
      timeoutMs,
      signal: abortController.signal,
    });
    const thread = assertCodexThreadStartResponse(
      await client.request<unknown>(
        "thread/start",
        {
          model: params.model,
          modelProvider: "openai",
          cwd: params.agentDir || process.cwd(),
          approvalPolicy: "on-request",
          sandbox: "read-only",
          serviceName: "OpenClaw",
          developerInstructions: params.developerInstructions,
          // Media workers are bounded read-only turns; native code mode and
          // dynamic tools stay disabled to avoid side effects while inspecting media.
          config: buildCodexRuntimeThreadConfig(undefined, { nativeCodeModeEnabled: false }),
          environments: [],
          dynamicTools: [],
          experimentalRawEvents: true,
          persistExtendedHistory: false,
          ephemeral: true,
        } satisfies CodexThreadStartParams,
        { timeoutMs, signal: abortController.signal },
      ),
    );
    const collector = createCodexTurnCollector(thread.thread.id, params.taskLabel);
    const cleanup = client.addNotificationHandler(collector.handleNotification);
    const requestCleanup = client.addRequestHandler(denyCodexImageApprovalRequest);
    try {
      const turn = assertCodexTurnStartResponse(
        await client.request<unknown>(
          "turn/start",
          {
            threadId: thread.thread.id,
            input: params.input,
            cwd: params.agentDir || process.cwd(),
            approvalPolicy: "on-request",
            model: params.model,
            effort: "low",
          } satisfies CodexTurnStartParams,
          { timeoutMs, signal: abortController.signal },
        ),
      );
      const text = await collector.collect(turn.turn, {
        timeoutMs,
        signal: abortController.signal,
      });
      return text;
    } finally {
      requestCleanup();
      cleanup();
    }
  } finally {
    clearTimeout(timeout);
    if (ownsClient) {
      client.close();
    }
  }
 }
 async function extractCodexStructured(
  req: StructuredExtractionRequest,
  options: CodexMediaUnderstandingProviderOptions,
 ): Promise<StructuredExtractionResult> {
  const model = req.model.trim();
  if (!model) {
    throw new Error("Codex structured extraction requires model id.");
  }
  const instructions = req.instructions.trim();
  if (!instructions) {
    throw new Error("Codex structured extraction requires instructions.");
  }
  if (req.input.length === 0) {
    throw new Error("Codex structured extraction requires at least one input.");
  }
  if (!req.input.some((entry) => entry.type === "image")) {
    throw new Error("Codex structured extraction requires at least one image input.");
  }
  const text = await runBoundedCodexVisionTurn({
    model,
    profile: req.profile,
    timeoutMs: req.timeoutMs,
    agentDir: req.agentDir,
    options,
    taskLabel: "structured extraction",
    developerInstructions:
      "You are OpenClaw's bounded structured-extraction worker. Return only the requested extraction. Do not call tools, edit files, ask follow-up questions, or include secrets.",
    input: buildCodexStructuredInput(req),
    requiredModalities: requiredStructuredModalities(),
  });
  return normalizeStructuredExtractionResult({ text, model, provider: req.provider, req });
 }
 function denyCodexImageApprovalRequest(request: { method: string }): JsonValue | undefined {
  if (
    request.method === "item/commandExecution/requestApproval" ||
    request.method === "item/fileChange/requestApproval"
  ) {
    return {
      decision: "decline",
      reason: "OpenClaw Codex image understanding does not grant tool or file approvals.",
    };
  }
  if (request.method === "item/permissions/requestApproval") {
    return { permissions: {}, scope: "turn" };
  }
  if (request.method.includes("requestApproval")) {
    return {
      decision: "decline",
      reason: "OpenClaw Codex image understanding does not grant native approvals.",
    };
  }
  if (request.method === "mcpServer/elicitation/request") {
    return { action: "decline" };
  }
  return undefined;
 }
 async function assertCodexModelSupportsInput(params: {
  client: CodexAppServerClient;
  model: string;
  requiredModalities: string[];
  timeoutMs: number;
  signal: AbortSignal;
 }): Promise<void> {
  const result = await params.client.request<unknown>(
    "model/list",
    { limit: 100, cursor: null, includeHidden: false },
    { timeoutMs: Math.min(params.timeoutMs, 5_000), signal: params.signal },
  );
  const listed = readModelListResult(result).models;
  const match = listed.find((entry) => entry.model === params.model || entry.id === params.model);
  if (!match) {
    throw new Error(`Codex app-server model not found: ${params.model}`);
  }
  if (params.requiredModalities.includes("image") && !match.inputModalities.includes("image")) {
    throw new Error(`Codex app-server model does not support images: ${params.model}`);
  }
  if (params.requiredModalities.includes("text") && !match.inputModalities.includes("text")) {
    throw new Error(`Codex app-server model does not support text: ${params.model}`);
  }
 }
 function buildCodexImagePrompt(req: ImagesDescriptionRequest): string {
  const prompt = req.prompt?.trim() || DEFAULT_CODEX_IMAGE_PROMPT;
  if (req.images.length <= 1) {
    return prompt;
  }
  return `${prompt}\n\nAnalyze all ${req.images.length} images together.`;
 }
 function requiredStructuredModalities(): string[] {
  return ["text", "image"];
 }
 function buildCodexStructuredInput(req: StructuredExtractionRequest): CodexUserInput[] {
  return [
    { type: "text", text: buildStructuredExtractionPrompt(req), text_elements: [] },
    ...req.input.map((entry) => {
      if (entry.type === "text") {
        return { type: "text" as const, text: entry.text, text_elements: [] };
      }
      return {
        type: "image" as const,
        url: `data:${entry.mime ?? "image/png"};base64,${entry.buffer.toString("base64")}`,
      };
    }),
  ];
 }
 function buildStructuredExtractionPrompt(req: StructuredExtractionRequest): string {
  return [
    req.instructions.trim(),
    req.schemaName ? `Schema name: ${req.schemaName}` : undefined,
    req.jsonSchema ? `JSON schema:\n${JSON.stringify(req.jsonSchema)}` : undefined,
    req.jsonMode === false
      ? "Return the extraction as concise text."
      : "Return valid JSON only. Do not wrap the JSON in Markdown fences.",
  ]
    .filter((part): part is string => Boolean(part))
    .join("\n\n");
 }
 function isJsonSchemaObject(value: unknown): value is JsonSchemaObject {
  return typeof value === "object" && value !== null && !Array.isArray(value);
 }
 function normalizeStructuredExtractionResult(params: {
  text: string;
  model: string;
  provider: string;
  req: StructuredExtractionRequest;
 }): StructuredExtractionResult {
  const result: StructuredExtractionResult = {
    text: params.text,
    model: params.model,
    provider: params.provider,
    contentType: params.req.jsonMode === false ? "text" : "json",
  };
  if (params.req.jsonMode !== false) {
    try {
      result.parsed = JSON.parse(params.text);
    } catch {
      throw new Error("Codex structured extraction returned invalid JSON.");
    }
    if (isJsonSchemaObject(params.req.jsonSchema)) {
      const validation = validateJsonSchemaValue({
        schema: params.req.jsonSchema,
        cacheKey: "codex.media-understanding.extractStructured",
        value: result.parsed,
        cache: false,
      });
      if (!validation.ok) {
        const message = validation.errors.map((error) => error.text).join("; ") || "invalid";
        throw new Error(`Codex structured extraction JSON did not match schema: ${message}`);
      }
      result.parsed = validation.value;
    }
  }
  return result;
 }
 function createCodexTurnCollector(threadId: string, taskLabel: string) {
  let turnId: string | undefined;
  let completedTurn: CodexTurn | undefined;
  let promptError: string | undefined;
  const pending: CodexServerNotification[] = [];
  const assistantTextByItem = new Map<string, string>();
  const assistantItemOrder: string[] = [];
  let resolveCompletion: (() => void) | undefined;
  const completion = new Promise<void>((resolve) => {
    resolveCompletion = resolve;
  });
  const rememberAssistantText = (itemId: string, text: string) => {
    if (!text) {
      return;
    }
    if (!assistantTextByItem.has(itemId)) {
      assistantItemOrder.push(itemId);
    }
    assistantTextByItem.set(itemId, text);
  };
  const handleNotification = (notification: CodexServerNotification): void => {
    const params = isJsonObject(notification.params) ? notification.params : undefined;
    if (!params || readString(params, "threadId") !== threadId) {
      return;
    }
    if (!turnId) {
      pending.push(notification);
      return;
    }
    const notificationTurnId = readNotificationTurnId(params);
    if (notificationTurnId !== turnId) {
      return;
    }
    if (notification.method === "item/agentMessage/delta") {
      const itemId = readString(params, "itemId") ?? readString(params, "id") ?? "assistant";
      const delta = readString(params, "delta") ?? "";
      rememberAssistantText(itemId, `${assistantTextByItem.get(itemId) ?? ""}${delta}`);
      return;
    }
    if (notification.method === "turn/completed") {
      completedTurn =
        readCodexTurnCompletedNotification(notification.params)?.turn ?? completedTurn;
      resolveCompletion?.();
      return;
    }
    if (notification.method === "error") {
      promptError =
        readCodexErrorNotification(notification.params)?.error.message ??
        `codex app-server ${taskLabel} turn failed`;
      resolveCompletion?.();
    }
  };
  return {
    handleNotification,
    async collect(
      startedTurn: CodexTurn,
      options: { timeoutMs: number; signal: AbortSignal },
    ): Promise<string> {
      turnId = startedTurn.id;
      if (isTerminalTurn(startedTurn)) {
        completedTurn = startedTurn;
      }
      for (const notification of pending.splice(0)) {
        handleNotification(notification);
      }
      if (!completedTurn && !promptError) {
        await waitForTurnCompletion({
          completion,
          timeoutMs: options.timeoutMs,
          signal: options.signal,
          taskLabel,
        });
      }
      if (promptError) {
        throw new Error(promptError);
      }
      if (completedTurn?.status === "failed") {
        throw new Error(
          completedTurn.error?.message ?? `codex app-server ${taskLabel} turn failed`,
        );
      }
      const itemText = collectAssistantTextFromItems(completedTurn?.items);
      const deltaText = assistantItemOrder
        .map((itemId) => assistantTextByItem.get(itemId)?.trim())
        .filter((text): text is string => Boolean(text))
        .join("\n\n")
        .trim();
      const text = (itemText || deltaText).trim();
      if (!text) {
        throw new Error(`Codex app-server ${taskLabel} turn returned no text.`);
      }
      return text;
    },
  };
 }
 async function waitForTurnCompletion(params: {
  completion: Promise<void>;
  timeoutMs: number;
  signal: AbortSignal;
  taskLabel: string;
 }): Promise<void> {
  let timeout: ReturnType<typeof setTimeout> | undefined;
  let cleanupAbort: (() => void) | undefined;
  try {
    await Promise.race([
      params.completion,
      new Promise<never>((_, reject) => {
        timeout = setTimeout(
          () => reject(new Error(`codex app-server ${params.taskLabel} turn timed out`)),
          params.timeoutMs,
        );
        timeout.unref?.();
        const abortListener = () =>
          reject(new Error(`codex app-server ${params.taskLabel} turn aborted`));
        params.signal.addEventListener("abort", abortListener, { once: true });
        cleanupAbort = () => params.signal.removeEventListener("abort", abortListener);
      }),
    ]);
  } finally {
    if (timeout) {
      clearTimeout(timeout);
    }
    cleanupAbort?.();
  }
 }
 function collectAssistantTextFromItems(items: CodexThreadItem[] | undefined): string {
  return (items ?? [])
    .filter((item) => item.type === "agentMessage")
    .map((item) => item.text.trim())
    .filter(Boolean)
    .join("\n\n")
    .trim();
 }
 function readNotificationTurnId(record: JsonObject): string | undefined {
  const direct = readString(record, "turnId");
  if (direct) {
    return direct;
  }
  return isJsonObject(record.turn) ? readString(record.turn, "id") : undefined;
 }
 function readString(record: JsonObject, key: string): string | undefined {
  const value = record[key];
  return typeof value === "string" ? value : undefined;
 }
 function isTerminalTurn(turn: CodexTurn): boolean {
  return turn.status === "completed" || turn.status === "interrupted" || turn.status === "failed";
 }
--- a/extensions/codex/provider.test.ts
+++ b/extensions/codex/provider.test.ts
@@ -4,10 +4,10 @@ import { CODEX_GPT5_BEHAVIOR_CONTRACT } from "./prompt-overlay.js";
 import { codexProviderDiscovery } from "./provider-discovery.js";
 import { buildCodexProvider, buildCodexProviderCatalog } from "./provider.js";
 import { CodexAppServerClient } from "./src/app-server/client.js";
-import type { listAllCodexAppServerModels } from "./src/app-server/models.js";
+import type { listCodexAppServerModels } from "./src/app-server/models.js";
 import {
  createIsolatedCodexAppServerClient,
-  leaseSharedCodexAppServerClient,
+  getSharedCodexAppServerClient,
  resetSharedCodexAppServerClientForTests,
 } from "./src/app-server/shared-client.js";
@@ -26,8 +26,7 @@ function createFakeCodexClient(): CodexAppServerClient {
  return {
    initialize: vi.fn(async () => undefined),
    request: vi.fn(async () => ({ data: [] })),
-    addNotificationHandler: vi.fn(() => () => undefined),
+    setActiveSharedLeaseCountProviderForUnscopedNotifications: vi.fn(),
    addRequestHandler: vi.fn(() => () => undefined),
    addCloseHandler: vi.fn(() => () => undefined),
    close: vi.fn(),
  } as unknown as CodexAppServerClient;
@@ -40,7 +39,7 @@ const TEST_CODEX_APP_SERVER_CONFIG = {
 };
 async function listTestCodexAppServerModels(
-  options: Parameters<typeof listAllCodexAppServerModels>[0] = {},
+  options: Parameters<typeof listCodexAppServerModels>[0] = {},
 ) {
  expect(options.sharedClient).toBe(false);
  const client = await createIsolatedCodexAppServerClient({
@@ -184,33 +183,45 @@ describe("codex provider", () => {
    expect(resultProvider?.models.map((model) => model.id)).toEqual(["gpt-5.4"]);
  });
-  it("delegates all-page discovery to one model lister call", async () => {
+  it("pages through live discovery before building the provider catalog", async () => {
-    const listModels = vi.fn(async () => ({
+    const listModels = vi
-      models: [
+      .fn()
-        {
+      .mockResolvedValueOnce({
-          id: "gpt-5.4",
+        models: [
-          model: "gpt-5.4",
+          {
-          hidden: false,
+            id: "gpt-5.4",
-          inputModalities: ["text", "image"],
+            model: "gpt-5.4",
-          supportedReasoningEfforts: ["medium"],
+            hidden: false,
-        },
+            inputModalities: ["text", "image"],
-        {
+            supportedReasoningEfforts: ["medium"],
-          id: "gpt-5.5",
+          },
-          model: "gpt-5.5",
+        ],
-          hidden: false,
+        nextCursor: "page-2",
-          inputModalities: ["text"],
+      })
-          supportedReasoningEfforts: [],
+      .mockResolvedValueOnce({
-        },
+        models: [
-      ],
+          {
-    }));
+            id: "gpt-5.5",
            model: "gpt-5.5",
            hidden: false,
            inputModalities: ["text"],
            supportedReasoningEfforts: [],
          },
        ],
      });
    const result = await buildCodexProviderCatalog({
      env: {},
      listModels,
    });
    expect(listModels).toHaveBeenCalledTimes(1);
    expectRecordFields(mockCallArg(listModels, 0), {
      cursor: undefined,
      limit: 100,
      sharedClient: false,
    });
    expectRecordFields(mockCallArg(listModels, 1), {
      cursor: "page-2",
      limit: 100,
      sharedClient: false,
    });
@@ -266,7 +277,7 @@ describe("codex provider", () => {
      .mockReturnValueOnce(activeClient)
      .mockReturnValueOnce(discoveryClient);
-    await leaseSharedCodexAppServerClient({
+    await getSharedCodexAppServerClient({
      startOptions: {
        transport: "stdio",
        command: "/tmp/openclaw-test-codex",
--- a/extensions/codex/provider.ts
+++ b/extensions/codex/provider.ts
@@ -18,11 +18,16 @@ import {
  CODEX_PROVIDER_ID,
  FALLBACK_CODEX_MODELS,
 } from "./provider-catalog.js";
-import type { CodexAppServerStartOptions } from "./src/app-server/config.js";
+import {
  type CodexAppServerStartOptions,
  readCodexPluginConfig,
  resolveCodexAppServerRuntimeOptions,
 } from "./src/app-server/config.js";
 import type {
  CodexAppServerModel,
  CodexAppServerModelListResult,
 } from "./src/app-server/models.js";
 import { buildCodexAppServerUsageSnapshot } from "./src/app-server/rate-limits.js";
 const DEFAULT_DISCOVERY_TIMEOUT_MS = 2500;
 const LIVE_DISCOVERY_ENV = "OPENCLAW_CODEX_DISCOVERY_LIVE";
@@ -34,6 +39,7 @@ const codexCatalogLog = createSubsystemLogger("codex/catalog");
 type CodexModelLister = (options: {
  timeoutMs: number;
  limit?: number;
  cursor?: string;
  startOptions?: CodexAppServerStartOptions;
  sharedClient?: boolean;
 }) => Promise<CodexAppServerModelListResult>;
@@ -117,11 +123,6 @@ export function buildCodexProvider(options: BuildCodexProviderOptions = {}): Pro
      }
      const runtimePluginConfig = resolvePluginConfigObject(ctx.config, CODEX_PROVIDER_ID);
      const pluginConfig = runtimePluginConfig ?? (ctx.config ? undefined : options.pluginConfig);
      const [{ resolveCodexAppServerRuntimeOptions }, { buildCodexAppServerUsageSnapshot }] =
        await Promise.all([
          import("./src/app-server/config.js"),
          import("./src/app-server/rate-limits.js"),
        ]);
      const appServer = resolveCodexAppServerRuntimeOptions({ pluginConfig });
      const rateLimits = await (options.readRateLimits ?? requestCodexAppServerRateLimitsLazy)({
        timeoutMs: ctx.timeoutMs,
@@ -155,15 +156,13 @@ export function buildCodexProvider(options: BuildCodexProviderOptions = {}): Pro
 export async function buildCodexProviderCatalog(
  options: BuildCatalogOptions = {},
 ): Promise<{ provider: ModelProviderConfig }> {
  const { readCodexPluginConfig, resolveCodexAppServerRuntimeOptions } =
    await import("./src/app-server/config.js");
  const config = readCodexPluginConfig(options.pluginConfig);
  const appServer = resolveCodexAppServerRuntimeOptions({ pluginConfig: options.pluginConfig });
  const timeoutMs = normalizeTimeoutMs(config.discovery?.timeoutMs);
  let discovered: CodexAppServerModel[] = [];
  if (config.discovery?.enabled !== false && !shouldSkipLiveDiscovery(options.env)) {
    discovered = await listModelsBestEffort({
-      listModels: options.listModels ?? listAllCodexAppServerModelsLazy,
+      listModels: options.listModels ?? listCodexAppServerModelsLazy,
      timeoutMs,
      startOptions: appServer.start,
      onDiscoveryFailure: options.onDiscoveryFailure,
@@ -201,14 +200,22 @@ async function listModelsBestEffort(params: {
  onDiscoveryFailure?: (error: unknown) => void;
 }): Promise<CodexAppServerModel[]> {
  try {
-    // The all-pages helper keeps one app-server client alive across pagination.
+    const models: CodexAppServerModel[] = [];
-    const result = await params.listModels({
+    let cursor: string | undefined;
-      timeoutMs: params.timeoutMs,
+    do {
-      limit: MODEL_DISCOVERY_PAGE_LIMIT,
+      // App-server model listing is paginated; collect every visible model so
-      startOptions: params.startOptions,
+      // aliases and picker rows match the current Codex account.
-      sharedClient: false,
+      const result = await params.listModels({
-    });
+        timeoutMs: params.timeoutMs,
-    return result.models.filter((model) => !model.hidden);
+        limit: MODEL_DISCOVERY_PAGE_LIMIT,
        cursor,
        startOptions: params.startOptions,
        sharedClient: false,
      });
      models.push(...result.models.filter((model) => !model.hidden));
      cursor = result.nextCursor;
    } while (cursor);
    return models;
  } catch (error) {
    params.onDiscoveryFailure?.(error);
    codexCatalogLog.debug("codex model discovery failed; using fallback catalog", {
@@ -218,14 +225,15 @@ async function listModelsBestEffort(params: {
  }
 }
-async function listAllCodexAppServerModelsLazy(options: {
+async function listCodexAppServerModelsLazy(options: {
  timeoutMs: number;
  limit?: number;
  cursor?: string;
  startOptions?: CodexAppServerStartOptions;
  sharedClient?: boolean;
 }): Promise<CodexAppServerModelListResult> {
-  const { listAllCodexAppServerModels } = await import("./src/app-server/models.js");
+  const { listCodexAppServerModels } = await import("./src/app-server/models.js");
-  return listAllCodexAppServerModels(options);
+  return listCodexAppServerModels(options);
 }
 async function requestCodexAppServerRateLimitsLazy(options: {
--- a/extensions/codex/src/app-server/app-server-policy.test.ts
+++ b/extensions/codex/src/app-server/app-server-policy.test.ts
@@ -1,6 +1,9 @@
 // Codex tests cover app server policy plugin behavior.
 import { describe, expect, it } from "vitest";
-import { resolveCodexAppServerForOpenClawToolPolicy } from "./app-server-policy.js";
+import {
  resolveCodexAppServerForModelProvider,
  resolveCodexAppServerForOpenClawToolPolicy,
 } from "./app-server-policy.js";
 import { readCodexPluginConfig, resolveCodexAppServerRuntimeOptions } from "./config.js";
 describe("Codex app-server policy", () => {
@@ -66,4 +69,143 @@ describe("Codex app-server policy", () => {
    expect(explicitEnv.approvalPolicy).toBe("never");
    expect(explicitRequirements.approvalPolicy).toBe("never");
  });
  it("keeps model-backed reviewers for explicit OpenAI model providers", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      execMode: "auto",
      modelProvider: "openai",
    });
    expect(
      resolveCodexAppServerForModelProvider({
        appServer,
        provider: "codex",
        model: "openai/gpt-5.5",
      }).approvalsReviewer,
    ).toBe("auto_review");
    expect(
      resolveCodexAppServerForModelProvider({
        appServer,
        provider: "codex",
        model: "gpt-5.5",
      }).approvalsReviewer,
    ).toBe("user");
    expect(
      resolveCodexAppServerForModelProvider({ appServer, provider: "openai" }).approvalsReviewer,
    ).toBe("auto_review");
  });
  it("uses human approval for OpenAI-compatible custom endpoints", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      execMode: "auto",
      modelProvider: "openai",
      model: "gpt-5.5",
      config: {
        models: {
          providers: {
            openai: {
              baseUrl: "http://localhost:8080/v1",
              models: [],
            },
          },
        },
      },
    });
    expect(appServer.approvalsReviewer).toBe("user");
    expect(
      resolveCodexAppServerForModelProvider({
        appServer,
        provider: "openai",
        model: "gpt-5.5",
        config: {
          models: {
            providers: {
              openai: {
                baseUrl: "http://localhost:8080/v1",
                models: [],
              },
            },
          },
        },
      }).approvalsReviewer,
    ).toBe("user");
  });
  it("uses human approval instead of Codex Guardian for custom model providers", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      execMode: "auto",
      modelProvider: "openai",
    });
    const resolved = resolveCodexAppServerForModelProvider({
      appServer,
      provider: "lmstudio",
    });
    const vendorPrefixedModel = resolveCodexAppServerForModelProvider({
      appServer,
      provider: "openrouter",
      model: "openai/gpt-5.5",
    });
    expect(appServer.approvalsReviewer).toBe("auto_review");
    expect(resolved.approvalPolicy).toBe("on-request");
    expect(resolved.sandbox).toBe("workspace-write");
    expect(resolved.approvalsReviewer).toBe("user");
    expect(vendorPrefixedModel.approvalsReviewer).toBe("user");
  });
  it("infers custom providers from provider-qualified model refs", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      execMode: "auto",
    });
    expect(
      resolveCodexAppServerForModelProvider({
        appServer,
        model: "lmstudio/local-model",
      }).approvalsReviewer,
    ).toBe("user");
  });
  it("uses provider-qualified model refs to override broad native provider wrappers", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      execMode: "auto",
    });
    expect(
      resolveCodexAppServerForModelProvider({
        appServer,
        provider: "codex",
        model: "lmstudio/local-model",
      }).approvalsReviewer,
    ).toBe("user");
  });
  it("downgrades legacy guardian_subagent for custom model providers", () => {
    const appServer = resolveCodexAppServerRuntimeOptions({
      env: {},
      requirementsToml: null,
      pluginConfig: {
        appServer: {
          mode: "guardian",
          approvalsReviewer: "guardian_subagent",
        },
      },
    });
    expect(
      resolveCodexAppServerForModelProvider({ appServer, provider: "local" }).approvalsReviewer,
    ).toBe("user");
  });
 });
--- a/extensions/codex/src/app-server/app-server-policy.ts
+++ b/extensions/codex/src/app-server/app-server-policy.ts
@@ -2,10 +2,11 @@
 * Policy promotion for Codex app-server runs that can safely use OpenClaw tool
 * approvals.
 */
-import type {
+import {
-  CodexAppServerRuntimeOptions,
+  canUseCodexModelBackedApprovalsReviewerForModel,
-  CodexPluginConfig,
+  type CodexAppServerRuntimeOptions,
-  OpenClawExecPolicyForCodexAppServer,
+  type CodexPluginConfig,
  type OpenClawExecPolicyForCodexAppServer,
 } from "./config.js";
 /**
@@ -44,6 +45,35 @@ export function resolveCodexAppServerForOpenClawToolPolicy(params: {
  };
 }
 export function resolveCodexAppServerForModelProvider(params: {
  appServer: CodexAppServerRuntimeOptions;
  provider?: string;
  model?: string;
  config?: Parameters<typeof canUseCodexModelBackedApprovalsReviewerForModel>[0]["config"];
  env?: NodeJS.ProcessEnv;
  agentDir?: string;
  codexConfigToml?: string | null;
 }): CodexAppServerRuntimeOptions {
  const explicitProvider = normalizeModelBackedReviewerProvider(params.provider);
  if (
    !isCodexModelBackedApprovalsReviewer(params.appServer.approvalsReviewer) ||
    canUseCodexModelBackedApprovalsReviewerForModel({
      modelProvider: explicitProvider,
      model: params.model,
      config: params.config,
      env: params.env,
      agentDir: params.agentDir,
      codexConfigToml: params.codexConfigToml,
    })
  ) {
    return params.appServer;
  }
  return {
    ...params.appServer,
    approvalsReviewer: "user",
  };
 }
 function isCodexAppServerPolicyMode(value: unknown): boolean {
  return value === "guardian" || value === "yolo";
 }
@@ -53,3 +83,12 @@ function isCodexAppServerApprovalPolicy(value: unknown): boolean {
    value === "never" || value === "on-request" || value === "on-failure" || value === "untrusted"
  );
 }
 function isCodexModelBackedApprovalsReviewer(value: string): boolean {
  return value === "auto_review" || value === "guardian_subagent";
 }
 function normalizeModelBackedReviewerProvider(provider: string | undefined): string | undefined {
  const normalized = provider?.trim().toLowerCase();
  return normalized || undefined;
 }
--- a/extensions/codex/src/app-server/approval-bridge.ts
+++ b/extensions/codex/src/app-server/approval-bridge.ts
@@ -285,7 +285,8 @@ function matchesCurrentTurn(
  if (!requestParams) {
    return false;
  }
-  const requestThreadId = readString(requestParams, "threadId");
+  const requestThreadId =
    readString(requestParams, "threadId") ?? readString(requestParams, "conversationId");
  const requestTurnId = readString(requestParams, "turnId");
  return requestThreadId === threadId && requestTurnId === turnId;
 }
--- a/extensions/codex/src/app-server/attempt-client-cleanup.test.ts
+++ b/extensions/codex/src/app-server/attempt-client-cleanup.test.ts
@@ -2,41 +2,10 @@
 import { describe, expect, it, vi } from "vitest";
 import {
  interruptCodexTurnBestEffort,
  runCodexTurnStartWithLease,
  settleCodexAppServerClientLease,
  unsubscribeCodexThreadBestEffort,
  validateCodexThreadCreationResponse,
 } from "./attempt-client-cleanup.js";
 import { CodexAppServerRpcError } from "./client.js";
 describe("Codex app-server attempt client cleanup", () => {
  it("keeps the client lease after a structured turn-start rejection", async () => {
    const abandon = vi.fn(async () => undefined);
    const error = new CodexAppServerRpcError({ message: "turn rejected" }, "turn/start");
    await expect(
      runCodexTurnStartWithLease({ abandon } as never, async () => {
        throw error;
      }),
    ).rejects.toBe(error);
    expect(abandon).not.toHaveBeenCalled();
  });
  it("abandons only the exact client lease after an ambiguous turn-start timeout", async () => {
    const abandon = vi.fn(async () => undefined);
    const otherAbandon = vi.fn(async () => undefined);
    await expect(
      runCodexTurnStartWithLease({ abandon } as never, async () => {
        throw new Error("turn/start timed out");
      }),
    ).rejects.toThrow("turn/start timed out");
    expect(abandon).toHaveBeenCalledTimes(1);
    expect(otherAbandon).not.toHaveBeenCalled();
  });
  it("interrupts turns with optional request timeout", () => {
    const request = vi.fn(async () => ({}));
@@ -53,58 +22,7 @@ describe("Codex app-server attempt client cleanup", () => {
    );
  });
-  it("unsubscribes a retained thread when its create response is malformed", async () => {
+  it("swallows unsubscribe cleanup failures", async () => {
    const request = vi.fn(async () => ({}));
    const abandon = vi.fn(async () => undefined);
    const invalidResponse = { thread: { id: "thread-1" } };
    await expect(
      validateCodexThreadCreationResponse(
        { client: { request } as never, abandon },
        invalidResponse,
        () => {
          throw new Error("invalid thread/start response");
        },
      ),
    ).rejects.toThrow("invalid thread/start response");
    expect(request).toHaveBeenCalledWith(
      "thread/unsubscribe",
      { threadId: "thread-1" },
      { timeoutMs: 5_000 },
    );
    expect(abandon).not.toHaveBeenCalled();
  });
  it.each([
    ["omits the retained thread id", {}, vi.fn(async () => ({}))],
    [
      "cannot confirm unsubscribe",
      { thread: { id: "thread-1" } },
      vi.fn(async () => {
        throw new Error("connection lost");
      }),
    ],
  ])(
    "retires the client when a malformed create response %s",
    async (_label, response, request) => {
      const abandon = vi.fn(async () => undefined);
      await expect(
        validateCodexThreadCreationResponse(
          { client: { request } as never, abandon },
          response,
          () => {
            throw new Error("invalid thread/start response");
          },
        ),
      ).rejects.toThrow("subscription could not be released");
      expect(abandon).toHaveBeenCalledOnce();
    },
  );
  it("reports unsubscribe cleanup failures", async () => {
    const request = vi.fn(async () => {
      throw new Error("already gone");
    });
@@ -114,7 +32,7 @@ describe("Codex app-server attempt client cleanup", () => {
        threadId: "thread-1",
        timeoutMs: 123,
      }),
-    ).resolves.toBe(false);
+    ).resolves.toBeUndefined();
    expect(request).toHaveBeenCalledWith(
      "thread/unsubscribe",
@@ -122,31 +40,4 @@ describe("Codex app-server attempt client cleanup", () => {
      { timeoutMs: 123 },
    );
  });
  it("returns leases only after thread cleanup is confirmed", async () => {
    const release = vi.fn();
    const abandon = vi.fn(async () => undefined);
    await settleCodexAppServerClientLease(
      { client: { request: vi.fn(async () => ({})) }, release, abandon } as never,
      { threadId: "thread-ok", timeoutMs: 123 },
    );
    expect(release).toHaveBeenCalledOnce();
    expect(abandon).not.toHaveBeenCalled();
    release.mockClear();
    await settleCodexAppServerClientLease(
      {
        client: {
          request: vi.fn(async () => {
            throw new Error("unsubscribe failed");
          }),
        },
        release,
        abandon,
      } as never,
      { threadId: "thread-stale", timeoutMs: 123 },
    );
    expect(release).not.toHaveBeenCalled();
    expect(abandon).toHaveBeenCalledOnce();
  });
 });
--- a/extensions/codex/src/app-server/attempt-client-cleanup.ts
+++ b/extensions/codex/src/app-server/attempt-client-cleanup.ts
@@ -2,126 +2,14 @@
 * Best-effort cleanup helpers for timed-out or aborted Codex app-server turns.
 */
 import { embeddedAgentLog } from "openclaw/plugin-sdk/agent-harness-runtime";
-import { CodexAppServerRpcError, type CodexAppServerClient } from "./client.js";
+import type { CodexAppServerClient } from "./client.js";
-import { isJsonObject, readCodexThreadCreationResponseId } from "./protocol.js";
+import { retireSharedCodexAppServerClientIfCurrent } from "./shared-client.js";
 import type { CodexAppServerClientLease } from "./shared-client.js";
 /** Timeout for best-effort app-server turn interruption during cleanup. */
 export const CODEX_APP_SERVER_INTERRUPT_TIMEOUT_MS = 5_000;
 /** Timeout for best-effort thread unsubscribe during cleanup. */
 export const CODEX_APP_SERVER_UNSUBSCRIBE_TIMEOUT_MS = 5_000;
 /** The connection's thread-subscription ownership can no longer be proven. */
 export class CodexAppServerUnsafeSubscriptionError extends Error {
  constructor(message: string, options?: ErrorOptions) {
    super(message, options);
    this.name = "CodexAppServerUnsafeSubscriptionError";
  }
 }
 export function isCodexAppServerUnsafeSubscriptionError(
  error: unknown,
 ): error is CodexAppServerUnsafeSubscriptionError {
  return error instanceof CodexAppServerUnsafeSubscriptionError;
 }
 /** A resume response may only describe the thread this connection retained. */
 export function assertCodexThreadResumeSubscription(
  requestedThreadId: string,
  returnedThreadId: string,
 ): void {
  if (returnedThreadId !== requestedThreadId) {
    throw new CodexAppServerUnsafeSubscriptionError(
      `Codex thread/resume returned ${returnedThreadId} for ${requestedThreadId}`,
    );
  }
 }
 /** Retires the exact client lease when turn acceptance is ambiguous. */
 export async function runCodexTurnStartWithLease<T>(
  lease: CodexAppServerClientLease,
  startTurn: () => Promise<T>,
 ): Promise<T> {
  try {
    return await startTurn();
  } catch (error) {
    // Structured RPC rejection happens before Codex accepts the turn. Transport,
    // timeout, and abort failures may hide an accepted turn with an unknown id.
    if (!(error instanceof CodexAppServerRpcError)) {
      await lease.abandon();
    }
    throw error;
  }
 }
 /** Retries once when native work wins the race immediately before turn/start. */
 export async function runCodexTurnStartWithNativeTurnRetry<T>(params: {
  startTurn: () => Promise<T>;
  waitForActiveTurnCompletion: () => Promise<boolean>;
  afterActiveTurnCompletion?: () => Promise<void>;
  onRetry?: () => void;
 }): Promise<T> {
  try {
    return await params.startTurn();
  } catch (error) {
    if (!isCodexActiveTurnNotSteerableError(error)) {
      throw error;
    }
    params.onRetry?.();
    if (!(await params.waitForActiveTurnCompletion())) {
      throw error;
    }
    await params.afterActiveTurnCompletion?.();
    return await params.startTurn();
  }
 }
 /** True for Codex's structured rejection when native work already owns the thread. */
 export function isCodexActiveTurnNotSteerableError(error: unknown): boolean {
  if (!(error instanceof CodexAppServerRpcError) || !isJsonObject(error.data)) {
    return false;
  }
  const info = error.data.codexErrorInfo;
  return isJsonObject(info) && isJsonObject(info.activeTurnNotSteerable);
 }
 /** Validates a create response and retires the client unless cleanup is confirmed. */
 export async function validateCodexThreadCreationResponse<T>(
  owner: {
    client: CodexAppServerClient;
    abandon: () => Promise<void>;
  },
  response: unknown,
  validate: (value: unknown) => T,
 ): Promise<T> {
  try {
    return validate(response);
  } catch (error) {
    const threadId = readCodexThreadCreationResponseId(response);
    const released = threadId
      ? await unsubscribeCodexThreadBestEffort(owner.client, {
          threadId,
          timeoutMs: CODEX_APP_SERVER_UNSUBSCRIBE_TIMEOUT_MS,
        })
      : false;
    if (released) {
      throw error;
    }
    try {
      await owner.abandon();
    } catch (abandonError) {
      throw new CodexAppServerUnsafeSubscriptionError(
        "Codex thread creation response was invalid and its client could not be retired",
        { cause: abandonError },
      );
    }
    throw new CodexAppServerUnsafeSubscriptionError(
      "Codex thread creation response was invalid and its subscription could not be released",
      { cause: error },
    );
  }
 }
 /** Sends a turn interrupt without blocking abort cleanup on app-server errors. */
 export function interruptCodexTurnBestEffort(
  client: CodexAppServerClient,
@@ -148,56 +36,28 @@ export function interruptCodexTurnBestEffort(
  }
 }
-/** Unsubscribes from a thread and reports whether wire cleanup was confirmed. */
+/** Unsubscribes from a thread while swallowing cleanup-only failures. */
 export async function unsubscribeCodexThreadBestEffort(
  client: CodexAppServerClient,
  params: {
    threadId: string;
    timeoutMs: number;
  },
-): Promise<boolean> {
+): Promise<void> {
  try {
    await client.request(
      "thread/unsubscribe",
      { threadId: params.threadId },
      { timeoutMs: params.timeoutMs },
    );
    return true;
  } catch (error) {
    embeddedAgentLog.debug("codex app-server thread unsubscribe cleanup failed", {
      threadId: params.threadId,
      error,
    });
    return false;
  }
 }
 /** Returns one exact client lease to the pool only after subscription cleanup succeeds. */
 export async function settleCodexAppServerClientLease(
  lease: CodexAppServerClientLease,
  params: {
    threadId?: string;
    timeoutMs: number;
    abandon?: boolean;
  },
 ): Promise<void> {
  if (params.abandon) {
    await lease.abandon();
    return;
  }
  if (
    params.threadId &&
    !(await unsubscribeCodexThreadBestEffort(lease.client, {
      threadId: params.threadId,
      timeoutMs: params.timeoutMs,
    }))
  ) {
    await lease.abandon();
    return;
  }
  lease.release();
 }
 /**
 * Retires the shared client after a timed-out turn so later runs do not reuse a
 * potentially wedged app-server connection.
@@ -208,9 +68,10 @@ export async function retireCodexAppServerClientAfterTimedOutTurn(
    threadId: string;
    turnId: string;
    reason: string;
    abandonClientLease: () => Promise<void>;
  },
 ): Promise<void> {
  const retiredSharedClient = retireSharedCodexAppServerClientIfCurrent(client);
  const detachedSharedClient = Boolean(retiredSharedClient);
  interruptCodexTurnBestEffort(client, {
    threadId: params.threadId,
    turnId: params.turnId,
@@ -220,10 +81,28 @@ export async function retireCodexAppServerClientAfterTimedOutTurn(
    threadId: params.threadId,
    timeoutMs: CODEX_APP_SERVER_UNSUBSCRIBE_TIMEOUT_MS,
  });
-  await params.abandonClientLease();
+  let closedClient = retiredSharedClient?.closed ?? false;
  if (!detachedSharedClient) {
    const close = (client as { close?: () => void }).close;
    if (typeof close === "function") {
      try {
        close.call(client);
        closedClient = true;
      } catch (error) {
        embeddedAgentLog.debug("codex app-server client close failed during timeout cleanup", {
          threadId: params.threadId,
          turnId: params.turnId,
          error,
        });
      }
    }
  }
  embeddedAgentLog.warn("codex app-server client retired after timed-out turn", {
    threadId: params.threadId,
    turnId: params.turnId,
    reason: params.reason,
    detachedSharedClient,
    closedClient,
    activeSharedClientLeases: retiredSharedClient?.activeLeases ?? 0,
  });
 }
--- a/extensions/codex/src/app-server/attempt-context.ts
+++ b/extensions/codex/src/app-server/attempt-context.ts
@@ -16,7 +16,6 @@ import {
 } from "openclaw/plugin-sdk/agent-harness-runtime";
 import { resolveAgentWorkspaceDir } from "openclaw/plugin-sdk/agent-runtime";
 import { buildMemorySystemPromptAddition } from "openclaw/plugin-sdk/core";
 import { MESSAGE_TOOL_DELIVERY_HINTS } from "openclaw/plugin-sdk/message-tool-delivery-hints";
 import type { CodexDynamicToolSpec, JsonValue } from "./protocol.js";
 import { isJsonObject } from "./protocol.js";
 import type { CodexAppServerThreadBinding } from "./session-binding.js";
@@ -585,12 +584,17 @@ export function prependCodexOpenClawPromptContext(
  return [context?.trim(), deliverySection, promptSection].filter(Boolean).join("\n\n");
 }
 const CODEX_DELIVERY_HINT_LINES = [
  "Delivery: to send a message, use the `message` tool.",
  "Delivery: Final assistant text is not automatically delivered in this run. Use the `message` tool to send user-visible output.",
 ] as const;
 function splitLeadingCodexDeliveryHint(prompt: string): {
  deliveryHint?: string;
  prompt: string;
 } {
  const trimmedStart = prompt.trimStart();
-  const matchedHint = MESSAGE_TOOL_DELIVERY_HINTS.find((hint) => trimmedStart.startsWith(hint));
+  const matchedHint = CODEX_DELIVERY_HINT_LINES.find((hint) => trimmedStart.startsWith(hint));
  if (!matchedHint) {
    return { prompt };
  }
--- a/extensions/codex/src/app-server/attempt-notification-state.ts
+++ b/extensions/codex/src/app-server/attempt-notification-state.ts
@@ -9,6 +9,7 @@ import {
  isFileChangePatchUpdatedNotification,
  isAssistantCommentaryCompletionNotification,
  isNativeToolProgressNotification,
  isNativeResponseStreamDeltaNotification,
  isPendingOpenClawDynamicToolCompletionNotification,
  isRawAssistantProgressNotification,
  isRawReasoningCompletionNotification,
@@ -16,6 +17,7 @@ import {
  isReasoningProgressNotification,
  isReasoningItemCompletionNotification,
  isRetryableErrorNotification,
  isTurnNotification,
  readCodexNotificationItem,
  readNotificationItemId,
  shouldDisarmAssistantCompletionIdleWatch,
@@ -23,7 +25,6 @@ import {
 } from "./attempt-notifications.js";
 import { CODEX_POST_REASONING_REPLY_IDLE_TIMEOUT_MS } from "./attempt-timeouts.js";
 import type { CodexAttemptTurnWatchController } from "./attempt-turn-watches.js";
 import { isCodexNotificationForTurn } from "./notification-correlation.js";
 import type { CodexServerNotification } from "./protocol.js";
 type CodexExecutionPhase =
@@ -69,7 +70,7 @@ export function isTerminalCodexTurnNotificationForTurn(params: {
  turnId: string;
  currentPromptTexts: string[];
 }): boolean {
-  if (!isCodexNotificationForTurn(params.notification.params, params.threadId, params.turnId)) {
+  if (!isTurnNotification(params.notification.params, params.threadId, params.turnId)) {
    return false;
  }
  return (
@@ -104,15 +105,16 @@ export function applyCodexTurnNotificationState(params: {
  turnCrossedToolHandoff: boolean;
 } {
  const { notification, turnWatches } = params;
-  const isCurrentTurnNotification = isCodexNotificationForTurn(
+  const isCurrentTurnNotification = isTurnNotification(
    notification.params,
    params.threadId,
    params.turnId,
  );
  const isTurnCompletion = notification.method === "turn/completed" && isCurrentTurnNotification;
  const isNativeResponseStreamDelta = isNativeResponseStreamDeltaNotification(notification);
  let turnCrossedToolHandoff = params.turnCrossedToolHandoff;
-  if (isCurrentTurnNotification) {
+  if (isCurrentTurnNotification && !isNativeResponseStreamDelta) {
    turnWatches.touchActivity(`notification:${notification.method}`, {
      details: describeNotificationActivity(notification),
      attemptProgress: true,
@@ -248,6 +250,7 @@ export function applyCodexTurnNotificationState(params: {
    !turnWatches.isCompletionIdleWatchPinnedByTerminalError() &&
    notification.method !== "turn/completed" &&
    isCurrentTurnNotification &&
    !isNativeResponseStreamDelta &&
    !trackedDynamicToolCompletion &&
    !rawToolOutputCompletion &&
    !postToolProgressNeedsTerminalGuard &&
--- a/extensions/codex/src/app-server/attempt-notifications.ts
+++ b/extensions/codex/src/app-server/attempt-notifications.ts
@@ -1,6 +1,11 @@
 /**
 * Predicates and readers for Codex app-server notification envelopes.
 */
 import { asBoolean } from "openclaw/plugin-sdk/string-coerce-runtime";
 import {
  describeCodexNotificationCorrelation,
  isCodexNotificationForTurn,
 } from "./notification-correlation.js";
 import {
  isJsonObject,
  type CodexServerNotification,
@@ -211,6 +216,13 @@ export function isNativeToolProgressNotification(notification: CodexServerNotifi
  }
 }
 /** Returns true for raw native response stream delta events. */
 export function isNativeResponseStreamDeltaNotification(
  notification: CodexServerNotification,
 ): boolean {
  return notification.method.startsWith("response.") && notification.method.endsWith(".delta");
 }
 /** Returns true for file-change patch update notifications. */
 export function isFileChangePatchUpdatedNotification(
  notification: CodexServerNotification,
@@ -265,9 +277,74 @@ function readRawAssistantTextPreview(item: JsonObject): string | undefined {
  return text.length > 240 ? `${text.slice(0, 237)}...` : text;
 }
 /** Returns true when notification params correlate to a specific thread/turn. */
 export function isTurnNotification(
  value: JsonValue | undefined,
  threadId: string,
  turnId: string,
 ): boolean {
  return isCodexNotificationForTurn(value, threadId, turnId);
 }
 /** Returns true when a correlated notification belongs to another active run. */
 export function isCodexNotificationOutsideActiveRun(
  correlation: ReturnType<typeof describeCodexNotificationCorrelation>,
 ): boolean {
  const hasThreadScope = Boolean(correlation.threadId || correlation.nestedTurnThreadId);
  if (!hasThreadScope) {
    return false;
  }
  if (!correlation.matchesActiveThread) {
    return true;
  }
  const hasTurnScope = Boolean(correlation.turnId || correlation.nestedTurnId);
  return hasTurnScope && correlation.matchesActiveTurn === false;
 }
 /** Checks request params that must contain the current thread and turn ids. */
 export function isCurrentThreadTurnRequestParams(
  value: JsonValue | undefined,
  threadId: string,
  turnId: string,
 ): boolean {
  if (!isJsonObject(value)) {
    return false;
  }
  return readString(value, "threadId") === threadId && readString(value, "turnId") === turnId;
 }
 /** Checks approval request params, accepting `conversationId` as thread id. */
 export function isCurrentApprovalTurnRequestParams(
  value: JsonValue | undefined,
  threadId: string,
  turnId: string,
 ): boolean {
  if (!isJsonObject(value)) {
    return false;
  }
  const requestThreadId = readString(value, "threadId") ?? readString(value, "conversationId");
  return requestThreadId === threadId && readString(value, "turnId") === turnId;
 }
 /** Checks request params where `turnId` may be omitted or null for the thread. */
 export function isCurrentThreadOptionalTurnRequestParams(
  value: JsonValue | undefined,
  threadId: string,
  turnId: string,
 ): boolean {
  if (!isJsonObject(value) || readString(value, "threadId") !== threadId) {
    return false;
  }
  const requestTurnId = value.turnId;
  return requestTurnId === null || requestTurnId === undefined || requestTurnId === turnId;
 }
 /** Returns true for app-server error notifications that will retry. */
 export function isRetryableErrorNotification(value: JsonValue | undefined): boolean {
-  return isJsonObject(value) && value.willRetry === true;
+  if (!isJsonObject(value)) {
    return false;
  }
  return readBoolean(value, "willRetry") === true || readBoolean(value, "will_retry") === true;
 }
 /** Returns true for terminal app-server thread status strings. */
@@ -342,6 +419,10 @@ function readString(record: JsonObject, key: string): string | undefined {
  return typeof value === "string" ? value : undefined;
 }
 function readBoolean(record: JsonObject, key: string): boolean | undefined {
  return asBoolean(record[key]);
 }
 /** Reads a typed Codex item from notification params when id/type are present. */
 export function readCodexNotificationItem(
  params: JsonValue | undefined,
--- a/extensions/codex/src/app-server/attempt-startup.test.ts
+++ b/extensions/codex/src/app-server/attempt-startup.test.ts
@@ -9,16 +9,13 @@ import type {
 } from "openclaw/plugin-sdk/agent-harness-runtime";
 import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
 import { startCodexAttemptThread } from "./attempt-startup.js";
 import { defaultLeasedCodexAppServerClientFactory } from "./client-factory.js";
 import { CodexAppServerClient } from "./client.js";
 import { type CodexPluginConfig, resolveCodexAppServerRuntimeOptions } from "./config.js";
 import { threadStartResult } from "./run-attempt-test-harness.js";
 import {
-  resetCodexTestBindingStore,
+  clearSharedCodexAppServerClient,
-  testCodexAppServerBindingStore,
+  getLeasedSharedCodexAppServerClient,
-} from "./session-binding.test-helpers.js";
+  releaseLeasedSharedCodexAppServerClient,
 import {
  leaseSharedCodexAppServerClient,
  resetSharedCodexAppServerClientForTests,
 } from "./shared-client.js";
 import { createClientHarness, createCodexTestModel } from "./test-support.js";
@@ -88,10 +85,12 @@ function startThreadWithHarness(
  signal = new AbortController().signal,
  overrides?: {
    pluginConfig?: CodexPluginConfig;
    attemptClientFactory?: (
      harness: ClientHarness,
    ) => Parameters<typeof startCodexAttemptThread>[0]["attemptClientFactory"];
    harness?: ClientHarness;
    paths?: AttemptPaths;
    skipStartSpy?: boolean;
    onThreadReserved?: Parameters<typeof startCodexAttemptThread>[0]["onThreadReserved"];
  },
 ) {
  const harness = overrides?.harness ?? createClientHarness();
@@ -102,7 +101,8 @@ function startThreadWithHarness(
  const effectivePluginConfig = overrides?.pluginConfig ?? pluginConfig;
  const run = startCodexAttemptThread({
-    bindingStore: testCodexAppServerBindingStore,
+    attemptClientFactory:
      overrides?.attemptClientFactory?.(harness) ?? defaultLeasedCodexAppServerClientFactory,
    appServer: resolveCodexAppServerRuntimeOptions({ pluginConfig: effectivePluginConfig }),
    pluginConfig: effectivePluginConfig,
    computerUseConfig: effectivePluginConfig.computerUse ?? { enabled: false },
@@ -123,11 +123,10 @@ function startThreadWithHarness(
    sandboxExecServerEnabled: false,
    sandbox: null,
    contextEngineProjection: undefined,
    startupTokenGuard: {},
    startupTimeoutMs,
    signal,
    onStartupTimeout: vi.fn(),
-    onThreadReserved: overrides?.onThreadReserved,
+    spawnedBy: undefined,
  });
  return { harness, run };
@@ -169,13 +168,12 @@ describe("startCodexAttemptThread", () => {
    vi.useRealTimers();
    vi.stubEnv("CODEX_API_KEY", "");
    vi.stubEnv("OPENAI_API_KEY", "");
-    resetCodexTestBindingStore();
+    clearSharedCodexAppServerClient();
    resetSharedCodexAppServerClientForTests();
  });
  afterEach(async () => {
    vi.useRealTimers();
-    resetSharedCodexAppServerClientForTests();
+    clearSharedCodexAppServerClient();
    vi.restoreAllMocks();
    vi.unstubAllEnvs();
    for (const root of tempRoots) {
@@ -184,7 +182,7 @@ describe("startCodexAttemptThread", () => {
    tempRoots.clear();
  });
-  it("keeps the shared app-server reusable after a structured startup rejection", async () => {
+  it("clears the shared app-server when top-level thread startup fails with an app error", async () => {
    const { harness, run } = startThreadWithHarness(5_000);
    await answerInitialize(harness);
    const threadStart = await waitForThreadStart(harness);
@@ -194,57 +192,25 @@ describe("startCodexAttemptThread", () => {
    });
    await expect(run).rejects.toThrow("Invalid bearer token");
    expect(harness.process.stdin.destroyed).toBe(false);
  });
  it("retires the client when malformed startup cleanup cannot be confirmed", async () => {
    const { harness, run } = startThreadWithHarness(5_000);
    await answerInitialize(harness);
    const threadStart = await waitForThreadStart(harness);
    harness.send({ id: threadStart.id, result: { thread: { id: "thread-malformed" } } });
    const unsubscribe = await waitForRequest(harness, "thread/unsubscribe");
    harness.send({
      id: unsubscribe.id,
      error: { code: -32000, message: "unsubscribe failed" },
    });
    await expect(run).rejects.toThrow("subscription could not be released");
    expect(harness.process.stdin.destroyed).toBe(true);
  });
-  it("retires the client when route cleanup cannot release the subscription", async () => {
+  it("retires a failed startup client after another active lease releases", async () => {
    const { harness, run } = startThreadWithHarness(5_000, undefined, {
      onThreadReserved: () => {
        throw new Error("route integration failed");
      },
    });
    await answerInitialize(harness);
    const threadStart = await waitForThreadStart(harness);
    harness.send({ id: threadStart.id, result: threadStartResult("thread-route-failed") });
    const unsubscribe = await waitForRequest(harness, "thread/unsubscribe");
    harness.send({
      id: unsubscribe.id,
      error: { code: -32000, message: "unsubscribe failed" },
    });
    await expect(run).rejects.toThrow("Codex startup subscription cleanup failed");
    expect(harness.process.stdin.destroyed).toBe(true);
  });
  it("does not retire a peer-owned client after a structured startup rejection", async () => {
    const retained = createClientHarness();
-    const startSpy = vi.spyOn(CodexAppServerClient, "start").mockReturnValue(retained.client);
+    const replacement = createClientHarness();
    const startSpy = vi
      .spyOn(CodexAppServerClient, "start")
      .mockReturnValueOnce(retained.client)
      .mockReturnValueOnce(replacement.client);
    const appServer = resolveCodexAppServerRuntimeOptions({ pluginConfig });
    const paths = createAttemptPaths();
-    const retainedLeasePromise = leaseSharedCodexAppServerClient({
+    const retainedLease = getLeasedSharedCodexAppServerClient({
      startOptions: appServer.start,
      agentDir: paths.agentDir,
      preparedAuth: {},
    });
    await answerInitialize(retained);
-    const retainedLease = await retainedLeasePromise;
+    await expect(retainedLease).resolves.toBe(retained.client);
    expect(retainedLease.client).toBe(retained.client);
    const { run } = startThreadWithHarness(5_000, new AbortController().signal, {
      harness: retained,
@@ -260,16 +226,17 @@ describe("startCodexAttemptThread", () => {
    await expect(run).rejects.toThrow("Invalid bearer token");
    expect(retained.process.stdin.destroyed).toBe(false);
-    retainedLease.release();
+    expect(releaseLeasedSharedCodexAppServerClient(retained.client)).toBe(true);
-    const nextLeasePromise = leaseSharedCodexAppServerClient({
+    await vi.waitFor(() => expect(retained.process.stdin.destroyed).toBe(true));
    const replacementLease = getLeasedSharedCodexAppServerClient({
      startOptions: appServer.start,
      agentDir: paths.agentDir,
      preparedAuth: {},
    });
-    const nextLease = await nextLeasePromise;
+    await answerInitialize(replacement);
-    expect(nextLease.client).toBe(retained.client);
+    await expect(replacementLease).resolves.toBe(replacement.client);
-    expect(startSpy).toHaveBeenCalledTimes(1);
+    expect(startSpy).toHaveBeenCalledTimes(2);
-    nextLease.release();
+    expect(releaseLeasedSharedCodexAppServerClient(replacement.client)).toBe(true);
  });
  it("clears the shared app-server when startup abandons an in-flight thread request", async () => {
@@ -291,20 +258,18 @@ describe("startCodexAttemptThread", () => {
    expect(harness.stdinDestroyed).toBe(true);
  });
-  it("retires abandoned thread startup even when another lease shares the client", async () => {
+  it("aborts abandoned thread startup when another lease keeps the shared app-server alive", async () => {
    const retained = createClientHarness();
    vi.spyOn(CodexAppServerClient, "start").mockReturnValue(retained.client);
    const appServer = resolveCodexAppServerRuntimeOptions({ pluginConfig });
    const paths = createAttemptPaths();
-    const retainedLeasePromise = leaseSharedCodexAppServerClient({
+    const retainedLease = getLeasedSharedCodexAppServerClient({
      startOptions: appServer.start,
      agentDir: paths.agentDir,
      preparedAuth: {},
    });
    await answerInitialize(retained);
-    const retainedLease = await retainedLeasePromise;
+    await expect(retainedLease).resolves.toBe(retained.client);
    expect(retainedLease.client).toBe(retained.client);
    const { run } = startThreadWithHarness(100, new AbortController().signal, {
      harness: retained,
@@ -315,9 +280,11 @@ describe("startCodexAttemptThread", () => {
    const threadStart = await waitForThreadStart(retained);
    await rejected;
-    expect(threadStart.id).toBeDefined();
+    expect(retained.process.stdin.destroyed).toBe(false);
-    expect(retained.process.stdin.destroyed).toBe(true);
+
-    retainedLease.release();
+    retained.send({ id: threadStart.id, result: { threadId: "late-thread" } });
    expect(releaseLeasedSharedCodexAppServerClient(retained.client)).toBe(true);
    await vi.waitFor(() => expect(retained.process.stdin.destroyed).toBe(true));
  });
  it("closes the shared app-server when startup times out during initialize", async () => {
@@ -342,37 +309,45 @@ describe("startCodexAttemptThread", () => {
    ).toBe(false);
  });
-  it("releases a late startup lease without retiring a peer-owned initializing client", async () => {
+  it("closes a startup client that arrives after startup timeout", async () => {
-    const harness = createClientHarness();
+    let observedFactoryOptions:
-    const startSpy = vi.spyOn(CodexAppServerClient, "start").mockReturnValue(harness.client);
+      | {
-    const paths = createAttemptPaths();
+          onStartedClient?: (client: CodexAppServerClient) => void;
-    const appServer = resolveCodexAppServerRuntimeOptions({ pluginConfig });
+          abandonSignal?: AbortSignal;
-    const peerPromise = leaseSharedCodexAppServerClient({
+        }
-      startOptions: appServer.start,
+      | undefined;
-      agentDir: paths.agentDir,
+    let resolveFactoryDone: () => void = () => undefined;
-      preparedAuth: {},
+    const factoryDone = new Promise<void>((resolve) => {
      resolveFactoryDone = resolve;
    });
-    const { run } = startThreadWithHarness(100, new AbortController().signal, {
+    const { harness, run } = startThreadWithHarness(100, new AbortController().signal, {
-      harness,
+      attemptClientFactory:
-      paths,
+        (factoryHarness) => async (_startOptions, _authProfileId, _agentDir, _config, options) => {
-      skipStartSpy: true,
+          try {
            observedFactoryOptions = options;
            await new Promise<void>((resolve) => {
              setTimeout(resolve, 250);
            });
            options?.onStartedClient?.(factoryHarness.client);
            return factoryHarness.client;
          } finally {
            resolveFactoryDone();
          }
        },
    });
    const rejected = expect(run).rejects.toThrow("codex app-server startup timed out");
-    await expect(run).rejects.toThrow("codex app-server startup timed out");
+    await rejected;
-    expect(harness.stdinDestroyed).toBe(false);
+    await factoryDone;
-    await answerInitialize(harness);
+    await vi.waitFor(() => expect(harness.stdinDestroyed).toBe(true), {
-    const peer = await peerPromise;
+      interval: 1,
-    expect(peer.client).toBe(harness.client);
+      timeout: 2_000,
    await new Promise<void>((resolve) => {
      setImmediate(resolve);
    });
    expect(startSpy).toHaveBeenCalledTimes(1);
    expect(
      readHarnessMessages(harness.writes).some((write) => write.method === "thread/start"),
    ).toBe(false);
-    await peer.abandon();
+    expect(observedFactoryOptions?.onStartedClient).toBeTypeOf("function");
-    expect(harness.stdinDestroyed).toBe(true);
+    expect(observedFactoryOptions?.abandonSignal?.aborted).toBe(true);
  });
  it("clears the shared app-server when cancellation abandons an in-flight thread request", async () => {
--- a/Show More
+++ b/Show More
`@@ -1,2 +1,2 @@`
	`b121079a0912b3051a9fc319a675ef920da9db23364ca0c0ccd3c9f0a05a3a49 plugin-sdk-api-baseline.json`	`2c783beea6b3cda3d79060739a923f9f39e7e8b5942123dd6b08a09143a587ca plugin-sdk-api-baseline.json`
	`61a0108da670e0f44ba4b861c002eb6eaa5cf63e392d4e7e7de42044cbe7d115 plugin-sdk-api-baseline.jsonl`	`0b33af2cffb42abb46682fb71c8f214da220793f13d10a34d332e75ff99e8ce9 plugin-sdk-api-baseline.jsonl`