openclaw/llama-cpp.md at dcf6f66d56eb194cb1b9bb38f7baa1198c58e5ed

liuyu/openclaw

Fork 0

mirror of https://github.com/openclaw/openclaw.git synced 2026-06-06 14:01:24 +08:00

Files

Onur Solmaz dcf6f66d56 fix(memory): move local embeddings to llama.cpp provider

2026-06-04 16:57:17 +08:00

1.6 KiB

Raw Blame History

summary, read_when, title, sidebarTitle

summary

read_when

title

sidebarTitle

Install the official llama.cpp provider for local GGUF memory embeddings

You want memory search embeddings from a local GGUF model

You are configuring memorySearch.provider = "local"

You need the node-llama-cpp runtime dependency

llama.cpp Provider

llama-cpp is the official external provider plugin for local GGUF embeddings. It owns the node-llama-cpp runtime dependency used by memorySearch.provider: "local".

Install it before using local memory embeddings:

openclaw plugins install @openclaw/llama-cpp-provider

The main openclaw npm package does not include node-llama-cpp. Keeping the native dependency in this plugin prevents normal OpenClaw npm updates from deleting a manually installed runtime inside the OpenClaw package directory.

Configuration

Set the memory search provider to local:

{
  agents: {
    defaults: {
      memorySearch: {
        provider: "local",
        local: {
          modelPath: "hf:ggml-org/embeddinggemma-300m-qat-q8_0-GGUF/embeddinggemma-300m-qat-Q8_0.gguf",
        },
      },
    },
  },
}

The default model is embeddinggemma-300m-qat-Q8_0.gguf. You can also point local.modelPath at a local .gguf file.

Native Runtime

Use Node 24 for the smoothest native install path. Source checkouts using pnpm may need to approve and rebuild the native dependency:

pnpm approve-builds
pnpm rebuild node-llama-cpp

For lower-friction local embeddings, use a local service provider such as Ollama or LM Studio instead.

1.6 KiB Raw Blame History

Configuration

Native Runtime

1.6 KiB

Raw Blame History