mirror of
https://github.com/openclaw/openclaw.git
synced 2026-06-20 22:12:53 +08:00
* fix(memory): move local llama.cpp runtime to provider plugin * chore: ignore llama cpp dynamic dependency * test: remove invalid local provider alias fixture * chore: refresh llama cpp shrinkwrap * chore: drop stale memory embedding defaults facade
59 lines
1.6 KiB
Markdown
59 lines
1.6 KiB
Markdown
---
|
|
summary: "Install the official llama.cpp provider for local GGUF memory embeddings"
|
|
read_when:
|
|
- You want memory search embeddings from a local GGUF model
|
|
- You are configuring memorySearch.provider = "local"
|
|
- You need the OpenClaw plugin that owns the node-llama-cpp runtime
|
|
title: "llama.cpp Provider"
|
|
sidebarTitle: "llama.cpp Provider"
|
|
---
|
|
|
|
`llama-cpp` is the official external provider plugin for local GGUF embeddings.
|
|
It owns the `node-llama-cpp` runtime dependency used by
|
|
`memorySearch.provider: "local"`.
|
|
|
|
Install it before using local memory embeddings:
|
|
|
|
```bash
|
|
openclaw plugins install @openclaw/llama-cpp-provider
|
|
```
|
|
|
|
The main `openclaw` npm package does not include `node-llama-cpp`. Keeping the
|
|
native dependency in this plugin prevents normal OpenClaw npm updates from
|
|
deleting a manually installed runtime inside the OpenClaw package directory.
|
|
|
|
## Configuration
|
|
|
|
Set the memory search provider to `local`:
|
|
|
|
```json5
|
|
{
|
|
agents: {
|
|
defaults: {
|
|
memorySearch: {
|
|
provider: "local",
|
|
local: {
|
|
modelPath: "hf:ggml-org/embeddinggemma-300m-qat-q8_0-GGUF/embeddinggemma-300m-qat-Q8_0.gguf",
|
|
},
|
|
},
|
|
},
|
|
},
|
|
}
|
|
```
|
|
|
|
The default model is `embeddinggemma-300m-qat-Q8_0.gguf`. You can also point
|
|
`local.modelPath` at a local `.gguf` file.
|
|
|
|
## Native Runtime
|
|
|
|
Use Node 24 for the smoothest native install path. Source checkouts using pnpm
|
|
may need to approve and rebuild the native dependency:
|
|
|
|
```bash
|
|
pnpm approve-builds
|
|
pnpm rebuild node-llama-cpp
|
|
```
|
|
|
|
For lower-friction local embeddings, use a local service provider such as
|
|
Ollama or LM Studio instead.
|