Semantic memory search for markdown knowledge bases

These details have not been verified by PyPI

Project description

memsearch

OpenClaw's memory, everywhere.

https://github.com/user-attachments/assets/31de76cc-81a8-4462-a47d-bd9c394d33e3

💡 Give your AI agents persistent memory in a few lines of code. Write memories as markdown, search them semantically. Inspired by OpenClaw's markdown-first memory architecture. Pluggable into any agent framework.

✨ Why memsearch?

📝 Markdown is the source of truth — human-readable, git-friendly, zero vendor lock-in. Your memories are just .md files
⚡ Smart dedup — SHA-256 content hashing means unchanged content is never re-embedded
🔄 Live sync — File watcher auto-indexes changes to the vector DB, deletes stale chunks when files are removed
🧩 Ready-made Claude Code plugin — a drop-in example of agent memory built on memsearch

📦 Installation

pip install memsearch

Optional embedding providers

pip install "memsearch[google]"      # Google Gemini
pip install "memsearch[voyage]"      # Voyage AI
pip install "memsearch[ollama]"      # Ollama (local)
pip install "memsearch[local]"       # sentence-transformers (local, no API key)
pip install "memsearch[all]"         # Everything

🐍 Python API — Give Your Agent Memory

from memsearch import MemSearch

mem = MemSearch(paths=["./memory"])

await mem.index()                                      # index markdown files
results = await mem.search("Redis config", top_k=3)    # semantic search
print(results[0]["content"], results[0]["score"])       # content + similarity

🚀 Full example — agent with memory (OpenAI) — click to expand

import asyncio
from datetime import date
from pathlib import Path
from openai import OpenAI
from memsearch import MemSearch

MEMORY_DIR = "./memory"
llm = OpenAI()                                        # your LLM client
mem = MemSearch(paths=[MEMORY_DIR])                    # memsearch handles the rest

def save_memory(content: str):
    """Append a note to today's memory log (OpenClaw-style daily markdown)."""
    p = Path(MEMORY_DIR) / f"{date.today()}.md"
    p.parent.mkdir(parents=True, exist_ok=True)
    with open(p, "a") as f:
        f.write(f"\n{content}\n")

async def agent_chat(user_input: str) -> str:
    # 1. Recall — search past memories for relevant context
    memories = await mem.search(user_input, top_k=3)
    context = "\n".join(f"- {m['content'][:200]}" for m in memories)

    # 2. Think — call LLM with memory context
    resp = llm.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": f"You have these memories:\n{context}"},
            {"role": "user", "content": user_input},
        ],
    )
    answer = resp.choices[0].message.content

    # 3. Remember — save this exchange and index it
    save_memory(f"## {user_input}\n{answer}")
    await mem.index()

    return answer

async def main():
    # Seed some knowledge
    save_memory("## Team\n- Alice: frontend lead\n- Bob: backend lead")
    save_memory("## Decision\nWe chose Redis for caching over Memcached.")
    await mem.index()  # or mem.watch() to auto-index in the background

    # Agent can now recall those memories
    print(await agent_chat("Who is our frontend lead?"))
    print(await agent_chat("What caching solution did we pick?"))

asyncio.run(main())

💜 Anthropic Claude example — click to expand

pip install memsearch anthropic

import asyncio
from datetime import date
from pathlib import Path
from anthropic import Anthropic
from memsearch import MemSearch

MEMORY_DIR = "./memory"
llm = Anthropic()
mem = MemSearch(paths=[MEMORY_DIR])

def save_memory(content: str):
    p = Path(MEMORY_DIR) / f"{date.today()}.md"
    p.parent.mkdir(parents=True, exist_ok=True)
    with open(p, "a") as f:
        f.write(f"\n{content}\n")

async def agent_chat(user_input: str) -> str:
    # 1. Recall
    memories = await mem.search(user_input, top_k=3)
    context = "\n".join(f"- {m['content'][:200]}" for m in memories)

    # 2. Think — call Claude with memory context
    resp = llm.messages.create(
        model="claude-sonnet-4-5-20250929",
        max_tokens=1024,
        system=f"You have these memories:\n{context}",
        messages=[{"role": "user", "content": user_input}],
    )
    answer = resp.content[0].text

    # 3. Remember
    save_memory(f"## {user_input}\n{answer}")
    await mem.index()
    return answer

async def main():
    save_memory("## Team\n- Alice: frontend lead\n- Bob: backend lead")
    await mem.index()
    print(await agent_chat("Who is our frontend lead?"))

asyncio.run(main())

🦙 Ollama (fully local, no API key) — click to expand

pip install "memsearch[ollama]"
ollama pull nomic-embed-text          # embedding model
ollama pull llama3.2                  # chat model

import asyncio
from datetime import date
from pathlib import Path
from ollama import chat
from memsearch import MemSearch

MEMORY_DIR = "./memory"
mem = MemSearch(paths=[MEMORY_DIR], embedding_provider="ollama")

def save_memory(content: str):
    p = Path(MEMORY_DIR) / f"{date.today()}.md"
    p.parent.mkdir(parents=True, exist_ok=True)
    with open(p, "a") as f:
        f.write(f"\n{content}\n")

async def agent_chat(user_input: str) -> str:
    # 1. Recall
    memories = await mem.search(user_input, top_k=3)
    context = "\n".join(f"- {m['content'][:200]}" for m in memories)

    # 2. Think — call Ollama locally
    resp = chat(
        model="llama3.2",
        messages=[
            {"role": "system", "content": f"You have these memories:\n{context}"},
            {"role": "user", "content": user_input},
        ],
    )
    answer = resp.message.content

    # 3. Remember
    save_memory(f"## {user_input}\n{answer}")
    await mem.index()
    return answer

async def main():
    save_memory("## Team\n- Alice: frontend lead\n- Bob: backend lead")
    await mem.index()
    print(await agent_chat("Who is our frontend lead?"))

asyncio.run(main())

🖥️ CLI Usage

memsearch index ./memory/                          # index markdown files
memsearch search "how to configure Redis caching"  # semantic search
memsearch watch ./memory/                          # auto-index on file changes
memsearch compact                                  # LLM-powered memory summarization
memsearch config init                              # interactive config wizard
memsearch stats                                    # show index statistics

📖 Full command reference with all flags and examples → CLI Reference

🔍 How It Works

Markdown is the source of truth — the vector store is just a derived index, rebuildable anytime.

  ┌─── Search ─────────────────────────────────────────────────────────┐
  │                                                                    │
  │  "how to configure Redis?"                                         │
  │        │                                                           │
  │        ▼                                                           │
  │   ┌──────────┐     ┌─────────────────┐     ┌──────────────────┐   │
  │   │  Embed   │────▶│ Cosine similarity│────▶│ Top-K results    │   │
  │   │  query   │     │ (Milvus)        │     │ with source info │   │
  │   └──────────┘     └─────────────────┘     └──────────────────┘   │
  │                                                                    │
  └────────────────────────────────────────────────────────────────────┘

  ┌─── Ingest ─────────────────────────────────────────────────────────┐
  │                                                                    │
  │  MEMORY.md                                                         │
  │  memory/2026-02-09.md     ┌──────────┐     ┌────────────────┐     │
  │  memory/2026-02-08.md ───▶│ Chunker  │────▶│ Dedup          │     │
  │                           │(heading, │     │(chunk_hash PK) │     │
  │                           │paragraph)│     └───────┬────────┘     │
  │                           └──────────┘             │              │
  │                                             new chunks only       │
  │                                                    ▼              │
  │                                            ┌──────────────┐       │
  │                                            │  Embed &     │       │
  │                                            │  Milvus upsert│      │
  │                                            └──────────────┘       │
  │                                                                    │
  └────────────────────────────────────────────────────────────────────┘

  ┌─── Watch ──────────────────────────────────────────────────────────┐
  │  File watcher (1500ms debounce) ──▶ auto re-index / delete stale  │
  └────────────────────────────────────────────────────────────────────┘

  ┌─── Compact ─────────────────────────────────────────────────────────┐
  │  Retrieve chunks ──▶ LLM summarize ──▶ write memory/YYYY-MM-DD.md │
  └────────────────────────────────────────────────────────────────────┘

🔒 The entire pipeline runs locally by default — your data never leaves your machine unless you choose a remote backend or a cloud embedding provider.

🧩 Claude Code Plugin

memsearch ships with a Claude Code plugin — a real-world example of agent memory in action. It gives Claude automatic persistent memory across sessions: every session is summarized to markdown, every prompt triggers a semantic search, and a background watcher keeps the index in sync. No commands to learn, no manual saving — just install and go.

# 1. Install the memsearch CLI
pip install memsearch

# 2. Set your embedding API key (OpenAI is the default provider)
export OPENAI_API_KEY="sk-..."

# 3. In Claude Code, add the marketplace and install the plugin
/plugin marketplace add zilliztech/memsearch
/plugin install memsearch

# 4. Restart Claude Code for the plugin to take effect, then start chatting!
claude

📖 Architecture, hook details, and development mode → Claude Code Plugin docs

⚙️ Configuration

Settings are resolved in priority order (lowest → highest):

Built-in defaults → 2. Global ~/.memsearch/config.toml → 3. Project .memsearch.toml → 4. CLI flags

API keys for embedding/LLM providers are read from standard environment variables (OPENAI_API_KEY, GOOGLE_API_KEY, VOYAGE_API_KEY, ANTHROPIC_API_KEY, etc.).

📖 Config wizard, TOML examples, and all settings → Getting Started — Configuration

🔌 Embedding Providers

Provider	Install	Default Model
OpenAI	`memsearch` (included)	`text-embedding-3-small`
Google	`memsearch[google]`	`gemini-embedding-001`
Voyage	`memsearch[voyage]`	`voyage-3-lite`
Ollama	`memsearch[ollama]`	`nomic-embed-text`
Local	`memsearch[local]`	`all-MiniLM-L6-v2`

📖 Provider setup and env vars → CLI Reference — Embedding Provider Reference

🗄️ Milvus Backend

memsearch supports three deployment modes — just change milvus_uri:

Mode	`milvus_uri`	Best for
Milvus Lite (default)	`~/.memsearch/milvus.db`	Personal use, dev — zero config
Milvus Server	`http://localhost:19530`	Multi-agent, team environments
Zilliz Cloud	`https://in03-xxx.api.gcp-us-west1.zillizcloud.com`	Production, fully managed

📖 Code examples and setup details → Getting Started — Milvus Backends

📚 Links

Documentation — Getting Started, CLI Reference, Architecture
Claude Code Plugin — hook details, progressive disclosure, comparison with claude-mem
OpenClaw — the memory architecture that inspired memsearch
Milvus — the vector database powering memsearch
Changelog — release history

Contributing

Bug reports, feature requests, and pull requests are welcome on GitHub. For questions and discussions, join us on Discord.

📄 License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.2

May 9, 2026

0.4.1

Apr 30, 2026

0.4.0

Apr 22, 2026

0.3.1

Apr 16, 2026

0.3.0

Apr 14, 2026

0.2.4

Apr 10, 2026

0.2.3

Apr 8, 2026

0.2.2

Mar 31, 2026

0.2.1

Mar 31, 2026

0.2.0

Mar 30, 2026

0.1.19

Mar 23, 2026

0.1.18

Mar 22, 2026

0.1.17

Mar 19, 2026

0.1.16

Mar 9, 2026

0.1.15

Mar 5, 2026

0.1.14

Mar 3, 2026

0.1.13

Feb 28, 2026

0.1.12

Feb 27, 2026

0.1.11

Feb 18, 2026

0.1.10

Feb 16, 2026

0.1.9

Feb 16, 2026

0.1.8

Feb 15, 2026

0.1.7

Feb 13, 2026

This version

0.1.6

Feb 12, 2026

0.1.5

Feb 12, 2026

0.1.4

Feb 11, 2026

0.1.3

Feb 11, 2026

0.1.2

Feb 11, 2026

0.1.1

Feb 11, 2026

0.1.0

Feb 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memsearch-0.1.6.tar.gz (2.8 MB view details)

Uploaded Feb 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memsearch-0.1.6-py3-none-any.whl (34.4 kB view details)

Uploaded Feb 12, 2026 Python 3

File details

Details for the file memsearch-0.1.6.tar.gz.

File metadata

Download URL: memsearch-0.1.6.tar.gz
Upload date: Feb 12, 2026
Size: 2.8 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memsearch-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`8082d4e8dd0897f4bf9f26b3d2258c6b0b7eb4f77cef7ea591c3a59c1e1daf93`
MD5	`15fd699ba7a45b25865184117a1f4c4b`
BLAKE2b-256	`63cb0dfe1c38a939f3480e856c971882471a5a22d6722e2e901b9cd8d715f2b1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memsearch-0.1.6.tar.gz:

Publisher: release.yml on zilliztech/memsearch

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memsearch-0.1.6.tar.gz
- Subject digest: 8082d4e8dd0897f4bf9f26b3d2258c6b0b7eb4f77cef7ea591c3a59c1e1daf93
- Sigstore transparency entry: 943213798
- Sigstore integration time: Feb 12, 2026
Source repository:
- Permalink: zilliztech/memsearch@19e2561ba10a034f28afeb7a6df78311d7acb45b
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/zilliztech
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@19e2561ba10a034f28afeb7a6df78311d7acb45b
- Trigger Event: push

File details

Details for the file memsearch-0.1.6-py3-none-any.whl.

File metadata

Download URL: memsearch-0.1.6-py3-none-any.whl
Upload date: Feb 12, 2026
Size: 34.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memsearch-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e292a7d33757a7f1bc54d08507cb78b37e95d4d705f9ad2a3ebaa7281c780869`
MD5	`75c6d7ac4fcf9361aeada319e452cd56`
BLAKE2b-256	`81df045db42501155fb25016c41dccbe133486ebe59ee6d28b655fffa8ecae8a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memsearch-0.1.6-py3-none-any.whl:

Publisher: release.yml on zilliztech/memsearch

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memsearch-0.1.6-py3-none-any.whl
- Subject digest: e292a7d33757a7f1bc54d08507cb78b37e95d4d705f9ad2a3ebaa7281c780869
- Sigstore transparency entry: 943213825
- Sigstore integration time: Feb 12, 2026
Source repository:
- Permalink: zilliztech/memsearch@19e2561ba10a034f28afeb7a6df78311d7acb45b
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/zilliztech
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@19e2561ba10a034f28afeb7a6df78311d7acb45b
- Trigger Event: push

memsearch 0.1.6

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

memsearch

✨ Why memsearch?

📦 Installation

🐍 Python API — Give Your Agent Memory

🖥️ CLI Usage

🔍 How It Works

🧩 Claude Code Plugin

⚙️ Configuration

🔌 Embedding Providers

🗄️ Milvus Backend

📚 Links

Contributing

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance