Agent Brain RAG - Intelligent document indexing and semantic search server that gives AI agents long-term memory

These details have not been verified by PyPI

Project links

Project description

Agent Brain RAG Server

Agent Brain (formerly doc-serve) is an intelligent document indexing and semantic search system designed to give AI agents long-term memory.

AI agents need persistent memory to be truly useful. Agent Brain provides the retrieval infrastructure that enables context-aware, knowledge-grounded AI interactions.

Installation

pip install agent-brain-rag

Quick Start

Set environment variables:

export OPENAI_API_KEY=your-key
export ANTHROPIC_API_KEY=your-key

Start the server:
```
agent-brain-serve
```

The server will start at http://127.0.0.1:8000.

Note: The legacy command doc-serve is still available but deprecated. Please use agent-brain-serve for new installations.

Search Capabilities

Agent Brain provides multiple search strategies to match your retrieval needs:

Search Type	Description	Best For
Semantic Search	Natural language queries using OpenAI embeddings (`text-embedding-3-large`)	Conceptual questions, finding related content
Keyword Search (BM25)	Traditional keyword matching with TF-IDF ranking	Exact matches, technical terms, code identifiers
Hybrid Search	Combines vector + BM25 for best of both approaches	General-purpose queries, balanced recall/precision
GraphRAG	Knowledge graph-based retrieval for relationship-aware queries	Understanding connections, multi-hop reasoning

Features

Document Indexing: Load and index documents from folders (PDF, Markdown, TXT, DOCX, HTML)
AST-Aware Code Ingestion: Smart parsing for Python, TypeScript, JavaScript, Java, Go, Rust, C, C++
Multi-Strategy Retrieval: Semantic, keyword, hybrid, and graph-based search
OpenAI Embeddings: Uses text-embedding-3-large for high-quality embeddings
Claude Summarization: AI-powered code summaries for better context
Chroma Vector Store: Persistent, thread-safe vector database
FastAPI: Modern, high-performance REST API with OpenAPI documentation

Prerequisites

Python 3.10+
OpenAI API key (for embeddings)
Anthropic API key (for summarization)

Development Installation

cd agent-brain-server
poetry install

Configuration

Copy the environment template and configure:

cp ../.env.example .env
# Edit .env with your API keys

Required environment variables:

OPENAI_API_KEY: Your OpenAI API key for embeddings
ANTHROPIC_API_KEY: Your Anthropic API key for summarization

Running the Server

# Development mode
poetry run uvicorn agent_brain_server.api.main:app --reload

# Or use the entry point
poetry run agent-brain-serve

API Documentation

Once running, visit:

Swagger UI: http://127.0.0.1:8000/docs
ReDoc: http://127.0.0.1:8000/redoc
OpenAPI JSON: http://127.0.0.1:8000/openapi.json

API Endpoints

Health

GET /health - Server health status
GET /health/status - Detailed indexing status

Indexing

POST /index - Start indexing documents from a folder
POST /index/add - Add documents to existing index
DELETE /index - Reset the index

Querying

POST /query - Semantic search query
GET /query/count - Get indexed document count

Example Usage

Index Documents

curl -X POST http://localhost:8000/index \
  -H "Content-Type: application/json" \
  -d '{"folder_path": "/path/to/docs"}'

Query Documents

curl -X POST http://localhost:8000/query \
  -H "Content-Type: application/json" \
  -d '{"query": "How do I configure authentication?", "top_k": 5}'

Architecture

agent_brain_server/
├── api/
│   ├── main.py           # FastAPI application
│   └── routers/          # Endpoint handlers
├── config/
│   └── settings.py       # Configuration management
├── models/               # Pydantic request/response models
├── indexing/
│   ├── document_loader.py  # Document loading
│   ├── chunking.py         # Text chunking
│   └── embedding.py        # Embedding generation
├── services/
│   ├── indexing_service.py # Indexing orchestration
│   └── query_service.py    # Query execution
└── storage/
    └── vector_store.py     # Chroma vector store

Development

Running Tests

poetry run pytest

Code Formatting

poetry run black agent_brain_server/
poetry run ruff check agent_brain_server/

Type Checking

poetry run mypy agent_brain_server/

Documentation

User Guide - Getting started and usage
Developer Guide - Contributing and development
API Reference - Full API documentation

Release Information

Current Version: See pyproject.toml
Release Notes: GitHub Releases
Changelog: Latest Release

Related Packages

agent-brain-cli - Command-line interface for Agent Brain

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

9.6.0

Apr 4, 2026

9.5.0

Mar 31, 2026

9.4.1

Mar 23, 2026

9.4.0

Mar 20, 2026

9.3.0

Mar 18, 2026

9.2.0

Mar 17, 2026

9.0.0

Mar 16, 2026

8.0.0

Mar 16, 2026

7.0.0

Mar 6, 2026

6.0.3

Feb 21, 2026

6.0.1

Feb 15, 2026

6.0.0

Feb 14, 2026

5.0.0

Feb 11, 2026

4.0.0

Feb 6, 2026

3.0.0

Feb 4, 2026

This version

2.0.0

Feb 2, 2026

1.2.0

Jan 31, 2026

1.1.0

Jan 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_brain_rag-2.0.0.tar.gz (65.6 kB view details)

Uploaded Feb 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_brain_rag-2.0.0-py3-none-any.whl (89.0 kB view details)

Uploaded Feb 2, 2026 Python 3

File details

Details for the file agent_brain_rag-2.0.0.tar.gz.

File metadata

Download URL: agent_brain_rag-2.0.0.tar.gz
Upload date: Feb 2, 2026
Size: 65.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for agent_brain_rag-2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`1058da29bee6691baf039c143ff76d37a2cbc4ae266475dbdcb642f2b179ab2b`
MD5	`9ffebc95df58d79ea6d3924572a69e18`
BLAKE2b-256	`f8ce968522095b56447a1e5f7439934f0e25dc33deb7ad341710d3be376150c9`

See more details on using hashes here.

File details

Details for the file agent_brain_rag-2.0.0-py3-none-any.whl.

File metadata

Download URL: agent_brain_rag-2.0.0-py3-none-any.whl
Upload date: Feb 2, 2026
Size: 89.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for agent_brain_rag-2.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f02d061ad868dfde031564b119ce55446f58d894be4cce6ebea7e8a0bad00aba`
MD5	`83f3246eb6248d4b30f358d025cc7d3c`
BLAKE2b-256	`9b44f8da93f17d2054226e62d371d51ed61ba0838c83487a00e0418a550209f8`

See more details on using hashes here.

agent-brain-rag 2.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Agent Brain RAG Server

Installation

Quick Start

Search Capabilities

Features

Prerequisites

Development Installation

Configuration

Running the Server

API Documentation

API Endpoints

Health

Indexing

Querying

Example Usage

Index Documents

Query Documents

Architecture

Development

Running Tests

Code Formatting

Type Checking

Documentation

Release Information

Related Packages

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes