Transform chat history into structured, navigable memory graphs

Project description

🧠 Rebrain

Transform chat history into structured, personalized AI memory.

Rebrain processes your ChatGPT conversations through a 5-step pipeline, extracting observations, synthesizing learnings and cognitions, then building a user persona for hyper-personalized AI interactions.


🚀 Quick Start (Recommended)

Using UV - Zero Setup Required

# 1. Install UV (one-time)
curl -LsSf https://astral.sh/uv/install.sh | sh

# 2. Set your API key
export GEMINI_API_KEY=your_key_here

# 3. Process your conversations
uvx rebrain pipeline run --input conversations.json

# 4. Start MCP server for Claude/Cursor
uvx rebrain mcp

That's it! No Python installation, no virtual environments, no dependencies to manage.

See INSTALL.md for detailed installation options.


🎯 For Developers

# Clone and setup
git clone https://github.com/yasinsb/rebrain.git
cd rebrain
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
cp env.template .env  # Add your GEMINI_API_KEY

# Run pipeline (bash CLI)
scripts/pipeline/cli.sh all

# Or step by step
scripts/pipeline/cli.sh step1  # Transform & filter
scripts/pipeline/cli.sh step2  # Extract & cluster observations
scripts/pipeline/cli.sh step3  # Synthesize learnings
scripts/pipeline/cli.sh step4  # Synthesize cognitions
scripts/pipeline/cli.sh step5  # Build persona

# Load into memg-core
python scripts/load_memg.py

Output: data/persona/persona.md - ready for system prompts!


🤖 MCP Integration (Claude Desktop / Cursor)

Direct Mode (Recommended)

Add to ~/.cursor/mcp.json or Claude Desktop config:

{
  "mcpServers": {
    "rebrain": {
      "command": "uvx",
      "args": ["--from", "rebrain", "rebrain-mcp"],
      "cwd": "/path/to/your/rebrain/project"
    }
  }
}

HTTP Mode (Shared Server)

# Start persistent server
uvx rebrain mcp --port 9999

Then point your MCP client at the running server:

{
  "mcpServers": {
    "rebrain": {
      "url": "http://localhost:9999/mcp"
    }
  }
}

Benefits:

  • 💰 Process once (~$0.10-0.20), query forever for free (local memg-core)
  • Instant restarts - database persists, no reprocessing
  • 🔒 100% local - no ongoing API costs, no cloud lock-in

Quick Start (Legacy)

1. Setup

git clone <repo-url>
cd rebrain

# Create virtual environment
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Configure
cp env.template .env
# Edit .env: add GEMINI_API_KEY

2. Prepare Data

Export your ChatGPT conversations and place the JSON file at:

data/raw/conversations.json
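
To sanity-check the export before running the pipeline, a minimal sketch like the following is enough (it assumes the standard ChatGPT export layout: a JSON array of conversation objects, each with a title field):

import json
from pathlib import Path

# Load the raw ChatGPT export and report what it contains.
export_path = Path("data/raw/conversations.json")
conversations = json.loads(export_path.read_text(encoding="utf-8"))

print(f"{len(conversations)} conversations found")
for convo in conversations[:5]:
    # "title" is the conversation name in standard ChatGPT exports.
    print("-", convo.get("title", "(untitled)"))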

3. Run Pipeline

# Full pipeline (5 steps)
scripts/pipeline/cli.sh all

# Or run individual steps
scripts/pipeline/cli.sh step1
scripts/pipeline/cli.sh step2
# ... etc

4. Check Results

# View pipeline status
scripts/pipeline/cli.sh status

# Read your persona
cat data/persona/persona.md

Pipeline Overview

Rebrain uses a 5-step synthesis pipeline:

Raw Conversations (JSON export)
    ↓ Step 1: Transform & Filter
Clean Conversations (date filtered, code removed)
    ↓ Step 2: Extract & Cluster Observations (AI + K-Means)
Clustered Observations (~100 clusters)
    ↓ Step 3: Synthesize Learnings (AI + K-Means)
Clustered Learnings (~20 clusters)
    ↓ Step 4: Synthesize Cognitions (AI)
High-Level Cognitions (~20 patterns)
    ↓ Step 5: Build Persona (AI)
User Persona (3 plain text sections)

Key Features:

  • Privacy-First: Category-specific filtering at observation extraction
  • Adaptive Clustering: Finds local optima with tolerance-based K-Means (see the sketch after this list)
  • Flexible Models: Override per-task via prompt template metadata
  • Provenance Tracking: Full lineage from conversation → observation → learning → cognition
  • Dual Output: JSON (structured) + Markdown (human-readable)
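
The adaptive clustering step can be illustrated with a rough scikit-learn sketch (an assumption of how tolerance-based selection might look, not Rebrain's actual clusterer; target and tolerance mirror the config keys shown under Configuration below): try each k within the tolerance band around the target and keep the clustering with the best silhouette score.

import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

def tolerance_kmeans(embeddings: np.ndarray, target: int = 20, tolerance: float = 0.2, seed: int = 42):
    """Search k in [target*(1-tol), target*(1+tol)] and keep the best-scoring clustering."""
    k_min = max(2, int(target * (1 - tolerance)))
    k_max = int(target * (1 + tolerance))
    best = None
    for k in range(k_min, k_max + 1):
        labels = KMeans(n_clusters=k, random_state=seed, n_init=10).fit_predict(embeddings)
        score = silhouette_score(embeddings, labels)
        if best is None or score > best[0]:
            best = (score, k, labels)
    return best  # (silhouette, k, labels)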

Configuration

All pipeline parameters live in config/pipeline.yaml:

ingestion:
  date_cutoff_days: 180
  remove_code_blocks: true

insight_extraction:
  max_concurrent: 20
  batch_size: 40

learning_clustering:
  target_clusters: 20
  tolerance: 0.2
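
To inspect or tweak these values programmatically, a plain PyYAML load is enough (a minimal sketch; the keys are the ones from the excerpt above):

import yaml

with open("config/pipeline.yaml") as f:
    cfg = yaml.safe_load(f)

print(cfg["ingestion"]["date_cutoff_days"])     # 180
print(cfg["learning_clustering"]["tolerance"])  # 0.2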

Model selection via prompt templates:

# rebrain/prompts/templates/persona_synthesis.yaml
metadata:
  model_recommendation: "gemini-2.5-flash"

See config/README.md for details.


CLI Usage

# From the scripts/pipeline/ directory
# Check what's been generated
./cli.sh status

# Run individual steps
./cli.sh step1 -i data/raw/my_convos.json
./cli.sh step2 --cluster-only  # Re-cluster existing insights
./cli.sh step3

# Clean outputs
./cli.sh clean --all

# Full help
./cli.sh help

See scripts/pipeline/README.md for details.


Project Structure

rebrain/
├── rebrain/              # Core library
│   ├── core/            # GenAI client
│   ├── ingestion/       # Data loading & chunking
│   ├── operations/      # Embedder, clusterer, synthesizer
│   ├── prompts/         # Prompt templates (YAML)
│   ├── retrieval/       # Query interface (future)
│   └── schemas/         # Pydantic models
├── config/              # Pipeline configuration
├── scripts/pipeline/    # 5-step pipeline + CLI
├── data/                # Raw → processed → persona
└── notebooks/           # Exploration & testing

Output

Persona (Step 5)

JSON (data/persona/persona.json):

{
  "model": "gemini-2.5-flash",
  "persona": {
    "personal_profile": "...",
    "communication_preferences": "...",
    "professional_profile": "..."
  }
}

Markdown (data/persona/persona.md):

# User Persona Information for AI

## Personal Profile
...

## Communication Preferences
...

## Professional Profile
...

Copy-paste ready for system prompts!
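
For example, a minimal sketch of loading the persona as a system prompt, assuming the google-genai SDK and the same GEMINI_API_KEY the pipeline uses (any chat API that accepts a system prompt works the same way):

from pathlib import Path
from google import genai
from google.genai import types

# Load the generated persona and use it as the system prompt.
persona = Path("data/persona/persona.md").read_text(encoding="utf-8")

client = genai.Client()  # picks up GEMINI_API_KEY from the environment
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Draft a short plan for my week.",
    config=types.GenerateContentConfig(system_instruction=persona),
)
print(response.text)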


Development

# Install dev dependencies
pip install -r requirements_dev.txt

# Run with custom config
python scripts/pipeline/01_transform_filter.py --input data/raw/test.json

# Check specific step
python scripts/pipeline/02_extract_cluster_insights.py --skip-cluster

Documentation

  • Pipeline Details: scripts/pipeline/README.md
  • Configuration: config/README.md
  • Data Structure: data/README.md
  • Model Override Pattern: MODEL_OVERRIDE_PATTERN.md
  • Persona Builder: PERSONA_BUILDER_REFACTOR.md

License

MIT License - see LICENSE file


Built by Yasin Salimibeni

Download files

Download the file for your platform.

Source Distribution

rebrain-0.1.1.tar.gz (71.5 kB)


Built Distribution


rebrain-0.1.1-py3-none-any.whl (90.0 kB)


File details

Details for the file rebrain-0.1.1.tar.gz.

File metadata

  • Download URL: rebrain-0.1.1.tar.gz
  • Upload date:
  • Size: 71.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rebrain-0.1.1.tar.gz
  • SHA256: 6363ca3224806e696ebc4ae68275b67479ea3f3061377e3cc92c3cf1e15725db
  • MD5: eb642049d0b90dbb05762e606dcff417
  • BLAKE2b-256: fef98885424933ca32d5480798c432702ae65cfabb612e4e305563e74dc4992b


Provenance

The following attestation bundles were made for rebrain-0.1.1.tar.gz:

Publisher: publish.yml on yasinsb/rebrain


File details

Details for the file rebrain-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: rebrain-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 90.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rebrain-0.1.1-py3-none-any.whl
  • SHA256: 6a487acb74e98d543950a4af2db0461ad3368da2d6204093bcb91a4720751b75
  • MD5: 7f6779d1de4b1d67069f19ae8b5066f6
  • BLAKE2b-256: 9ca55cdf8f9750807310fbdbeec831e4910f359307c0af9dac9b43b6bf2a71c2


Provenance

The following attestation bundles were made for rebrain-0.1.1-py3-none-any.whl:

Publisher: publish.yml on yasinsb/rebrain

