Collective problem-solving memory for coding agents — powered by Actian VectorAI DB

These details have not been verified by PyPI

Project links

Project description

Context8

Collective problem-solving memory for coding agents — powered by Actian VectorAI DB

Context7 gives your agent the docs. Context8 gives it what the docs don't cover.

Every time a coding agent solves an uncommon error, the solution vanishes after the session. Context8 stores those solutions in a vector database so any agent — yours or your team's — can find them next time.

Agent hits error → searches Context8 → finds a past solution → applies it
                                            ↓
                     Agent solves new error → logs it to Context8 → future agents benefit

Prerequisites

Docker Desktop — Actian VectorAI DB runs as a Docker container on your machine
Python 3.10+

Quick Start

# 1. Install context8 + the Actian VectorAI DB client (one line)
pip install context8 "actian-vectorai @ https://github.com/hackmamba-io/actian-vectorAI-db-beta/raw/main/actian_vectorai-0.1.0b2-py3-none-any.whl"

# Or with uv
uv pip install context8 "actian-vectorai @ https://github.com/hackmamba-io/actian-vectorAI-db-beta/raw/main/actian_vectorai-0.1.0b2-py3-none-any.whl"

# 2. Start the database (pulls and runs the Docker container)
context8 start

# 3. Initialize and seed with 24 curated problem-solution pairs
context8 init --seed

# 4. Add to your coding agent
context8 add claude       # Claude Code
context8 add cursor       # Cursor
context8 add windsurf     # Windsurf

# 5. Verify everything works
context8 doctor

Restart your agent. It now has three new tools: context8_search, context8_log, and context8_stats.

Why two packages? The actian-vectorai SDK is distributed by Actian as a beta wheel and is not yet on PyPI. Context8 is on PyPI. Once Actian publishes their SDK to PyPI, this will become a single pip install context8.

What It Does

Layer	Source	What It Covers
Context 1-6	Codebase, conversation	Your current project
Context 7	Official documentation	Common patterns, API usage
Context 8	Agent problem-solving history	Uncommon errors, workarounds, integration bugs

Context8 catches the long tail — the problems that waste the most agent cycles because they aren't documented anywhere:

Environment-specific failures (OS quirks, dependency conflicts)
Library interaction bugs that only surface in combination
Workarounds discovered through trial-and-error
Errors that return zero useful Stack Overflow results

How It Works

Context8 is an MCP server backed by Actian VectorAI DB. When your agent encounters an error:

Search — Agent calls context8_search("TypeError Cannot read properties of undefined map React Suspense")
Match — Context8 runs hybrid search: dense semantic vectors find meaning-similar problems, sparse keyword vectors catch exact error tokens, metadata filters narrow by language/framework
Return — Agent gets ranked solutions with code diffs, confidence scores, and context
Learn — After solving a new problem, the agent calls context8_log(problem=..., solution=...) to store it

Three Search Strategies, Fused Together

Strategy	What It Catches	Example
Dense search (problem vector, 384d)	Semantic meaning	"undefined array access" matches "null reference on collection"
Dense search (code context vector, 768d)	Code patterns	`data?.items ?? []` matches `optional chaining null safety`
Sparse search (BM25 keywords)	Exact tokens	`ModuleNotFoundError` matches `ModuleNotFoundError` exactly

Results are fused with Reciprocal Rank Fusion (RRF) and filtered by language, framework, and more.

CLI Reference

context8 start                  # Start the Actian VectorAI DB container
context8 stop                   # Stop the container
context8 init                   # Create the collection
context8 init --seed            # Create + seed with starter data
context8 init --seed --force    # Drop, recreate, and reseed

context8 add claude             # Add to Claude Code (~/.claude/settings.json)
context8 add claude-project     # Add to project-level Claude config
context8 add cursor             # Add to Cursor (.cursor/mcp.json)
context8 add windsurf           # Add to Windsurf (.windsurf/mcp.json)
context8 remove claude          # Remove from Claude Code

context8 stats                  # Show knowledge base statistics
context8 doctor                 # Full health check
context8 search "query"         # Search from the command line (for testing)
context8 search "query" -l python  # Search with language filter

context8 serve                  # Start MCP server (agents call this automatically)

Architecture

┌─────────────────────────────────────────────────────────┐
│              Coding Agent (Claude Code / Cursor)         │
└────────────────────────┬────────────────────────────────┘
                         │ MCP (stdio)
┌────────────────────────▼────────────────────────────────┐
│                  Context8 MCP Server                     │
│                                                          │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐  │
│  │   Embedding   │  │    Search    │  │    Storage   │  │
│  │   Pipeline    │  │    Engine    │  │    Service   │  │
│  │              │  │              │  │              │  │
│  │ MiniLM 384d  │  │ Dense+Sparse │  │ Named Vecs  │  │
│  │ CodeBERT*    │  │ RRF Fusion   │  │ Filters     │  │
│  │ BM25 Sparse  │  │ QueryAnalyze │  │ Dedup       │  │
│  └──────────────┘  └──────────────┘  └──────────────┘  │
└────────────────────────┬────────────────────────────────┘
                         │ gRPC :50051
              ┌──────────▼──────────┐
              │  Actian VectorAI DB │
              │  (Docker)           │
              │                     │
              │  context8_store     │
              │  3 named vectors    │
              │  + sparse + payload │
              └─────────────────────┘

Hackathon: Advanced Features Used

This project uses all three advanced features required by the Actian VectorAI DB Build Challenge:

Hybrid Fusion — Dense semantic vectors + sparse BM25 keyword vectors, fused with RRF. Error messages need both semantic understanding and exact token matching.
Filtered Search — Metadata filters narrow results by language, framework, error type, and resolution status. A Python agent doesn't need TypeScript solutions.
Named Vectors — Three separate embedding spaces (problem 384d, solution 384d, code_context 768d) because error descriptions, fix descriptions, and code snippets are semantically different domains.

Tech Stack

Component	Technology
Vector Database	Actian VectorAI DB (Docker, gRPC)
Dense Embeddings	sentence-transformers/all-MiniLM-L6-v2 (384d)
Code Embeddings	microsoft/codebert-base (768d, opt-in)
Sparse Embeddings	Custom BM25 tokenizer
MCP Server	Python `mcp` SDK (stdio transport)
CLI	Click + Rich
Package Manager	uv / pip compatible

Development

# Clone and set up
git clone https://github.com/hallelx2/context8.git
cd context8
uv venv && source .venv/bin/activate  # or: .venv\Scripts\activate on Windows

# Install context8 + dev deps + actian client
uv pip install -e ".[all]" "actian-vectorai @ https://github.com/hackmamba-io/actian-vectorAI-db-beta/raw/main/actian_vectorai-0.1.0b2-py3-none-any.whl"

# Start the DB and verify
context8 start
context8 doctor

# Run tests
pytest tests/ -v

# Lint
ruff check src/

Project Structure

context8/
├── src/context8/
│   ├── cli.py              # CLI commands (start/stop/init/add/remove/stats/doctor/search)
│   ├── server.py           # MCP server (context8_search, context8_log, context8_stats)
│   ├── agents.py           # Agent config management (add/remove from Claude/Cursor/etc.)
│   ├── search.py           # Hybrid search engine + QueryAnalyzer
│   ├── embeddings.py       # MiniLM + CodeBERT + BM25 pipeline
│   ├── storage.py          # Actian VectorAI DB operations (with sparse fallback)
│   ├── models.py           # ResolutionRecord dataclass
│   ├── config.py           # Constants, paths, agent registry
│   └── seed.py             # 24 curated problem-solution starter records
├── tests/
├── docs/                   # Architecture, plans, bottleneck analysis
├── docker-compose.yml
├── pyproject.toml
└── CLAUDE.md               # Agent instructions for this codebase

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Apr 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

context8-0.1.0.tar.gz (67.0 kB view details)

Uploaded Apr 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

context8-0.1.0-py3-none-any.whl (29.8 kB view details)

Uploaded Apr 16, 2026 Python 3

File details

Details for the file context8-0.1.0.tar.gz.

File metadata

Download URL: context8-0.1.0.tar.gz
Upload date: Apr 16, 2026
Size: 67.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for context8-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`62f5db47ce9fbfa62d7d1a31c5bfd191abcfef16326eb2d9e6163cd0525cad98`
MD5	`a9e2397e3aaf9fc38bb13a42f52972bf`
BLAKE2b-256	`e0a9bebd599f93d1d5bfb60a574f4bbcfae23a0aaa0b1fa40aa2275d4d2c5d73`

See more details on using hashes here.

File details

Details for the file context8-0.1.0-py3-none-any.whl.

File metadata

Download URL: context8-0.1.0-py3-none-any.whl
Upload date: Apr 16, 2026
Size: 29.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for context8-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4db19b6773229f97c5bd7f75158842c83c185a876822299824dc45c3fe4b6524`
MD5	`bdc829bc58054149647a05a1337f4d9c`
BLAKE2b-256	`b79bc5c029d199b90c3bf4a3b43b39536ee1863b770f44191c5c2c4a4745d46d`

See more details on using hashes here.

context8 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Context8

Prerequisites

Quick Start

What It Does

How It Works

Three Search Strategies, Fused Together

CLI Reference

Architecture

Hackathon: Advanced Features Used

Tech Stack

Development

Project Structure

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes