A self-improving code reasoning engine with persistent semantic memory
Project description
Neo
A self-improving code reasoning engine that learns from experience using persistent semantic memory. Neo uses multi-agent reasoning to analyze code, generate solutions, and continuously improve through feedback loops.
- Persistent Memory: Learns from every solution attempt
- Semantic Retrieval: Vector search finds relevant patterns
- Code-First Generation: No diff parsing failures
- Local Storage: Privacy-first JSON storage in ~/.neo directory
- Model-Agnostic: Works with any LM provider
- Available as a Claude Code Plugin: Integrates seamlessly with Anthropic's Claude models and CLI.
Why Neo? Why Care?
If you've been Vibe Coding, then Vibe Planning, then Context Engineering, and on and on, you have likely hit walls where the models are both powerful and limited, brilliant and incompetent, wise and ignorant, humble yet overconfident.
Worse, your speedy AI Code Assistant sometimes goes rogue and overwrites key code in a project, or writes redundant code even after just reading documentation and the source code, or violates your project's patterns and design philosophy.... It can be infuriating. Why doesn't the model remember? Why doesn't it learn? Why can't it keep the context of the code patterns and tech stack? ... -> This is what Neo is designed to solve.
Neo is the missing context layer for AI Code Assistants. It learns from every solution attempt, using vector embeddings to retrieve relevant patterns for new problems. It then applies the learned patterns to generate solutions, and continuously improves through feedback loops.
Table of Contents
- Design Philosophy
- How It Works
- Quick Start
- Claude Code Plugin
- Installation
- Usage
- Architecture
- Performance
- Configuration
- LM Adapters
- Extending Neo
- Key Features
- Development
- Research & References
- License
- Contributing
- Changelog
Design Philosophy
Persistent Learning: Neo builds a semantic memory of successful and failed solutions, using vector embeddings to retrieve relevant patterns for new problems.
Code-First Output: Instead of generating diffs that need parsing, Neo outputs executable code blocks directly, eliminating extraction failures.
Local File Storage: Semantic memory stored in ~/.neo directory for privacy and offline access.
Model-Agnostic: Works with OpenAI, Anthropic, Google, local models, or Ollama via a simple adapter interface.
How It Works
User Problem → Neo CLI → Semantic Retrieval → Reasoning → Code Generation
↓
[Vector Search]
[Pattern Matching]
[Confidence Scoring]
↓
Executable Code + Memory Update
Neo retrieves similar past solutions using Jina Code embeddings (768-dimensional vectors), applies learned patterns, generates solutions, and stores feedback for continuous improvement.
-
Jina's embeddings model (open source) is downloaded automatically when you first run Neo. This model runs locally on your machine to generate vector embeddings.
-
When you ask Neo for help:
- Your query is embedded locally using the Jina model
- Neo searches local memory for similar past solutions (using FAISS)
- Retrieved patterns are combined with your original prompt
- This combined context is sent to your chosen LLM API (OpenAI/Anthropic/Google)
- The LLM generates a solution informed by both your query and past patterns
- The result is stored back in local memory for future use
Local storage: ~/.neo/reasoning_patterns.json ← Stores vectors + patterns ~/.neo/faiss_index.bin ← FAISS index for fast search
Privacy:
- Your code never leaves your machine during embedding/search
- Only your prompt + retrieved patterns are sent to the LLM API
- This is the same as using the LLM directly, but with added context from something akin to memory.
Your Prompt
↓
Local Jina Embedding (768-dim vector)
↓
Local FAISS Search (finds similar past solutions)
↓
Retrieve Pattern Text from ~/.neo/reasoning_patterns.json
↓
Combine: Your Prompt + Retrieved Pattern Text
↓
→→→ NETWORK CALL →→→ LLM API (OpenAI/Anthropic/etc.)
↓
Solution Generated
↓
Store in Local Memory for future use
Quick Start
# Install from PyPI (recommended)
pip install neo-reasoner
# Or install with specific LM provider
pip install neo-reasoner[openai] # For GPT (recommended)
pip install neo-reasoner[anthropic] # For Claude
pip install neo-reasoner[google] # For Gemini
pip install neo-reasoner[all] # All providers
# Set API key
export OPENAI_API_KEY=sk-...
# Test Neo
neo --version
See QUICKSTART.md for 5-minute setup guide
Claude Code Plugin
Neo is available as a Claude Code plugin with specialized agents and slash commands for seamless integration:
# Add the marketplace
/plugin marketplace add Parslee-ai/claude-code-plugins
# Install Neo plugin
/plugin install neo
Once installed, you get:
- Neo Agent: Specialized subagent for semantic reasoning (
Use the Neo agent to...) - Slash Commands:
/neo,/neo-review,/neo-optimize,/neo-architect,/neo-debug,/neo-pattern - Persistent Memory: Neo learns from your codebase patterns over time
- Multi-Agent Reasoning: Solver, Critic, and Verifier agents collaborate on solutions
Quick Examples
# Code review with semantic analysis
/neo-review src/api/handlers.py
# Get optimization suggestions
/neo-optimize process_large_dataset function
# Architectural guidance
/neo-architect Should I use microservices or monolith?
# Debug complex issues
/neo-debug Race condition in task processor
See .claude-plugin/README.md for full plugin documentation
Installation
From PyPI (Recommended)
# Install Neo
pip install neo-reasoner
# With specific LM provider
pip install neo-reasoner[openai] # GPT (recommended)
pip install neo-reasoner[anthropic] # Claude
pip install neo-reasoner[google] # Gemini
pip install neo-reasoner[all] # All providers
# Verify installation
neo --version
From Source (Development)
# Clone repository
git clone https://github.com/Parslee-ai/neo.git
cd neo
# Install in development mode with all dependencies
pip install -e ".[dev,all]"
# Verify installation
neo --version
Dependencies
Core dependencies are automatically installed via pyproject.toml:
- numpy >= 1.24.0
- scikit-learn >= 1.3.0
- datasketch >= 1.6.0
- fastembed >= 0.3.0
- faiss-cpu >= 1.7.0
Optional: LM Provider
Choose your language model provider:
pip install openai # GPT models (recommended)
pip install anthropic # Claude
pip install google-generativeai # Gemini
pip install requests # Ollama
See INSTALL.md for detailed installation instructions
Usage
CLI Interface
# Ask Neo a question
neo "how do I fix the authentication bug?"
# With working directory context
neo --cwd /path/to/project "optimize this function"
# Check version and memory stats
neo --version
Timeout Requirements
Neo makes blocking LLM API calls that typically take 30-120 seconds. When calling Neo from scripts or automation, use appropriate timeouts:
# From shell (10 minute timeout)
timeout 600 neo "your query"
# From Python subprocess
subprocess.run(["neo", query], timeout=600)
Insufficient timeouts will cause failures during LLM inference, not context gathering.
Output Format
Neo outputs executable code blocks with confidence scores:
def solution():
# Neo's generated code
pass
Personality System
Neo responds with personality (Matrix-inspired quotes) when displaying version info:
$ neo --version
"What is real? How do you define 'real'?"
120 patterns. 0.3 confidence.
Load Program - Training Neo's Memory
"The Operator uploads a program into Neo's head."
Neo can bootstrap its memory by importing patterns from HuggingFace datasets. This is NOT model fine-tuning - it's retrieval learning that expands local semantic memory with reusable code patterns.
# Install datasets library
pip install datasets
# Load patterns from MBPP (recommended starter - 1000 Python problems)
neo --load-program mbpp --split train --limit 1000
# Load from OpenAI HumanEval (164 hand-written coding problems)
neo --load-program openai_humaneval --split test
# Load from BigCode HumanEvalPack (multi-language variants)
neo --load-program bigcode/humanevalpack --split test --limit 500
# Dry run to preview
neo --load-program mbpp --dry-run
# Custom column mapping
neo --load-program my_dataset \
--columns '{"text":"pattern","code":"solution"}'
Output (Matrix-style):
"I know kung fu."
Loaded: 847 patterns
Deduped: 153 duplicates
Index rebuilt: 1.2s
Memory: 1247 total patterns
How it works:
- Acquire: Pull dataset from HuggingFace
- Normalize: Map rows to ReasoningEntry schema
- Dedupe: Hash-based deduplication against existing memory
- Embed: Generate local embeddings (Jina Code v2)
- Index: Upsert into FAISS index
- Report: Matrix quote + counts
Key points:
- NOT fine-tuning - just expanding retrieval memory
- Patterns start at 0.3 confidence (trainable via real-world usage)
- Automatic deduplication prevents memory bloat
- Uses local embeddings (no data leaves your machine)
- Stored in
~/.neo/alongside learned patterns
See docs/LOAD_PROGRAM.md for detailed documentation
Architecture
Semantic Memory
Neo uses Jina Code v2 embeddings (768 dimensions) optimized for code similarity:
- Pattern Storage: Every solution attempt creates a reasoning pattern
- Vector Search: Similar problems retrieve relevant patterns via FAISS
- Confidence Scoring: Patterns track success/failure rates
- Local Persistence: Patterns stored locally in JSON format
Code Block Schema (Phase 1)
Neo generates executable code directly instead of diffs:
@dataclass
class CodeSuggestion:
file_path: str
unified_diff: str # Legacy: backward compatibility
code_block: str = "" # Primary: executable Python code
description: str
confidence: float
tradeoffs: list[str]
This eliminates the 18% extraction failure rate from diff parsing.
Storage Architecture
- Local Files: JSON storage in ~/.neo directory
- FAISS Index: Fast vector search for pattern retrieval
- Auto-Consolidation: Intelligent pattern merging to prevent fragmentation
Performance
Neo improves over time as it learns from experience. Initial performance depends on available memory patterns. Performance grows as the semantic memory builds up successful and failed solution patterns.
Configuration
CLI Configuration Management
Neo provides a simple CLI for managing persistent configuration:
# List all configuration values
neo --config list
# Get a specific value
neo --config get --config-key provider
# Set a value
neo --config set --config-key provider --config-value anthropic
neo --config set --config-key model --config-value claude-3-5-sonnet-20241022
neo --config set --config-key api_key --config-value sk-ant-...
# Reset to defaults
neo --config reset
Exposed Configuration Fields:
provider- LM provider (openai, anthropic, google, azure, ollama, local)model- Model name (e.g., gpt-4, claude-3-5-sonnet-20241022)api_key- API key for the chosen providerbase_url- Base URL for local/Ollama endpoints
Configuration is stored in ~/.neo/config.json and takes precedence over environment variables.
Environment Variables
Alternatively, use environment variables for configuration:
# Required: LM Provider API Key
export ANTHROPIC_API_KEY=sk-ant-...
LM Adapters
OpenAI (Recommended)
from neo.adapters import OpenAIAdapter
adapter = OpenAIAdapter(model="gpt-5-codex", api_key="sk-...")
Latest models: gpt-5-codex (recommended for coding), gpt-5, gpt-5-mini, gpt-5-nano
Anthropic
from neo.adapters import AnthropicAdapter
adapter = AnthropicAdapter(model="claude-sonnet-4-5-20250929")
Latest models: claude-sonnet-4-5-20250929, claude-opus-4-1-20250805, claude-3-5-haiku-20241022
from neo.adapters import GoogleAdapter
adapter = GoogleAdapter(model="gemini-2.5-pro")
Latest models: gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite
Ollama
from neo.adapters import OllamaAdapter
adapter = OllamaAdapter(model="llama3.1")
Extending Neo
Add a New LM Provider
from neo.cli import LMAdapter
class CustomAdapter(LMAdapter):
def generate(self, messages, stop=None, max_tokens=4096, temperature=0.7):
# Your implementation
return response_text
def name(self):
return "custom/model-name"
Key Features
- Persistent Memory: Learns from every solution attempt
- Semantic Retrieval: Vector search finds relevant patterns
- Code-First Generation: No diff parsing failures
- Local Storage: Privacy-first JSON storage in ~/.neo directory
- Model-Agnostic: Works with any LM provider
Development
Running Tests
# Run all tests
pytest
# Run specific test
pytest tests/test_neo.py
# Run with coverage
pytest --cov=neo
Research & References
Academic Papers
Neo's design is informed by cutting-edge research in code reasoning and memory systems:
-
ReasoningBank (arXiv:2509.25140v1) Systematic Failure Learning and Semantic Anchor Embedding
- Phase 2: Semantic anchor embedding (pattern+context only)
- Phase 3: Failure root cause extraction and contrastive learning
- Phase 4: Self-contrast consolidation (archetypal vs spurious patterns)
- Phase 5: Strategy evolution tracking (procedural/adaptive/compositional)
- Paper: https://arxiv.org/abs/2509.25140
-
MapCoder - Multi-agent reasoning framework Neo uses Solver-Critic-Verifier agent collaboration for code generation
-
CodeSim - Code similarity metrics Influenced Neo's semantic memory design and pattern matching approach
Technologies
-
Jina Code v2 Embeddings (jinaai/jina-embeddings-v2-base-code) 768-dimensional embeddings optimized for code similarity tasks
-
FAISS (facebookresearch/faiss) Facebook AI Similarity Search - efficient vector similarity search and clustering
-
FastEmbed (qdrant/fastembed) Local embedding generation without external API dependencies
License
Apache License 2.0 - See LICENSE for details.
Contributing
See CONTRIBUTING.md for contribution guidelines.
Changelog
See CHANGELOG.md for version history.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file neo_reasoner-0.7.6.tar.gz.
File metadata
- Download URL: neo_reasoner-0.7.6.tar.gz
- Upload date:
- Size: 163.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3e99f19a19881d9ba5cc7304143bd19c8861d11308818b218011ff0dc7c51f94
|
|
| MD5 |
9a293deef6a1263257fad2f327ef73c8
|
|
| BLAKE2b-256 |
8acfc0f99d94da2e85682d67fffa4974976e4a4736f000ceb7ee8388fffa213a
|
Provenance
The following attestation bundles were made for neo_reasoner-0.7.6.tar.gz:
Publisher:
publish.yml on Parslee-ai/neo
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
neo_reasoner-0.7.6.tar.gz -
Subject digest:
3e99f19a19881d9ba5cc7304143bd19c8861d11308818b218011ff0dc7c51f94 - Sigstore transparency entry: 606141542
- Sigstore integration time:
-
Permalink:
Parslee-ai/neo@5c78a092da90022eca1fbae97b8245b7838e8d83 -
Branch / Tag:
refs/tags/v0.7.6 - Owner: https://github.com/Parslee-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@5c78a092da90022eca1fbae97b8245b7838e8d83 -
Trigger Event:
release
-
Statement type:
File details
Details for the file neo_reasoner-0.7.6-py3-none-any.whl.
File metadata
- Download URL: neo_reasoner-0.7.6-py3-none-any.whl
- Upload date:
- Size: 127.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
40810db399c4b8af321d6969303d4055205fa8ea7660c7abb432e0f97414f320
|
|
| MD5 |
2006805a2521310fcadafa3a5a23d250
|
|
| BLAKE2b-256 |
cdd12b1c3ee4ba30634153057a278754082038d6ba7bd5515b7be0a065bb81f1
|
Provenance
The following attestation bundles were made for neo_reasoner-0.7.6-py3-none-any.whl:
Publisher:
publish.yml on Parslee-ai/neo
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
neo_reasoner-0.7.6-py3-none-any.whl -
Subject digest:
40810db399c4b8af321d6969303d4055205fa8ea7660c7abb432e0f97414f320 - Sigstore transparency entry: 606141548
- Sigstore integration time:
-
Permalink:
Parslee-ai/neo@5c78a092da90022eca1fbae97b8245b7838e8d83 -
Branch / Tag:
refs/tags/v0.7.6 - Owner: https://github.com/Parslee-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@5c78a092da90022eca1fbae97b8245b7838e8d83 -
Trigger Event:
release
-
Statement type: