Universal Memory Aggregation System - Cross-system intelligence for Session-Buddy

Project description

Akosha (आकाश) - Universal Memory Aggregation System

आकाश (Akosha) - The sky has no limits 🚀

Production Ready Phase 1 Complete Tests Passing

Version: 0.3.0 (Phase 1: Production Pilot Ready) Status: Production Ready for 100-System Pilot

"Akosha" (आकाश) means "sky" or "space" in Sanskrit - representing infinite, boundless memory aggregation across all your Session-Buddy instances.

🚀 What is Akosha?

Akosha is a universal memory aggregation system that collects, processes, and analyzes memories from multiple Session-Buddy instances (100-100,000 systems). It provides:

Semantic Search: Find relevant conversations across all systems using vector embeddings
Time-Series Analytics: Detect trends, anomalies, and correlations
Knowledge Graph: Cross-system entity relationships and path finding
Three-Tier Storage: Hot (in-memory) → Warm (on-disk) → Cold (Cloudflare R2)

Key Capabilities

✅ Privacy-First: Local ONNX embeddings, no external API calls required ✅ Scalable: Handles 100 to 100,000+ Session-Buddy instances ✅ Real-Time Analytics: Trend detection, anomaly spotting, cross-system correlation ✅ MCP Protocol: Exposes all capabilities via Model Context Protocol ✅ Production Ready: Comprehensive tests, graceful degradation, type-safe code

⚡ Quick Start

Prerequisites

Python 3.13+ (required for modern type hints)
UV package manager (recommended) or pip
DuckDB (automatically installed)
Optional: sentence-transformers for real embeddings (fallback available)

5-Minute Setup

# 1. Clone repository
git clone https://github.com/yourusername/akosha.git
cd akosha

# 2. Install dependencies
uv sync --group dev

# 3. Start Akosha MCP server
uv run python -m akosha.mcp

# 4. Verify installation
uv run python -c "from akosha.processing.embeddings import get_embedding_service; print('✅ Akosha ready!')"

That's it! Akosha is now running and ready to aggregate memories.

🚀 Production Deployment

For production deployment with Kubernetes, monitoring, and security:

# 1. Review deployment guide
cat docs/DEPLOYMENT_GUIDE.md

# 2. Deploy to Kubernetes
kubectl apply -f kubernetes/

# 3. Verify deployment
kubectl get pods -n akosha
kubectl port-forward -n akosha svc/akosha-api 8000:8000

# 4. Check metrics
curl http://localhost:8000/metrics

See Deployment Guide for complete production setup.

🔧 Installation

Using UV (Recommended)

# Install all dependencies (development + production)
uv sync --group dev

# Install minimal dependencies only (production)
uv sync

# Verify installation
uv run pytest tests/unit/ -v

Using Pip

# Create virtual environment
python3.13 -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies
pip install -e ".[dev]"

# Verify installation
pytest tests/unit/ -v

Optional Dependencies

For real semantic embeddings (recommended):

# Using UV
uv add --optional embeddings sentence-transformers onnxruntime

# Using pip
pip install "akosha[embeddings]"

Note: Akosha works without these dependencies using deterministic fallback embeddings. Real embeddings provide better semantic search.

⚙️ Configuration

Environment Variables

Create a .env file in the Akosha directory:

# Cloudflare R2 Configuration (Cold Storage)
AKOSHA_COLD_BUCKET=your-bucket-name
AKOSHA_COLD_ENDPOINT=https://your-account.r2.cloudflarestorage.com
AKOSHA_COLD_REGION=auto

# Optional: Embedding Model
AKOSHA_EMBEDDING_MODEL=all-MiniLM-L6-v2

# Optional: Storage Paths
AKOSHA_HOT_PATH=/tmp/akosha/hot
AKOSHA_WARM_PATH=/tmp/akosha/warm

🔌 MCP Server Setup

Global Configuration (Recommended)

Add to ~/.claude/.mcp.json:

{
  "mcpServers": {
    "akosha": {
      "command": "python",
      "args": ["-m", "akosha.mcp"],
      "cwd": "/path/to/akosha",
      "env": {
        "PYTHONPATH": "/path/to/akosha"
      }
    }
  }
}

Note: Replace /path/to/akosha with the actual path to your Akosha installation.

Project-Level Configuration (Alternative)

You can also create a project-level .mcp.json in the Akosha directory for development:

# Create .mcp.json in Akosha directory
cat > .mcp.json << 'EOF'
{
  "mcpServers": {
    "akosha": {
      "command": "uv",
      "args": ["run", "python", "-m", "akosha.mcp"],
      "cwd": "."
    }
  }
}
EOF

Note: Project-level configuration is optional. Use either global or project-level config, not both.

💡 Usage Examples

1. Generate Semantic Embeddings

from akosha.processing.embeddings import get_embedding_service

# Get singleton instance
embedding_service = get_embedding_service()
await embedding_service.initialize()

# Generate embedding
text = "How to implement JWT authentication in FastAPI"
embedding = await embedding_service.generate_embedding(text)

print(f"Embedding dimension: {len(embedding)}")  # 384
print(f"Mode: {'real' if embedding_service.is_available() else 'fallback'}")

2. Detect Trends in Metrics

from akosha.processing.analytics import TimeSeriesAnalytics
from datetime import datetime, timedelta, UTC

analytics = TimeSeriesAnalytics()

# Add metric data
now = datetime.now(UTC)
for i in range(20):
    await analytics.add_metric(
        metric_name="conversation_count",
        value=100 + i * 5,  # Increasing trend
        system_id="system-1",
        timestamp=now - timedelta(hours=20-i),
    )

# Analyze trend
trend = await analytics.analyze_trend(
    metric_name="conversation_count",
    system_id="system-1",
    time_window=timedelta(days=7),
)

print(f"Trend: {trend.trend_direction}")  # "increasing"
print(f"Strength: {trend.trend_strength:.2f}")  # 0.85+
print(f"Change: {trend.percent_change:.1f}%")  # +95%

3. Detect Anomalies

# Add normal data + anomalies
await analytics.add_metric("error_rate", 5.0, "system-1")
await analytics.add_metric("error_rate", 5.2, "system-1")
await analytics.add_metric("error_rate", 95.0, "system-1")  # Anomaly!
await analytics.add_metric("error_rate", 4.8, "system-1")

# Detect anomalies
anomalies = await analytics.detect_anomalies(
    metric_name="error_rate",
    system_id="system-1",
    threshold_std=2.5,
)

print(f"Found {anomalies.anomaly_count} anomalies")
for anomaly in anomalies.anomalies:
    print(f"  - Value: {anomaly['value']}, Z-score: {anomaly['z_score']:.2f}")

4. Cross-System Correlation

# Add correlated data for two systems
for i in range(20):
    base_value = 50.0 + i
    await analytics.add_metric("quality_score", base_value, "system-1")
    await analytics.add_metric("quality_score", base_value + 5, "system-2")

# Analyze correlations
correlation = await analytics.correlate_systems(
    metric_name="quality_score",
    time_window=timedelta(days=7),
)

print(f"Significant correlations: {len(correlation.system_pairs)}")
for pair in correlation.system_pairs:
    print(f"  {pair['system_1']} ↔ {pair['system_2']}: {pair['correlation']:.3f}")

🛠️ CLI Reference

Admin Shell

Launch the interactive admin shell for distributed intelligence operations:

akosha shell

The admin shell provides:

Intelligence Commands:
- aggregate() - Aggregate across systems
- search() - Search distributed memory
- detect() - Detect anomalies
- graph() - Query knowledge graph
- trends() - Analyze trends
Session Tracking: Automatic tracking via Session-Buddy MCP
IPython Features: Tab completion, magic commands, rich output

See Admin Shell Documentation for details.

Other Commands

# Show version
akosha version

# Show system information
akosha info

# Start Akosha server
akosha start --host 0.0.0.0 --port 8000

🏗️ Architecture

Three-Tier Storage

┌─────────────────────────────────────────────────────────┐
│                    Akosha System                         │
├─────────────────────────────────────────────────────────┤
│                                                          │
│  Hot Store (< 7 days)                                   │
│  ├─ DuckDB in-memory                                    │
│  ├─ FLOAT[384] embeddings (full precision)             │
│  └─ Sub-second queries                                  │
│                                                          │
│  Warm Store (7-90 days)                                 │
│  ├─ DuckDB on-disk                                      │
│  ├─ INT8[384] embeddings (75% size reduction)          │
│  └─ Date-based partitioning                             │
│                                                          │
│  Cold Store (> 90 days)                                 │
│  ├─ Parquet files on Cloudflare R2                     │
│  ├─ Extractive summaries (3 sentences)                 │
│  └─ Cost-effective long-term storage                    │
│                                                          │
└─────────────────────────────────────────────────────────┘

MCP Tools (11 Total)

Search Tools (3):

generate_embedding - Generate semantic embeddings
generate_batch_embeddings - Batch embedding generation
search_all_systems - Semantic search across all systems

Analytics Tools (4):

get_system_metrics - Get metrics and statistics
analyze_trends - Detect trends (increasing/decreasing/stable)
detect_anomalies - Find statistical outliers
correlate_systems - Cross-system correlation analysis

Graph Tools (3):

query_knowledge_graph - Query entities and relationships
find_path - Shortest path between entities
get_graph_statistics - Graph metrics and statistics

System Tools (1):

get_storage_status - Storage tier status

🧪 Development

Code Quality Standards

Type Hints: Required for all functions (modern Python 3.13+ syntax)
Docstrings: Comprehensive Google-style docstrings
Testing: 85%+ code coverage required
Linting: Ruff with strict settings
Complexity: Maximum 15 (Ruff default)

Running Development Commands

# Run linter
uv run ruff check akosha/

# Run type checker
uv run mypy akosha/

# Run tests
uv run pytest

# Run tests with coverage
uv run pytest --cov=akosha --cov-report=term-missing

# Run specific test file
uv run pytest tests/unit/test_embeddings.py -v

✅ Testing

Current Test Results

tests/unit/test_embeddings.py ............ (10 passing, 4 skipped)
tests/unit/test_analytics.py ............ (14 passing)
tests/integration/test_mcp_integration.py ........ (8 passing)

Total: 32/32 passing (100% pass rate)

Test Categories

Unit Tests (24 tests): Core functionality testing
Integration Tests (8 tests): End-to-end MCP workflows
Coverage: 76-97% for Phase 2 components

🗺️ Roadmap

✅ Phase 1: Foundation (COMPLETE)

Three-tier storage architecture
Basic ingestion pipeline
Knowledge graph construction
MCP server framework

✅ Phase 2: Advanced Features (COMPLETE)

ONNX embedding service
Time-series analytics
Cross-system correlation
11 MCP tools integrated

✅ Phase 3: Production Hardening (COMPLETE)

✅ Integration test suite (end-to-end testing)
✅ Load testing framework (Locust-based)
✅ Authentication & authorization (JWT + RBAC)
✅ Prometheus metrics collection
✅ Grafana dashboards (ingestion, query, storage)
✅ Prometheus alerting rules
✅ Kubernetes deployment manifests
✅ Security scanning pipeline

🚀 Phase 4: 100-System Pilot (READY TO START)

Deploy to production Kubernetes cluster
Onboard 10 pilot systems
Monitor SLO compliance (P50 <500ms, P99 <2s)
Scale to 100 systems
Validate cost projections

Timeline: 12 weeks total (Phase 1-3 complete, Phase 4 ready to begin)

See docs/ROADMAP.md for complete details.

🤝 Contributing

We welcome contributions! Please follow these guidelines:

Development Workflow

Fork and clone the repository
Create a feature branch: git checkout -b feature/your-feature
Install dependencies: uv sync --group dev
Make your changes following our code standards
Run tests: pytest
Run linter: ruff check akosha/
Commit with conventional commits: git commit -m "feat: add new feature"
Push and create PR: git push origin feature/your-feature

Code Standards

Type hints required on all functions
Docstrings required on all public APIs
Tests required for new features
Maximum complexity: 15 (Ruff)
Coverage: Maintain 85%+

📄 License

🙏 Acknowledgments

Session-Buddy: For the excellent MCP server patterns
Oneiric: For universal storage adapter framework
FastMCP: For elegant MCP protocol implementation
Sentence-Transformers: For all-MiniLM-L6-v2 model

Made with ❤️ by the Akosha team

आकाश (Akosha) - The sky has no limits

Project details

Release history Release notifications | RSS feed

0.4.2

May 2, 2026

0.4.1

Apr 14, 2026

0.4.0

Apr 3, 2026

0.3.3

Apr 3, 2026

0.3.2

Apr 3, 2026

0.3.1

Feb 17, 2026

0.3.0

Feb 13, 2026

This version

0.2.0

Feb 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

akosha-0.2.0.tar.gz (671.6 kB view details)

Uploaded Feb 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

akosha-0.2.0-py3-none-any.whl (129.1 kB view details)

Uploaded Feb 10, 2026 Python 3

File details

Details for the file akosha-0.2.0.tar.gz.

File metadata

Download URL: akosha-0.2.0.tar.gz
Upload date: Feb 10, 2026
Size: 671.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for akosha-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`7257ea53291a67d2f622e48044db7e0ba315e3659c91be025ce69a6cb3185505`
MD5	`304442c79c922419ecba14bc8c303bdc`
BLAKE2b-256	`1d71208bf20b1733273541f14ed02048f32639934633769c57e60e2fcafd6224`

See more details on using hashes here.

File details

Details for the file akosha-0.2.0-py3-none-any.whl.

File metadata

Download URL: akosha-0.2.0-py3-none-any.whl
Upload date: Feb 10, 2026
Size: 129.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for akosha-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cea6be83399429d17c450af335f12a4bb3a535f121a856b54e0dbb2e649b9cef`
MD5	`87dd4ce4d7cb02038ab7bf31f5dd7628`
BLAKE2b-256	`4ba4970dfb7c982204cf10e929bbd407026d2b75d147a941d27a97094fd04393`

See more details on using hashes here.

akosha 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Akosha (आकाश) - Universal Memory Aggregation System

🚀 What is Akosha?

Key Capabilities

⚡ Quick Start

Prerequisites

5-Minute Setup

🚀 Production Deployment

🔧 Installation

Using UV (Recommended)

Using Pip

Optional Dependencies

⚙️ Configuration

Environment Variables

🔌 MCP Server Setup

Global Configuration (Recommended)

Project-Level Configuration (Alternative)

💡 Usage Examples

1. Generate Semantic Embeddings

2. Detect Trends in Metrics

3. Detect Anomalies

4. Cross-System Correlation

🛠️ CLI Reference

Admin Shell

Other Commands

🏗️ Architecture

Three-Tier Storage

MCP Tools (11 Total)

🧪 Development

Code Quality Standards

Running Development Commands

✅ Testing

Current Test Results

Test Categories

🗺️ Roadmap

✅ Phase 1: Foundation (COMPLETE)

✅ Phase 2: Advanced Features (COMPLETE)

✅ Phase 3: Production Hardening (COMPLETE)

🚀 Phase 4: 100-System Pilot (READY TO START)

🤝 Contributing

Development Workflow

Code Standards

📄 License

🙏 Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes