Universal LLM adapter service for Sekha AI - Multi-provider bridge supporting 100+ LLMs via LiteLLM

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Sekha

These details have not been verified by PyPI

Project links

Project description

Sekha LLM Bridge

Universal LLM Adapter - The Bridge Between Memory and Intelligence

🎯 What is Sekha LLM Bridge?

LLM-Bridge is a REQUIRED component of the Sekha ecosystem. It acts as the universal adapter layer that enables the Sekha Controller to work with any LLM provider - from local Ollama to cloud services like OpenAI, Anthropic, and Google.

Why is it Required?

The Controller (Rust) focuses on memory orchestration, storage, and retrieval. LLM-Bridge (Python) handles all LLM-specific operations, providing:

Provider Abstraction: Switch between Ollama, GPT-4, Claude, Gemini without changing Controller code
Universal Compatibility: Powered by LiteLLM for 100+ LLM providers
Async Processing: Celery-based task queue for expensive LLM operations
Retry Logic: Automatic retries with exponential backoff for reliability
Type Safety: Pydantic models for request/response validation

🏗️ Architecture Role

┌─────────────────────────────────────────┐
│      Sekha Controller (Rust)            │
│  • Memory Orchestration                 │
│  • Context Assembly                     │
│  • Storage (SQLite + Chroma)            │
└──────────────┬──────────────────────────┘
               │ HTTP Calls
               ▼
┌─────────────────────────────────────────┐
│      LLM-Bridge (Python) ← YOU ARE HERE │
│  • Universal LLM Adapter                │
│  • Embedding Generation                 │
│  • Summarization                        │
│  • Entity Extraction                    │
│  • Importance Scoring                   │
└──────────────┬──────────────────────────┘
               │ LiteLLM
               ▼
    ┌──────────┴────────────┐
    │                       │
    ▼                       ▼
┌─────────┐            ┌──────────┐
│ Ollama  │            │ OpenAI   │
│ (Local) │            │ GPT-4    │
└─────────┘            └──────────┘
    ▼                        ▼
┌─────────┐            ┌──────────┐
│Anthropic│            │  Google  │
│ Claude  │            │  Gemini  │
└─────────┘            └──────────┘

Multi-LLM Workflow Example:

Morning: Use Claude for code review → Sekha captures via Bridge
Afternoon: Switch to ChatGPT for docs → Bridge forwards to OpenAI
Evening: Use Ollama locally for planning → Bridge uses local LLM
All stored in unified sekha.db regardless of which LLM was used!

✨ Features

Core Services

Endpoint	Purpose	Used By
`POST /embed`	Generate embeddings for semantic search	Controller (on conversation storage)
`POST /summarize`	Hierarchical summarization (daily/weekly/monthly)	Controller orchestrator
`POST /extract`	Extract entities from conversations	Controller (future: auto-labeling)
`POST /score`	Score conversation importance (1-10)	Controller pruning engine
`POST /v1/chat/completions`	OpenAI-compatible chat endpoint	Proxy (optional component)

Current Capabilities

✅ Ollama Integration: Full support for local LLMs
✅ LiteLLM Powered: Ready for 100+ providers (OpenAI, Anthropic, etc.)
✅ Async Processing: Celery task queue for background jobs
✅ Retry Logic: 3 retries with exponential backoff
✅ Health Monitoring: /health endpoint with model availability checks
✅ Prometheus Metrics: /metrics for observability

Supported LLM Providers (via LiteLLM)

Currently Tested:

Ollama (nomic-embed-text, llama3.1, etc.)

Ready to Enable:

OpenAI (GPT-4, GPT-3.5-turbo, text-embedding-ada-002)
Anthropic (Claude 3 Opus, Sonnet, Haiku)
Google (Gemini Pro, Gemini Flash)
Cohere (Command, Embed)
Azure OpenAI
AWS Bedrock
100+ more via LiteLLM

🚀 Quick Start

Installation

# From PyPI (recommended)
pip install sekha-llm-bridge

# Or from source
git clone https://github.com/sekha-ai/sekha-llm-bridge.git
cd sekha-llm-bridge
pip install -e .

With Docker (Full Stack)

LLM-Bridge is included in the full Sekha stack:

git clone https://github.com/sekha-ai/sekha-docker.git
cd sekha-docker/docker
cp .env.example .env

# Edit .env to configure your LLM provider
nano .env

docker compose -f docker-compose.prod.yml up -d

Standalone Development

# Configure (copy and edit)
cp .env.example .env

# Start Redis (required for Celery)
docker run -d -p 6379:6379 redis:7-alpine

# Run
python -m sekha_llm_bridge.main

⚙️ Configuration

Environment Variables

# Server
HOST=0.0.0.0
PORT=5001

# Ollama (local LLMs)
OLLAMA_URL=http://localhost:11434
EMBEDDING_MODEL=nomic-embed-text:latest
SUMMARIZATION_MODEL=llama3.1:8b

# Redis (Celery task queue)
REDIS_URL=redis://localhost:6379/0

# Cloud Providers (optional)
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...

# Logging
LOG_LEVEL=INFO

Using Different LLM Providers

Switch to OpenAI:

EMBEDDING_MODEL=text-embedding-3-small
SUMMARIZATION_MODEL=gpt-4o-mini
OPENAI_API_KEY=sk-...

Switch to Claude:

SUMMARIZATION_MODEL=claude-3-5-sonnet-20241022
ANTHROPIC_API_KEY=sk-ant-...

LiteLLM automatically routes to the correct provider based on model name!

📡 API Reference

POST /embed

Generate embedding for text.

Request:

{
  "text": "What is the meaning of life?",
  "model": "nomic-embed-text:latest"  // optional
}

Response:

{
  "embedding": [0.123, -0.456, ...],  // 768-dim vector
  "model": "nomic-embed-text:latest",
  "dimension": 768,
  "tokens_used": 42
}

POST /summarize

Generate hierarchical summary.

Request:

{
  "messages": [
    "User discussed Python best practices",
    "Assistant recommended type hints"
  ],
  "level": "daily",  // daily | weekly | monthly
  "model": "llama3.1:8b",  // optional
  "max_words": 200
}

Response:

{
  "summary": "Discussed Python type hints and best practices...",
  "level": "daily",
  "model": "llama3.1:8b",
  "message_count": 2,
  "tokens_used": 156
}

POST /v1/chat/completions

OpenAI-compatible chat endpoint.

Request:

{
  "model": "llama3.1:8b",
  "messages": [
    {"role": "user", "content": "Hello!"}
  ]
}

Response: Standard OpenAI format

🔧 Development

Setup

# Install dev dependencies
pip install -e ".[dev]"

# Or with Poetry
poetry install --with dev

Testing

# Run tests
pytest

# With coverage
pytest --cov=sekha_llm_bridge --cov-report=html

# Type checking
mypy src/

# Linting
ruff check .
black --check .

Project Structure

sekha-llm-bridge/
├── src/
│   └── sekha_llm_bridge/
│       ├── main.py              # FastAPI app
│       ├── config.py            # Settings
│       ├── models.py            # Pydantic models
│       ├── tasks.py             # Celery tasks
│       ├── services/
│       │   ├── embedding_service.py
│       │   ├── summarization_service.py
│       │   ├── entity_extraction_service.py
│       │   └── importance_scorer.py
│       └── utils/
│           └── llm_client.py    # LiteLLM wrapper
├── tests/
├── requirements.txt
└── pyproject.toml

🤝 Integration with Controller

The Controller calls LLM-Bridge for:

Embedding Generation: When storing new conversations

let embedding = llm_bridge.embed_text(&message_content).await?;

Summarization: For hierarchical summaries

let summary = llm_bridge.summarize(messages, "daily").await?;

Importance Scoring: For pruning decisions

let score = llm_bridge.score_importance(&message).await?;

All operations are async and include automatic retries.

📊 Monitoring

Health Check

curl http://localhost:5001/health

Response:

{
  "status": "healthy",
  "timestamp": "2026-01-25T20:00:00Z",
  "ollama_status": {
    "status": "healthy",
    "models_available": ["nomic-embed-text:latest", "llama3.1:8b"]
  }
}

Prometheus Metrics

curl http://localhost:5001/metrics

📝 Changelog

See CHANGELOG.md for full release history.

🗺️ Roadmap

Q1 2026

Ollama integration
LiteLLM foundation
OpenAI production testing
Anthropic Claude integration
Google Gemini support

Q2 2026

Multi-provider load balancing
Cost tracking per provider
Custom model fine-tuning support
Streaming responses

🔗 Related Projects

sekha-controller - Memory orchestration (Rust)
sekha-proxy - Transparent LLM proxy (optional)
sekha-mcp - MCP server for Claude Desktop
sekha-docker - Full stack deployment

📚 Documentation

Full docs: docs.sekha.dev

📄 License

AGPL-3.0-or-later - License Details

🙋 Support

Issues: GitHub Issues
Discord: Join our Discord
Email: dev@sekha-ai.dev

Built with ❤️ by the Sekha AI team

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Sekha

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

Feb 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sekha_llm_bridge-0.2.0.tar.gz (55.3 kB view details)

Uploaded Feb 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sekha_llm_bridge-0.2.0-py3-none-any.whl (64.6 kB view details)

Uploaded Feb 16, 2026 Python 3

File details

Details for the file sekha_llm_bridge-0.2.0.tar.gz.

File metadata

Download URL: sekha_llm_bridge-0.2.0.tar.gz
Upload date: Feb 16, 2026
Size: 55.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sekha_llm_bridge-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`7992488681e093e428656db207f84645f17a5ec3e8f24234ebdccec3f16376a3`
MD5	`36ee17dd39cee37cc700f5e32ff104e0`
BLAKE2b-256	`5d8d14dc5204b6e01c0959fae1088a0e622212a37135de6900fcef4f6eb75d6d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sekha_llm_bridge-0.2.0.tar.gz:

Publisher: pypi-release.yml on sekha-ai/sekha-llm-bridge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sekha_llm_bridge-0.2.0.tar.gz
- Subject digest: 7992488681e093e428656db207f84645f17a5ec3e8f24234ebdccec3f16376a3
- Sigstore transparency entry: 955947099
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: sekha-ai/sekha-llm-bridge@b276c05562135d95c6ef2a98d2ee9dd169cef587
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/sekha-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-release.yml@b276c05562135d95c6ef2a98d2ee9dd169cef587
- Trigger Event: push

File details

Details for the file sekha_llm_bridge-0.2.0-py3-none-any.whl.

File metadata

Download URL: sekha_llm_bridge-0.2.0-py3-none-any.whl
Upload date: Feb 16, 2026
Size: 64.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sekha_llm_bridge-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d79beb71d7640576fb529dde536345b4aeb6b7736ed581114dcb232e5047057c`
MD5	`5e69a2765770a87400bb4558c5a8ca4c`
BLAKE2b-256	`9d079e4f4dc54575875011b237dbde197458aa566c919d1bbaa6108767526b45`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sekha_llm_bridge-0.2.0-py3-none-any.whl:

Publisher: pypi-release.yml on sekha-ai/sekha-llm-bridge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sekha_llm_bridge-0.2.0-py3-none-any.whl
- Subject digest: d79beb71d7640576fb529dde536345b4aeb6b7736ed581114dcb232e5047057c
- Sigstore transparency entry: 955947107
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: sekha-ai/sekha-llm-bridge@b276c05562135d95c6ef2a98d2ee9dd169cef587
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/sekha-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-release.yml@b276c05562135d95c6ef2a98d2ee9dd169cef587
- Trigger Event: push

sekha-llm-bridge 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Sekha LLM Bridge

🎯 What is Sekha LLM Bridge?

Why is it Required?

🏗️ Architecture Role

✨ Features

Core Services

Current Capabilities

Supported LLM Providers (via LiteLLM)

🚀 Quick Start

Installation

With Docker (Full Stack)

Standalone Development

⚙️ Configuration

Environment Variables

Using Different LLM Providers

📡 API Reference

POST /embed

POST /summarize

POST /v1/chat/completions

🔧 Development

Setup

Testing

Project Structure

🤝 Integration with Controller

📊 Monitoring

Health Check

Prometheus Metrics

📝 Changelog

🗺️ Roadmap

Q1 2026

Q2 2026

🔗 Related Projects

📚 Documentation

📄 License

🙋 Support

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance