Protocol-first, vendor-neutral SDK for LLM, Embedding, Vector, and Graph backends.

These details have not been verified by PyPI

Project description

Corpus OS Protocol and SDK

Version Python License LLM Protocol Vector Protocol Graph Protocol Embedding Protocol

Reference implementation of the Corpus OS Protocol and SDK — a wire-first, vendor-neutral SDK for interoperable AI frameworks and data backends across four domains: LLM, Embedding, Vector, and Graph.

┌──────────────────────────────────────────────────────────────────────┐
│  Your App / Agents / RAG Pipelines                                   │
│  (LangChain · LlamaIndex · Semantic Kernel · CrewAI · AutoGen · MCP) │
├──────────────────────────────────────────────────────────────────────┤
│  Corpus OS Protocol and SDK.                                         │
│  One protocol · One error taxonomy · One metrics model               │
├──────────┬──────────────┬────────────┬───────────────────────────────┤
│ LLM/v1   │ Embedding/v1 │ Vector/v1  │ Graph/v1                      │
├──────────┴──────────────┴────────────┴───────────────────────────────┤
│  Any Provider: OpenAI · Anthropic · Pinecone · Neo4j · ...           │
└──────────────────────────────────────────────────────────────────────┘

Keep your frameworks. Standardize your infrastructure.

Open-Core Model: The Corpus OS Protocol Suite and SDK are fully open source (Apache-2.0). Corpus Router and official production adapters are commercial, optional, and built on the same public protocols. Using this SDK does not lock you into CORPUS Router.

Why CORPUS
How CORPUS Compares
When Not to Use CORPUS
Install
Quick Start
Domain Examples
Core Concepts
Error Taxonomy & Observability
Performance & Configuration
Testing & Conformance
Documentation Layout
FAQ
Contributing
License & Commercial Options

Why Corpus OS

Modern AI platforms juggle multiple LLM, embedding, vector, and graph backends. Each vendor ships unique APIs, error schemes, rate limits, and capabilities — making cross-provider integration brittle and costly.

The problem:

Provider proliferation — Dozens of incompatible APIs across AI infrastructure
Duplicate integration — Different error handling, observability, and resilience patterns rewritten per provider and framework
Vendor lock-in — Applications tightly coupled to specific backend choices
Operational complexity — Inconsistent monitoring and debugging across services

Corpus OS provides:

Stable, runtime-checkable protocols across all four domains
Normalized errors with retry hints and machine-actionable scopes
SIEM-safe metrics (low-cardinality, tenant-hashed, no PII)
Deadline propagation for cancellation and cost control
Two modes — compose under your own router (thin) or use lightweight built-in infra (standalone)
Wire-first design — canonical JSON envelopes implementable in any language, with this SDK as reference

Corpus OS is not a replacement for LangChain, LlamaIndex, Semantic Kernel, CrewAI, AutoGen, or MCP. Use those for agent-specific orchestration, agents, tools, and RAG pipelines. Use Corpus OS to standardize the infrastructure layer underneath them. Your app teams keep their frameworks. Your platform team gets one protocol, one error taxonomy, and one observability model across everything.

How Corpus OS Compares

Aspect	LangChain / LlamaIndex	OpenRouter	MCP	Corpus OS
Scope	Application framework	LLM unification	Tools & data sources	AI infrastructure protocols
Domains	LLM + Tools	LLM only	Tools + Data	LLM + Vector + Graph + Embedding
Error Standardization	Partial	Limited	N/A	Comprehensive taxonomy
Multi-Provider Routing	Basic	Managed service	N/A	Protocol for any router
Observability	Basic	Limited	N/A	Built-in metrics + tracing
Vendor Neutrality	High	Service-dependent	High	Protocol-first, no lock-in

Who is this for?

App developers — Keep using your framework of choice. Talk to all backends through Corpus OS protocols. Swap providers or frameworks without rewriting integration code.
Framework maintainers — Implement one CORPUS adapter per protocol. Instantly support any conformant backend.
Backend vendors — Implement llm/v1, embedding/v1, vector/v1, or graph/v1 once, run the conformance suite, and your service works with every framework.
Platform / infra teams — Unified observability: normalized error codes, deadlines, and metrics. One set of dashboards and SLOs across all AI traffic.
MCP users — The Corpus OS MCP server exposes protocols as standard MCP tools. Any MCP client can call into your infra with consistent behavior.

Integration Patterns

Pattern	How It Works	What You Get
Framework → Corpus OS → Providers	Framework uses Corpus OS as client	Unified errors/metrics across providers
Corpus OS → Framework-as-adapter → Providers	Framework wrapped as Corpus OS adapter	Reuse existing chains/indices as "providers"
Mixed	Both of the above	Gradual migration, no big-bang rewrites

Large teams typically run all three patterns at once.

When Not to Use CORPUS

You probably don't need Corpus OS if:

Single-provider and happy — One backend, fine with their SDK and breaking changes.
No governance pressure — No per-tenant isolation, budgets, audit trails, or data residency.
No cross-domain orchestration — Not coordinating LLM + Vector + Graph + Embedding together.
Quick throwaway prototype — Lock-in, metrics, and resilience aren't worth thinking about yet.

If any of these stop being true, corpus_sdk is the incremental next step.

Install

pip install corpus_sdk

Python ≥ 3.10 recommended. No heavy runtime dependencies.

⚡ Quick Start

from corpus_sdk.llm.llm_base import BaseLLMAdapter, OperationContext

class QuickAdapter(BaseLLMAdapter):
    async def _do_complete(self, messages, **kwargs):
        return {"text": "Hello from CORPUS!", "model": "quick-demo"}

adapter = QuickAdapter()
ctx = OperationContext(request_id="test-123")
result = await adapter.complete(
    messages=[{"role": "user", "content": "Hi"}], ctx=ctx
)
print(result.text)  # "Hello from CORPUS!"

A complete quick start with all four protocols is in docs/guides/QUICK_START.md.

Domain Examples

Minimal viable adapter: Implement _do_capabilities, your core operation (_do_embed, _do_complete, _do_query, etc.), and _do_health. All other methods have safe no-op defaults — you only override what you need.

In all examples, swap Example*Adapter with your actual adapter class that inherits the corresponding base and implements _do_* hooks.

Embeddings

from corpus_sdk.embedding.embedding_base import (
    BaseEmbeddingAdapter, EmbedSpec, OperationContext,
    EmbeddingVector, EmbeddingCapabilities, EmbedResult
)

class ExampleEmbeddingAdapter(BaseEmbeddingAdapter):
    async def _do_capabilities(self) -> EmbeddingCapabilities:
        return EmbeddingCapabilities(
            server="example-embeddings", version="1.0.0",
            supported_models=("example-embed-001",),
            max_batch_size=128, max_text_length=8192,
            supports_normalization=True, normalizes_at_source=False,
            supports_deadline=True, supports_token_counting=False,
        )

    async def _do_embed(self, spec: EmbedSpec, *, ctx=None) -> EmbedResult:
        vec = [0.1, 0.2, 0.3]
        return EmbedResult(
            embedding=EmbeddingVector(vector=vec, text=spec.text,
                                      model=spec.model, dimensions=len(vec)),
            model=spec.model, text=spec.text,
            tokens_used=None, truncated=False,
        )

    async def _do_health(self, *, ctx=None) -> dict:
        return {"ok": True, "server": "example-embeddings", "version": "1.0.0"}

# Usage
async with ExampleEmbeddingAdapter() as adapter:
    ctx = OperationContext(request_id="req-1", tenant="acme")
    res = await adapter.embed(
        EmbedSpec(text="hello world", model="example-embed-001"), ctx=ctx
    )
    print(res.embedding.vector)

LLM

from corpus_sdk.llm.llm_base import (
    BaseLLMAdapter, OperationContext, LLMCompletion,
    TokenUsage, LLMCapabilities
)

class ExampleLLMAdapter(BaseLLMAdapter):
    async def _do_capabilities(self) -> LLMCapabilities:
        return LLMCapabilities(
            server="example-llm", version="1.0.0",
            model_family="gpt-4", max_context_length=8192,
            supports_streaming=True, supports_roles=True,
            supports_json_output=False, supports_parallel_tool_calls=False,
            idempotent_writes=False, supports_multi_tenant=True,
            supports_system_message=True,
        )

    async def _do_complete(self, messages, model, **kwargs) -> LLMCompletion:
        return LLMCompletion(
            text="Hello from example-llm!", model=model,
            model_family="gpt-4",
            usage=TokenUsage(prompt_tokens=5, completion_tokens=5, total_tokens=10),
            finish_reason="stop",
        )

    async def _do_health(self, *, ctx=None) -> dict:
        return {"ok": True, "server": "example-llm", "version": "1.0.0"}

# Usage
async with ExampleLLMAdapter() as adapter:
    ctx = OperationContext(request_id="req-2", tenant="acme")
    resp = await adapter.complete(
        messages=[{"role": "user", "content": "Say hi"}],
        model="example-llm-001", ctx=ctx,
    )
    print(resp.text)

Vector

from corpus_sdk.vector.vector_base import (
    BaseVectorAdapter, VectorCapabilities, QuerySpec,
    QueryResult, Vector, VectorMatch, OperationContext, VectorID
)

class ExampleVectorAdapter(BaseVectorAdapter):
    async def _do_capabilities(self) -> VectorCapabilities:
        return VectorCapabilities(
            server="example-vector", version="1.0.0", max_dimensions=3
        )

    async def _do_query(self, spec: QuerySpec, *, ctx=None) -> QueryResult:
        v = Vector(id=VectorID("v1"), vector=[0.1, 0.2, 0.3],
                   metadata={"label": "demo"}, namespace=spec.namespace)
        return QueryResult(
            matches=[VectorMatch(vector=v, score=0.99, distance=0.01)],
            query_vector=spec.vector, namespace=spec.namespace, total_matches=1,
        )

    async def _do_health(self, *, ctx=None) -> dict:
        return {"ok": True, "server": "example-vector", "version": "1.0.0"}

# Usage
adapter = ExampleVectorAdapter()
ctx = OperationContext(request_id="req-3", tenant="acme")
result = await adapter.query(QuerySpec(vector=[0.1, 0.2, 0.3], top_k=1), ctx=ctx)
print(result.matches[0].score)

Graph

from corpus_sdk.graph.graph_base import (
    BaseGraphAdapter, GraphCapabilities, UpsertNodesSpec,
    Node, GraphID, OperationContext, GraphQuerySpec, GraphQueryResult
)

class ExampleGraphAdapter(BaseGraphAdapter):
    async def _do_capabilities(self) -> GraphCapabilities:
        return GraphCapabilities(
            server="example-graph", version="1.0.0",
            supported_query_dialects=("cypher",),
            supports_stream_query=True, supports_bulk_vertices=True,
            supports_batch=True, supports_schema=True,
        )

    async def _do_query(self, spec: GraphQuerySpec, *, ctx=None) -> GraphQueryResult:
        return GraphQueryResult(
            records=[{"id": 1, "name": "Ada"}],
            summary={"rows": 1}, dialect=spec.dialect,
            namespace=spec.namespace,
        )

    async def _do_health(self, *, ctx=None) -> dict:
        return {"ok": True, "server": "example-graph", "version": "1.0.0"}

# Usage
async with ExampleGraphAdapter() as adapter:
    ctx = OperationContext(request_id="req-4", tenant="acme")
    result = await adapter.upsert_nodes(
        UpsertNodesSpec(nodes=[
            Node(id=GraphID("user:1"), labels=("User",),
                 properties={"name": "Ada"})
        ]),
        ctx=ctx,
    )
    print(f"Upserted {result.upserted_count} nodes")

Full implementations with batch operations, streaming, and multi-cloud scenarios are in docs/guides/ADAPTER_RECIPES.md.

Core Concepts

Protocol vs Base — Protocols define required behavior. Bases implement validation, deadlines, observability, and error normalization. You implement _do_* hooks.
OperationContext — Carries request_id, idempotency_key, deadline_ms, traceparent, tenant, and cache hints across all operations.
Wire Protocol — Canonical envelopes (op, ctx, args) and response shapes (ok, code, result) defined in docs/spec/PROTOCOL.md.
Corpus OS-Compatible — Implementations that honor the envelopes, reserved op strings, and error taxonomy. Validated by the conformance suite.

Error Taxonomy & Observability

All domains share a normalized error taxonomy: BadRequest, AuthError, ResourceExhausted, TransientNetwork, Unavailable, NotSupported, DeadlineExceeded, plus domain-specific errors like TextTooLong, ModelOverloaded, DimensionMismatch, and IndexNotReady.

Errors carry machine-actionable hints (retry_after_ms, throttle_scope) so routers and control planes can react consistently across providers. A pluggable MetricsSink protocol lets you bring your own metrics backend. Bases emit one observe per operation, hash tenants before recording, and never log prompts, vectors, or raw tenant IDs.

Full details in docs/spec/ERRORS.md and docs/spec/METRICS.md.

Performance & Configuration

Base overhead is typically <10 ms relative to vendor SDK calls: validation <1 ms, metrics <0.1 ms, cache lookup (standalone) <0.5 ms. Async-first design avoids blocking and supports high concurrency. Batch operations (embed_batch, vector upserts, graph batch) are preferred for throughput.

Benchmarks and deployment patterns in docs/guides/IMPLEMENTATION.md.

Modes: `thin` vs `standalone`

Once you're ready for production, choose a mode:

Mode	Infra Hooks	When to Use
`thin` (default)	All no-ops	You have an external control plane (router, scheduler, limiter)
`standalone`	Deadline enforcement, circuit breaker, token-bucket limiter, in-memory TTL cache	Lightweight deployments without external infra

Use thin under a router to prevent double-stacking resiliency. Use standalone for prototyping or single-service deployments.

Testing & Conformance

One-Command Testing

# All protocols at once
make test-all-conformance

# Specific protocols
make test-llm-conformance
make test-vector-conformance
make test-graph-conformance
make test-embedding-conformance

CLI

corpus-sdk verify                        # All protocols
corpus-sdk verify -p llm -p vector       # Selected protocols
corpus-sdk test-llm-conformance          # Single protocol

Direct pytest

pytest tests/ -v --cov=corpus_sdk --cov-report=html

Requirements for "CORPUS-Compatible" certification are in docs/conformance/CERTIFICATION.md.

📚 Documentation Layout

Spec (normative): docs/spec/

File	Contents
`SPECIFICATION.md`	Full protocol suite specification (all domains, cross-cutting behavior)
`PROTOCOL.md`	Wire-level envelopes, streaming semantics, canonical `op` registry
`ERRORS.md`	Canonical error taxonomy & mapping rules
`METRICS.md`	Metrics schema & SIEM-safe observability
`SCHEMA.md`	JSON/type shapes
`VERSIONING.md`	Semantic versioning & compatibility rules

Guides (how-to): docs/guides/

File	Contents
`QUICK_START.md`	End-to-end flows for all four protocols
`IMPLEMENTATION.md`	How to implement adapters
`ADAPTER_RECIPES.md`	Real-world multi-cloud and RAG scenarios
`CONFORMANCE_GUIDE.md`	How to run & interpret conformance suites

Conformance (testing): docs/conformance/ — Per-protocol test specs, schema conformance, behavioral conformance, and certification requirements.

FAQ

Is the SDK open source?

Yes. The SDK (protocols, bases, example adapters) is open source under Apache-2.0. CORPUS Router and official production adapters are commercial.

Do I have to use CORPUS Router?

No. The SDK composes with any router or control plane. CORPUS Router is optional and adheres to the same public protocols.

How does CORPUS relate to LangChain / LlamaIndex / MCP / OpenRouter?

They're complementary. LangChain/LlamaIndex are application frameworks. MCP standardizes tools and data sources. OpenRouter unifies LLM providers. CORPUS standardizes the infrastructure layer (LLM + Vector + Graph + Embedding) underneath all of them with consistent errors, metrics, and capabilities discovery.

Why async-only?

Modern AI workloads require high concurrency. Async-first prevents blocking the event loop. Sync wrappers can be built on top if needed.

What happens if my adapter raises a non-normalized error?

Bases catch unexpected exceptions and record them as UnhandledException in metrics. Wrap provider errors in normalized exceptions for proper handling.

Can CORPUS Router run on-prem?

Yes. Available as managed cloud or on-prem deployment for regulated and air-gapped environments.

Troubleshooting

Problem	Solution
Double-stacked resiliency (timeouts firing twice)	Ensure `mode="thin"` under your router
Circuit breaker opens frequently	Reduce concurrency or switch to `thin` with external circuit breaker
Cache returns stale results	Verify sampling params in cache key; check `cache_ttl_s`
`DeadlineExceeded` on fast operations	Ensure `deadline_ms` is absolute epoch time, not relative. Check NTP sync.
Health check failures	Inspect `_do_health` implementation; verify backend reachability and credentials

# Debug mode
import logging
logging.basicConfig(level=logging.DEBUG)
logging.getLogger("corpus_sdk").setLevel(logging.DEBUG)

Contributing

git clone https://github.com/corpus/corpus-sdk.git
cd corpus-sdk
pip install -e ".[dev]"
pytest

Follow PEP-8 (ruff/black). Type hints required on all public APIs. Include tests for new features. Maintain low-cardinality metrics — never add PII to extra fields. Observe SemVer.

We especially welcome community adapter contributions for new LLM, vector, graph, and embedding backends.

Community questions: GitHub Discussions preferred.

License & Commercial Options

License: Apache-2.0 (LICENSE)

Need	Solution	Cost
Learning / prototyping	`corpus_sdk` + example adapters	Free (OSS)
Production with your own infra	`corpus_sdk` + your adapters	Free (OSS)
Production with official adapters	`corpus_sdk` + Official Adapters	Commercial
Enterprise multi-provider	`corpus_sdk` + CORPUS Router (Managed or On-Prem)	Commercial

Contact: sales@corpusos.com · partners@corpusos.com

Built by the Corpus OS team — wire-level AI infrastructure you integrate once and stop thinking about.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

1.0.0

Feb 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corpus_sdk-1.0.0.tar.gz (509.6 kB view details)

Uploaded Feb 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

corpus_sdk-1.0.0-py3-none-any.whl (534.4 kB view details)

Uploaded Feb 24, 2026 Python 3

File details

Details for the file corpus_sdk-1.0.0.tar.gz.

File metadata

Download URL: corpus_sdk-1.0.0.tar.gz
Upload date: Feb 24, 2026
Size: 509.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for corpus_sdk-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`c150969fbd1beebcad7c8f885922e7c26704f7bd6db3115dacc411a40b507b74`
MD5	`1d0944183c3258a9e5165d1ead66998c`
BLAKE2b-256	`2a74d5148d2ac903054da27fdbc3e47975b71e520fa4b50d78786648df7ae7fc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for corpus_sdk-1.0.0.tar.gz:

Publisher: publish.yml on Corpus-OS/corpusos

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: corpus_sdk-1.0.0.tar.gz
- Subject digest: c150969fbd1beebcad7c8f885922e7c26704f7bd6db3115dacc411a40b507b74
- Sigstore transparency entry: 984844313
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: Corpus-OS/corpusos@7658778de8cd323fed0eae46539b72c84f8aa687
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/Corpus-OS
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@7658778de8cd323fed0eae46539b72c84f8aa687
- Trigger Event: release

File details

Details for the file corpus_sdk-1.0.0-py3-none-any.whl.

File metadata

Download URL: corpus_sdk-1.0.0-py3-none-any.whl
Upload date: Feb 24, 2026
Size: 534.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for corpus_sdk-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6db1711cbbbd7f5034dfad7642a04994153d0dfea1c6dee9de9c774a27822faf`
MD5	`703dcc5d721e38ff3f5cad32656d8cf5`
BLAKE2b-256	`6944aefaa9591b3c32a69f7ed4716dd141e63f67ba5bcb0318313e6bf7955a17`

See more details on using hashes here.

Provenance

The following attestation bundles were made for corpus_sdk-1.0.0-py3-none-any.whl:

Publisher: publish.yml on Corpus-OS/corpusos

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: corpus_sdk-1.0.0-py3-none-any.whl
- Subject digest: 6db1711cbbbd7f5034dfad7642a04994153d0dfea1c6dee9de9c774a27822faf
- Sigstore transparency entry: 984844317
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: Corpus-OS/corpusos@7658778de8cd323fed0eae46539b72c84f8aa687
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/Corpus-OS
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@7658778de8cd323fed0eae46539b72c84f8aa687
- Trigger Event: release

corpus-sdk 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Corpus OS Protocol and SDK

Table of Contents

Why Corpus OS

How Corpus OS Compares

Who is this for?

Integration Patterns

When Not to Use CORPUS

Install

⚡ Quick Start

Domain Examples

Core Concepts

Error Taxonomy & Observability

Performance & Configuration

Modes: thin vs standalone

Testing & Conformance

One-Command Testing

CLI

Direct pytest

📚 Documentation Layout

FAQ

Troubleshooting

Contributing

License & Commercial Options

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Modes: `thin` vs `standalone`