Skip to main content

Largestack AI — production-grade candidate framework for typed agents, tools, RAG, guardrails, and orchestration

Project description

Largestack AI

PyPI version Python License

Largestack AI is a Python 3.11+ production-grade candidate framework for typed agents, tools, RAG, guardrails, observability, and orchestration.

It is designed for developers who want to build real AI systems without starting from a blank file: support-ticket agents, RAG assistants, code reviewers, workflow automations, BFSI governance flows, and enterprise-style AI copilots.

Current status: v1.0 Release Candidate / controlled-pilot ready. Ubuntu, Mac evidence, Windows clean validation, Docker, security, package, DeepSeek live validation, and 24-hour soak evidence have passed.

Install

pip install largestack

Verify:

largestack --help
python -c "import largestack; print(largestack.__version__)"

Why Largestack?

Most agent frameworks solve only one layer: agents, chains, RAG, or observability. Largestack brings the main production surfaces together:

Layer What Largestack provides
Agents Agent, typed agents, role-based agents, multi-agent teams
Tools Safe tool calling, schemas, retries, timeout controls, approval policies
Workflows Sequential, parallel, router, supervisor, graph/DAG-style orchestration
RAG Loaders, chunking, retrievers, rerankers, vector stores, citations, no-answer behavior
Guardrails PII checks, injection controls, topic/sensitive data policies, tool/provider policies
Memory Buffer, long-term, vector-backed, shared and isolated memory patterns
Observability Traces, cost tracking, event logs, dashboard APIs, OTEL helpers
Enterprise RBAC, audit trail, tenant scoping, SSO/session modules, payment/billing scaffolds
Deployment Docker, Compose, Helm charts, CI validation, release evidence
Testing Unit, integration, security, RAG eval, live provider validation, generated project checks

Development quickstart

1. Open a source checkout

# Public GitHub clone URL should be added after repository visibility is enabled.
cd largestack

2. Create environment

python3.12 -m venv .venv
source .venv/bin/activate
python -m pip install -U pip setuptools wheel

3. Install editable development dependencies

For normal source development:

python -m pip install -e ".[dev]"

For CPU-only PyTorch dependency resolution on Linux/macOS:

PIP_EXTRA_INDEX_URL=https://download.pytorch.org/whl/cpu \
python -m pip install -e ".[dev]"

4. Run a first validation

python -m pytest tests/unit/test_memory.py -q --tb=short

5. Run the full suite

python -m pytest tests -q --tb=short -ra

Minimal agent example

import asyncio
from largestack import Agent

async def main():
    agent = Agent(
        name="assistant",
        llm="deepseek/deepseek-chat",
        instructions="Be concise and practical."
    )
    result = await agent.run("Explain Largestack in one sentence.")
    print(result.content)

asyncio.run(main())

For deterministic tests, use the built-in test/offline model patterns instead of a live cloud provider.


Live provider setup

DeepSeek:

export LARGESTACK_DEEPSEEK_API_KEY="your_key_here"
python examples/01_hello/main.py

OpenAI:

export LARGESTACK_OPENAI_API_KEY="your_key_here"
export LARGESTACK_DEFAULT_MODEL="openai/gpt-4o-mini"
python examples/01_hello/main.py

Never commit .env or paste API keys into source files.


LLM/API provider support

Largestack is provider-switchable. The core agent, workflow, RAG, guardrail, and observability layers run through a model string such as openai/gpt-4o-mini, anthropic/claude-sonnet-4-6, deepseek/deepseek-chat, litellm/groq/llama-3.1-70b-versatile, or local/llama3.2.

Recommended public claim:

Largestack supports OpenAI/GPT, Anthropic/Claude, DeepSeek, LiteLLM, Ollama/local models, and many OpenAI-compatible providers through a verified/partial capability matrix.

Do not claim every provider has identical production-grade tool calling, structured output, streaming, and cost tracking until that provider/model has passed live E2E validation.

Provider/API path Model string example Env/config Status
OpenAI / GPT openai/gpt-4o-mini LARGESTACK_OPENAI_API_KEY Verified primary adapter path
Anthropic / Claude anthropic/claude-sonnet-4-6 LARGESTACK_ANTHROPIC_API_KEY Verified native adapter path
DeepSeek deepseek/deepseek-chat LARGESTACK_DEEPSEEK_API_KEY Live E2E validated
LiteLLM gateway litellm/<provider>/<model> Provider-specific LiteLLM env vars Partial; downstream capability varies
Local OpenAI-compatible local/<model> LARGESTACK_OPENAI_COMPATIBLE_BASE_URL Partial; gateway/model capability varies
Ollama native ollama/<model> LARGESTACK_OLLAMA_BASE_URL optional Partial; chat path first
Azure OpenAI azure/<deployment> LARGESTACK_AZURE_OPENAI_KEY, LARGESTACK_AZURE_OPENAI_ENDPOINT Partial; deployment-specific
Groq, Mistral, OpenRouter, xAI, Cerebras, SambaNova, NVIDIA <provider>/<model> LARGESTACK_<PROVIDER>_API_KEY Partial/OpenAI-compatible; verify live
Google/Gemini, Cohere, Bedrock <provider>/<model> Provider env/credentials Partial; feature support differs

Inspect the runtime matrix:

python - <<'PY'
from largestack import provider_support_matrix
for row in provider_support_matrix():
    print(row["provider"], row["status"], "tools=", row["tool_calling"], "structured=", row["structured_output"])
PY

Run the provider-switchable flow demo offline:

python examples/provider_flow_demo/main.py

Run the same flow against GPT:

export LARGESTACK_OPENAI_API_KEY="your_key_here"
export LARGESTACK_DEFAULT_MODEL="openai/gpt-4o-mini"
export LARGESTACK_FLOW_DEMO_LIVE=1
python examples/provider_flow_demo/main.py

Run the same flow against Claude:

export LARGESTACK_ANTHROPIC_API_KEY="your_key_here"
export LARGESTACK_DEFAULT_MODEL="anthropic/claude-sonnet-4-6"
export LARGESTACK_FLOW_DEMO_LIVE=1
python examples/provider_flow_demo/main.py

Run the same flow against a local OpenAI-compatible endpoint:

export LARGESTACK_OPENAI_COMPATIBLE_BASE_URL="http://localhost:11434/v1"
export LARGESTACK_OPENAI_COMPATIBLE_API_KEY="ollama"
export LARGESTACK_DEFAULT_MODEL="local/llama3.2"
export LARGESTACK_FLOW_DEMO_LIVE=1
python examples/provider_flow_demo/main.py

Flow demo

The quickest workflow demo is examples/provider_flow_demo/main.py. It runs offline by default and can be switched to any configured provider by changing only LARGESTACK_DEFAULT_MODEL.

flowchart LR
    U[User task] --> I[Intake agent]
    I --> P[Planner agent]
    P --> R[Responder agent]
    R --> O[Final answer]

What the demo proves:

  • one task flows through three agents,
  • DAG dependencies control execution order,
  • each agent can use the same model string or provider family,
  • offline TestModel validation requires no API key,
  • live mode works with GPT, Claude, DeepSeek, LiteLLM, or local-compatible providers when credentials are configured.

Built-in example areas

Example Purpose
examples/00_offline_test_model.py Offline deterministic model check
examples/01_hello Basic provider-backed agent
examples/02_tools Tool calling
examples/03_team Multi-agent/team behavior
examples/04_guards Guardrails/security behavior
examples/05_rag_knowledge RAG with knowledge files
examples/06_streaming Streaming responses
examples/07_structured Structured outputs
examples/08_mcp_server MCP server pattern
examples/10_full_app Integrated app pattern
examples/provider_flow_demo Provider-switchable workflow demo
examples/rag_basic Basic RAG assistant
examples/fintech_kyc BFSI/KYC style workflow
examples/riva_ai Riva/Largestack demo pipelines

Validation status

Latest confirmed release-candidate evidence includes:

Gate Status
Ubuntu full pytest Passed
Mac validation Passed / evidence added
Windows validation Passed / clean Windows validation confirmed
DeepSeek live difficult projects 5/5 passed
Full DeepSeek integration suite Passed with one known provider-format skip
Provider support matrix Present / explicit verified-partial-adapter statuses
Offline provider flow demo Passed with TestModel
Security suite Passed
RAG eval suite Passed
Package build + twine check Passed
Docker runtime /health Passed
Helm lint/template Passed
4-hour soak evidence Passed
24-hour soak Passed / 210 successful cycles / 0 recorded failures

Architecture at a glance

flowchart TD
    U[User / API / CLI / App] --> C[CLI or SDK]
    C --> A[Agent Runtime]
    A --> W[Workflow Orchestrator]
    A --> T[Tool Registry]
    A --> M[Memory Layer]
    A --> R[RAG Layer]
    A --> G[Guardrails]
    W --> S[State / Checkpoints]
    T --> I[Integrations]
    R --> V[Vector Stores / Retrievers / Rerankers]
    G --> E[Enterprise Policies]
    A --> O[Observability]
    O --> D[Dashboard / Metrics / Traces]
    E --> AUD[Audit / RBAC / Tenant Controls]
    C --> DEP[Docker / Compose / Helm]

Repository map

Path Purpose
largestack/_core Main agent/tool/runtime primitives
largestack/_workflow Workflow graph, checkpoints, interrupts, subgraphs
largestack/_rag RAG query engines, eval, summary index
largestack/_memory Memory stores and memory tools
largestack/_guard Provider/tool guardrail policies
largestack/_security Sandbox, permissions, vault, encryption, network controls
largestack/_enterprise RBAC, audit, tenant, SSO/session, billing/payment modules
largestack/_observe Cost, traces, OTEL, telemetry helpers
largestack/_dashboard Dashboard app and APIs
largestack/_integrations Provider/tool integrations
largestack/_templates Project starter templates
examples/ Runnable examples
tests/ Unit, integration, security, RAG eval tests
scripts/ Certification, smoke, scenario, and live DeepSeek validation scripts
deploy/ Docker, Compose, Helm, monitoring assets
release_evidence/ Validation evidence and release proof

Production-positioning honesty

Largestack is strong for:

  • developer demos,
  • investor demos,
  • internal AI platform experiments,
  • controlled pilots,
  • agentic framework portfolio proof,
  • private beta deployments.

Largestack should not yet be marketed as:

  • fully BFSI-certified,
  • SOC2/ISO-certified,
  • full LangChain/LangGraph ecosystem replacement,
  • public SaaS production platform without load tests, external VAPT, and real Kubernetes install proof.

Known limitations are tracked in docs/known-limitations.md. Review that file before publishing release, SaaS, BFSI, or regulated-enterprise claims.


Roadmap

Priority Work
P0 Add load/concurrency evidence after completed 24h soak
P0 Queue/backpressure for high traffic
P0 Distributed workers and job leasing
P0 Durable replay/checkpoint recovery
P1 Real Kubernetes cluster install test
P1 Observability UI polish and replay debugger
P1 More beginner templates and tutorials
P2 Public docs website
P2 Community examples and plugin ecosystem
P3 Enterprise certifications, VAPT, compliance evidence

License

Apache-2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

largestack-1.0.1.tar.gz (618.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

largestack-1.0.1-py3-none-any.whl (693.8 kB view details)

Uploaded Python 3

File details

Details for the file largestack-1.0.1.tar.gz.

File metadata

  • Download URL: largestack-1.0.1.tar.gz
  • Upload date:
  • Size: 618.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for largestack-1.0.1.tar.gz
Algorithm Hash digest
SHA256 3cc50bc140373df78682d327c2893aa018fccf31e56701d6879d5a973aad508f
MD5 185074a54b0d91c80ba03aedfeeae884
BLAKE2b-256 7fe3c3a03016a33087c48f166732a481122780e7cd8efed0fa8387011bcf1aa7

See more details on using hashes here.

File details

Details for the file largestack-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: largestack-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 693.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for largestack-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 19fedad0d9667dbb9f2ca07ea41647642a647faa5b72db4f142a8194eafed150
MD5 eae8ab145d0ae34f8f363b3f94c25d03
BLAKE2b-256 f14b2df123d626b023c93261137087ba30af593067ab12d8249cc67a6e39f288

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page