Infrastructure for efficient and scalable AI applications.

These details have not been verified by PyPI

Project links

Project description

ai-infra

Infrastructure for efficient and scalable AI applications: clean LLM interfaces, composable graphs, and MCP client/server utilities. Batteries-included quickstarts help you ship fast.

LLM: simple chat, agents with tools, streaming, retries, structured output, HITL hooks
Graph: small-to-large workflows using LangGraph with typed state and tracing
MCP: multi-server client, tool discovery, OpenMCP (OpenAPI-like) doc generation

Install

Python: 3.11 – 3.13
Package manager: Poetry (recommended) or pip

Using Poetry (dev):

poetry install
poetry shell

Using pip (library use):

pip install ai-infra

Configure providers (env)

Create a .env (or export in your shell) with any providers you plan to use.

# OpenAI
export OPENAI_API_KEY=...
# Anthropic
export ANTHROPIC_API_KEY=...
# Google Generative AI
export GOOGLE_API_KEY=...
# xAI
export XAI_API_KEY=...

Optional: MCP HTTP headers for servers you call through the client.

export MCP_AUTH_TOKEN=...

Quickstarts

Below are tiny copy/paste snippets and how to run included examples.

LLM: chat (sync)

from ai_infra.llm import LLM, Providers

llm = LLM()
resp = llm.chat(
    user_msg="One fun fact about the moon?",
    system="You are concise.",
    provider=Providers.openai,
    model_name="gpt-4o",
)
print(resp)

Run the included example (calls a main() function):

python -c "from ai_infra.llm.examples.02_llm_chat_basic import main; main()"

LLM: agent (tools, sync)

from ai_infra.llm import Agent, Providers

agent = Agent()
resp = agent.run_agent(
    messages=[{"role": "user", "content": "Introduce yourself in one sentence."}],
    provider=Providers.openai,
    model_name="gpt-4o",
    model_kwargs={"temperature": 0.7},
)
print(getattr(resp, "content", resp))

Run the included example:

python -c "from ai_infra.llm.examples.01_agent_basic import main; main()"

LLM: token streaming (async)

import asyncio
from ai_infra.llm import LLM, Providers

async def demo():
    llm = LLM()
    async for token, meta in llm.stream_tokens(
        "Stream one short paragraph about Mars.",
        provider=Providers.openai,
        model_name="gpt-4o",
    ):
        print(token, end="", flush=True)

asyncio.run(demo())

See more examples in src/ai_infra/llm/examples:

03_structured_output.py, 04_agent_stream.py, 05_tool_controls.py, 06_hitl.py, 07_retry.py, 08_agent_stream_tokens.py, 09_chat_stream.py

Graph: minimal state machine

from typing_extensions import TypedDict
from langgraph.graph import END
from ai_infra.graph.core import Graph
from ai_infra.graph.models import Edge, ConditionalEdge

class MyState(TypedDict):
    value: int

def inc(s: MyState) -> MyState:
    s["value"] += 1
    return s

def mul(s: MyState) -> MyState:
    s["value"] *= 2
    return s

graph = Graph(
    state_type=MyState,
    node_definitions=[inc, mul],
    edges=[
        Edge(start="inc", end="mul"),
        ConditionalEdge(
            start="mul", router_fn=lambda s: "inc" if s["value"] < 40 else END, targets=["inc", END]
        ),
    ],
)

print(graph.run({"value": 1}))

Run the included example:

python -c "from ai_infra.graph.examples.01_graph_basic import main; main()"

MCP: multi-server client

import asyncio
from ai_infra.mcp.client.core import MCPClient

async def main():
    client = MCPClient([
        {"transport": "streamable_http", "url": "http://127.0.0.1:8000/api/mcp", "headers": {"Authorization": "Bearer $MCP_AUTH_TOKEN"}},
        # {"transport": "stdio", "command": "./your-mcp-server", "args": []},
        # {"transport": "sse", "url": "http://127.0.0.1:8001/sse"},
    ])

    await client.discover()
    tools = await client.list_tools()
    print("Discovered tools:", tools)

    docs = await client.get_openmcp()  # or client.get_openmcp("your_server_name")
    print("OpenMCP doc keys:", list(docs.keys()))

asyncio.run(main())

Run the included example:

python -m ai_infra.mcp.examples.01_mcps

Running all quickstarts

If you prefer a single runner command, add a tiny script like this locally:

# quickstart.py
import sys

M = {
    "llm_agent_basic": "ai_infra.llm.examples.01_agent_basic:main",
    "llm_chat_basic": "ai_infra.llm.examples.02_llm_chat_basic:main",
    "graph_basic": "ai_infra.graph.examples.01_graph_basic:main",
    "mcp_discover": "ai_infra.mcp.examples.01_mcps:__main__",
}

if __name__ == "__main__":
    key = sys.argv[1]
    mod, _, func = M[key].partition(":")
    if func == "__main__":
        import runpy; runpy.run_module(mod, run_name="__main__")
    else:
        mod = __import__(mod, fromlist=[func])
        getattr(mod, func)()

Run:

python quickstart.py llm_chat_basic
python quickstart.py graph_basic
python quickstart.py llm_agent_basic
python quickstart.py mcp_discover

MCP server config examples

Add entries like these to your Copilot MCP config (e.g., ~/.config/github-copilot/intellij/mcp.json):

{
  "servers": {
    "stdio-publisher-mcp": {
      "command": "npx",
      "args": [
        "-y",
        "--package=github:Aliikhatami94/ai-infra",
        "stdio-publisher-mcp"
      ]
    }
  }
}

Tip:

If you want to pin a specific ref (branch, tag, commit), set AI_INFRA_REF in your environment before launching the IDE.

Testing and quality

Unit tests: pytest
- pytest -q
Lint: ruff
- ruff check src tests
Types: mypy
- mypy src

Tip: add a test_examples.py that imports and runs the example main() functions to smoke test provider wiring without hitting network (use mocks).

Project layout

src/ai_infra/llm: core LLM and Agent APIs, providers, tools, and utils
src/ai_infra/graph: Graph wrapper, typed models, and utilities
src/ai_infra/mcp: MCP client, examples, and server stubs
tests: add your unit/integration tests here

Notes and roadmap

Providers: OpenAI, Anthropic, Google GenAI, xAI (via langchain providers)
Features include structured output, retries, fallbacks, streaming, and tool call controls
MCP doc generation (OpenMCP) is available via MCPClient.get_openmcp()
Nice-to-haves: add a simple example runner module; more test coverage around examples and MCP flows

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.18.0

May 6, 2026

1.17.0

May 4, 2026

1.16.0

Apr 26, 2026

1.15.0

Mar 29, 2026

1.14.0

Mar 17, 2026

1.13.0

Mar 15, 2026

1.12.0

Mar 15, 2026

1.11.0

Mar 15, 2026

1.10.0

Mar 14, 2026

1.9.1

Mar 3, 2026

1.9.0

Mar 3, 2026

1.8.1

Feb 2, 2026

1.8.0

Jan 23, 2026

1.7.0

Jan 19, 2026

1.6.0

Jan 17, 2026

1.5.2

Jan 12, 2026

1.5.1

Jan 5, 2026

1.5.0

Jan 5, 2026

1.4.0

Jan 5, 2026

1.3.0

Jan 5, 2026

1.2.0

Jan 4, 2026

1.1.2

Jan 3, 2026

1.1.1

Dec 31, 2025

1.1.0

Dec 30, 2025

1.0.3

Dec 28, 2025

1.0.2

Dec 28, 2025

1.0.1

Dec 28, 2025

1.0.0

Dec 28, 2025

0.1.171

Dec 28, 2025

0.1.170

Dec 28, 2025

0.1.169

Dec 28, 2025

0.1.168

Dec 27, 2025

0.1.167

Dec 27, 2025

0.1.166

Dec 26, 2025

0.1.165

Dec 24, 2025

0.1.164

Dec 22, 2025

0.1.163

Dec 19, 2025

0.1.162

Dec 18, 2025

0.1.161

Dec 18, 2025

0.1.160

Dec 18, 2025

0.1.159

Dec 18, 2025

0.1.158

Dec 18, 2025

0.1.157

Dec 18, 2025

0.1.156

Dec 18, 2025

0.1.155

Dec 17, 2025

0.1.154

Dec 17, 2025

0.1.153

Dec 17, 2025

0.1.152

Dec 17, 2025

0.1.151

Dec 17, 2025

0.1.150

Dec 16, 2025

0.1.149

Dec 16, 2025

0.1.148

Dec 16, 2025

0.1.147

Dec 15, 2025

0.1.146

Dec 15, 2025

0.1.145

Dec 14, 2025

0.1.144

Dec 14, 2025

0.1.143

Dec 14, 2025

0.1.142

Dec 14, 2025

0.1.141

Dec 14, 2025

0.1.140

Dec 13, 2025

0.1.139

Dec 12, 2025

0.1.138

Dec 11, 2025

0.1.137

Dec 10, 2025

0.1.136

Dec 10, 2025

0.1.135

Dec 10, 2025

0.1.134

Dec 10, 2025

0.1.133

Dec 9, 2025

0.1.132

Dec 9, 2025

0.1.131

Dec 9, 2025

0.1.130

Dec 8, 2025

0.1.129

Dec 8, 2025

0.1.128

Dec 8, 2025

0.1.127

Dec 7, 2025

0.1.126

Dec 7, 2025

0.1.125

Dec 7, 2025

0.1.124

Dec 6, 2025

0.1.123

Dec 6, 2025

0.1.122

Dec 6, 2025

0.1.121

Dec 4, 2025

0.1.120

Dec 4, 2025

0.1.119

Dec 4, 2025

0.1.118

Dec 4, 2025

0.1.117

Dec 3, 2025

0.1.116

Dec 3, 2025

0.1.115

Dec 3, 2025

0.1.114

Dec 3, 2025

0.1.112

Dec 2, 2025

0.1.111

Dec 2, 2025

0.1.109

Dec 2, 2025

0.1.108

Dec 1, 2025

0.1.107

Dec 1, 2025

0.1.106

Dec 1, 2025

0.1.105

Dec 1, 2025

0.1.104

Dec 1, 2025

0.1.103

Dec 1, 2025

0.1.102

Nov 30, 2025

0.1.101

Nov 30, 2025

0.1.100

Nov 30, 2025

0.1.99

Nov 30, 2025

0.1.98

Nov 29, 2025

0.1.97

Nov 29, 2025

0.1.96

Nov 28, 2025

0.1.95

Nov 28, 2025

0.1.94

Nov 28, 2025

0.1.93

Nov 28, 2025

0.1.92

Nov 28, 2025

0.1.91

Nov 28, 2025

0.1.90

Nov 28, 2025

0.1.89

Nov 27, 2025

0.1.88

Nov 27, 2025

0.1.87

Nov 27, 2025

0.1.86

Nov 27, 2025

0.1.85

Nov 27, 2025

0.1.84

Nov 27, 2025

0.1.83

Nov 27, 2025

0.1.82

Nov 26, 2025

0.1.81

Nov 26, 2025

0.1.80

Nov 26, 2025

0.1.79

Nov 26, 2025

0.1.78

Nov 26, 2025

0.1.77

Nov 26, 2025

0.1.76

Nov 26, 2025

This version

0.1.75

Nov 26, 2025

0.1.74

Nov 26, 2025

0.1.73

Nov 26, 2025

0.1.72

Nov 26, 2025

0.1.71

Nov 26, 2025

0.1.70

Nov 13, 2025

0.1.69

Nov 8, 2025

0.1.68

Nov 7, 2025

0.1.67

Sep 15, 2025

0.1.66

Sep 8, 2025

0.1.65

Sep 8, 2025

0.1.64

Sep 8, 2025

0.1.63

Sep 8, 2025

0.1.62

Sep 7, 2025

0.1.61

Sep 7, 2025

0.1.60

Sep 7, 2025

0.1.59

Sep 6, 2025

0.1.58

Sep 3, 2025

0.1.57

Sep 3, 2025

0.1.56

Sep 3, 2025

0.1.55

Sep 3, 2025

0.1.54

Sep 3, 2025

0.1.53

Sep 3, 2025

0.1.52

Sep 3, 2025

0.1.51

Sep 3, 2025

0.1.50

Sep 3, 2025

0.1.49

Sep 3, 2025

0.1.48

Sep 3, 2025

0.1.47

Sep 3, 2025

0.1.46

Sep 3, 2025

0.1.45

Sep 3, 2025

0.1.44

Sep 3, 2025

0.1.43

Sep 3, 2025

0.1.42

Sep 2, 2025

0.1.41

Sep 2, 2025

0.1.40

Sep 2, 2025

0.1.39

Sep 2, 2025

0.1.38

Sep 2, 2025

0.1.37

Sep 2, 2025

0.1.36

Sep 2, 2025

0.1.35

Sep 2, 2025

0.1.34

Sep 1, 2025

0.1.33

Sep 1, 2025

0.1.32

Aug 29, 2025

0.1.31

Aug 29, 2025

0.1.30

Aug 29, 2025

0.1.29

Aug 29, 2025

0.1.28

Aug 28, 2025

0.1.27

Aug 28, 2025

0.1.26

Aug 28, 2025

0.1.25

Aug 28, 2025

0.1.24

Aug 28, 2025

0.1.23

Aug 28, 2025

0.1.22

Aug 28, 2025

0.1.21

Aug 27, 2025

0.1.20

Aug 27, 2025

0.1.19

Aug 27, 2025

0.1.18

Aug 27, 2025

0.1.17

Aug 27, 2025

0.1.16

Aug 26, 2025

0.1.15

Aug 26, 2025

0.1.14

Aug 26, 2025

0.1.13

Aug 25, 2025

0.1.12

Aug 25, 2025

0.1.11

Aug 25, 2025

0.1.10

Aug 25, 2025

0.1.9

Aug 25, 2025

0.1.7

Aug 25, 2025

0.1.6

Aug 25, 2025

0.1.5

Aug 24, 2025

0.1.3

Aug 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_infra-0.1.75.tar.gz (98.7 kB view details)

Uploaded Nov 26, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai_infra-0.1.75-py3-none-any.whl (131.1 kB view details)

Uploaded Nov 26, 2025 Python 3

File details

Details for the file ai_infra-0.1.75.tar.gz.

File metadata

Download URL: ai_infra-0.1.75.tar.gz
Upload date: Nov 26, 2025
Size: 98.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ai_infra-0.1.75.tar.gz
Algorithm	Hash digest
SHA256	`16c96022c2663baefec87d21c18be587d3625850d321cc4dd296ee2cec3d07c0`
MD5	`fb95aa5d296b3594dfa066a67171bdfe`
BLAKE2b-256	`7dcb36b5472b39caccb499fa8bd5b2df23b6c93adb232920f823600fd070a4b3`

See more details on using hashes here.

File details

Details for the file ai_infra-0.1.75-py3-none-any.whl.

File metadata

Download URL: ai_infra-0.1.75-py3-none-any.whl
Upload date: Nov 26, 2025
Size: 131.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ai_infra-0.1.75-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c49de1c346d6b0d18e51cec5a844245f3cd7b855b2f76c7a6fe012f7f526d450`
MD5	`b1e1879e8d7af9784e7f0245c053668b`
BLAKE2b-256	`3ff7b64537b35e7edf7f6a943a0469829460a7f71ca7878bdf1639cfc2cc6dc3`

See more details on using hashes here.

ai-infra 0.1.75

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ai-infra

Install

Configure providers (env)

Quickstarts

LLM: chat (sync)

LLM: agent (tools, sync)

LLM: token streaming (async)

Graph: minimal state machine

MCP: multi-server client

Running all quickstarts

MCP server config examples

Testing and quality

Project layout

Notes and roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes