Flexible LLM model initialization from YAML configuration.

Langfabric

Langfabric is a flexible Python framework for managing, instantiating, and caching Large Language Model (LLM) instances from YAML configuration files.
It supports OpenAI, Azure OpenAI, Groq, Ollama, AzureML, and other providers, making LLM orchestration and deployment easy, reproducible, and robust.

Features

Declarative YAML model configs (with secret support via seyaml)
Multiple provider support: OpenAI, Azure OpenAI, Groq, Ollama, AzureML, and more
Thread-safe model caching
Runtime overrides: temperature, max tokens, etc.
Parallel/preloaded model initialization
Automatic LangChain model rebuild
Automatic caching in ModelManager
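
To make "thread-safe model caching" concrete, here is a minimal sketch of the general technique. It is not Langfabric's actual implementation: the ThreadSafeCache class and the fake_build function are hypothetical names used only for illustration. The idea is that a lock guards the lookup-and-build step, so concurrent requests for the same key construct the model only once.

```python
import threading
from typing import Any, Callable, Dict, Hashable


class ThreadSafeCache:
    """Hypothetical cache that builds each value at most once per key."""

    def __init__(self, factory: Callable[[Hashable], Any]) -> None:
        self._factory = factory
        self._lock = threading.Lock()
        self._items: Dict[Hashable, Any] = {}

    def get(self, key: Hashable) -> Any:
        # Holding the lock during construction keeps the sketch simple:
        # concurrent callers for the same key never build twice.
        with self._lock:
            if key not in self._items:
                self._items[key] = self._factory(key)
            return self._items[key]

    def __len__(self) -> int:
        with self._lock:
            return len(self._items)


build_calls = []


def fake_build(name):
    # Stand-in for an expensive model constructor (assumption for the demo).
    build_calls.append(name)
    return {"name": name}


cache = ThreadSafeCache(fake_build)
threads = [threading.Thread(target=cache.get, args=("gpt4o",)) for _ in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

A single global lock serializes all builds. A real cache might use one lock per key so that different models can be built concurrently.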

Installation

pip install langfabric

Or clone the repo and install locally:

git clone https://github.com/grakn/langfabric.git
cd langfabric
pip install -e .

Example: Model Configuration YAML

# models.yaml
- name: gpt4o
  provider: azure_openai
  model: gpt-4o
  deployment_name: gpt-4o-deployment
  api_key: !env AZURE_OPENAI_API_KEY
  endpoint: https://your-endpoint.openai.azure.com
  api_version: 2024-06-01-preview
  max_tokens: 4096
  temperature: 0.1

- name: llama3
  provider: ollama
  model: llama3
  max_tokens: 4096

Usage

1. Load model configs

from langfabric.loader import load_model_configs

model_configs = load_model_configs("./models.yaml")

2. Build and cache models

from langfabric.manager import ModelManager

manager = ModelManager(model_configs)
model = manager.load("gpt4o")  # Get Azure OpenAI GPT-4o model

3. Optionally preload all models in parallel (multi-threaded)

manager.preload_all()  # Warms up cache in threads for all configs
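
For readers curious what a multi-threaded warm-up can look like, the following is a minimal sketch using the standard library's ThreadPoolExecutor. It only illustrates the pattern and makes no claim about how preload_all() is implemented internally; preload_in_parallel and stub_build are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable


def preload_in_parallel(
    names: Iterable[str],
    build: Callable[[str], object],
    max_workers: int = 4,
) -> Dict[str, object]:
    """Build every named model in a thread pool and return them by name."""
    names = list(names)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so zip pairs each name with its result.
        return dict(zip(names, pool.map(build, names)))


def stub_build(name: str) -> str:
    # Stand-in for a real model constructor (assumption for the demo).
    return f"model:{name}"


loaded = preload_in_parallel(["gpt4o", "llama3"], stub_build)
```

Threads suit this workload because model construction is typically dominated by network and I/O waits rather than CPU work.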

4. Use runtime parameter overrides

custom_model = manager.load(
    "gpt4o",
    temperature=0.5,
    max_tokens=2048,
    json_response=True,
    streaming=False,
)
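
Conceptually, a runtime override is a merge of keyword arguments over the base configuration, and a cache that respects overrides needs a key that includes them. The sketch below illustrates both ideas under stated assumptions: apply_overrides and cache_key are hypothetical helpers, not part of Langfabric's API, and override values are assumed to be hashable.

```python
from typing import Any, Dict, Tuple


def apply_overrides(base: Dict[str, Any], **overrides: Any) -> Dict[str, Any]:
    # Later keys win, and the base config is left untouched.
    return {**base, **overrides}


def cache_key(name: str, **overrides: Any) -> Tuple:
    # Sorting makes the key independent of keyword order.
    return (name, tuple(sorted(overrides.items())))


base = {"name": "gpt4o", "temperature": 0.1, "max_tokens": 4096}
effective = apply_overrides(base, temperature=0.5, max_tokens=2048)
```

With a key like this, a model requested with temperature=0.5 would be cached separately from the same model requested with its default settings.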

5. Get the number of loaded models

manager.active()

Advanced

Load model configs with secrets

from langfabric.loader import load_model_configs

secrets = {"api_key": "sk-..."}
model_configs = load_model_configs("./models/model.yaml", secrets)

Use the !secret pre-processor to reference secrets by name

# models.yaml
- name: gpt4o
  provider: azure_openai
  model: gpt-4o
  deployment_name: gpt-4o-deployment
  api_key: !secret api_key
  endpoint: https://your-endpoint.openai.azure.com
  api_version: 2024-06-01-preview
  max_tokens: 4096
  temperature: 0.1
- name: ollama
  provider: ollama
  model: llama3
  max_tokens: 4096
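
The substitution that a !secret tag implies can be sketched as a recursive walk over the parsed configuration. This is an illustration only, not the loader's real mechanism: SecretRef is a hypothetical placeholder standing in for whatever object the YAML parser produces for a tagged value, and resolve_secrets is a hypothetical helper.

```python
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class SecretRef:
    """Hypothetical placeholder for a value tagged `!secret <name>`."""

    name: str


def resolve_secrets(node: Any, secrets: Mapping[str, str]) -> Any:
    """Return a copy of `node` with every SecretRef replaced by its value."""
    if isinstance(node, SecretRef):
        # Fail loudly rather than pass an empty credential to a provider.
        if node.name not in secrets:
            raise KeyError(f"missing secret: {node.name}")
        return secrets[node.name]
    if isinstance(node, dict):
        return {k: resolve_secrets(v, secrets) for k, v in node.items()}
    if isinstance(node, list):
        return [resolve_secrets(v, secrets) for v in node]
    return node


parsed = [
    {"name": "gpt4o", "api_key": SecretRef("api_key")},
    {"name": "ollama", "model": "llama3"},
]
resolved = resolve_secrets(parsed, {"api_key": "sk-test"})
```

Keeping secrets in a separate mapping means the YAML file itself can be committed without credentials.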

Load multiple model config files with secrets

from langfabric.loader import load_model_configs

secrets = {"api_key": "sk-..."}
model_configs = load_model_configs(["./models/models1.yaml", "./models/models2.yaml"], secrets)

Load multiple model config files from directories

from langfabric.loader import load_model_configs

secrets = {"api_key": "sk-..."}
model_configs = load_model_configs(["./models1/", "./models2/"], secrets)
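
Accepting a mix of files and directories usually comes down to expanding each path into a list of YAML files before parsing. The sketch below shows one way to do that with pathlib. It is an assumption about the general approach, not a description of what load_model_configs does: collect_yaml_files is a hypothetical helper, and the non-recursive scan and the .yaml/.yml suffix filter are choices made for the example.

```python
import tempfile
from pathlib import Path
from typing import Iterable, List, Union


def collect_yaml_files(paths: Union[str, Iterable[str]]) -> List[Path]:
    """Expand files and directories into a flat list of YAML file paths."""
    if isinstance(paths, str):
        paths = [paths]
    found: List[Path] = []
    for raw in paths:
        path = Path(raw)
        if path.is_dir():
            # Sort for a deterministic load order within each directory.
            found.extend(
                sorted(
                    p
                    for p in path.iterdir()
                    if p.is_file() and p.suffix in {".yaml", ".yml"}
                )
            )
        else:
            # Explicit file paths are passed through unchanged.
            found.append(path)
    return found


# Demo on a temporary directory holding two YAML files and one unrelated file.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "b.yaml").write_text("[]")
    (Path(tmp) / "a.yml").write_text("[]")
    (Path(tmp) / "notes.txt").write_text("ignored")
    names = [p.name for p in collect_yaml_files([tmp, "single.yaml"])]
```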

📘 Simple Example: Using `langfabric` with OpenAI

This example demonstrates how to load a model configuration from YAML and run an asynchronous prompt chain.

1. Create a `models.yaml` file

- name: gpt4o-mini
  provider: openai
  model: o4-mini
  api_key: !env OPENAI_API_KEY
  max_tokens: 4096
  streaming: True

2. Export Your API Key

Before running the example, make sure your OpenAI API key is available as an environment variable:

export OPENAI_API_KEY=your_openai_key_here

3. Run the Example Script

import asyncio
from langchain_core.prompts import ChatPromptTemplate
from langfabric import load_model_configs, build_model

async def main():
    # Load model configuration from YAML
    cfg = load_model_configs(["models.yaml"])

    # Build the model instance using the configuration
    llm = build_model(cfg["gpt4o-mini"])

    # Define a structured prompt with system and user roles
    prompt = ChatPromptTemplate.from_messages([
        (
            "system",
            "You are a helpful assistant answering questions about the region {region_name}. Provide short and clear answers.",
        ),
        ("human", "{input}"),
    ])

    # Create a prompt-model chain
    chain = prompt | llm

    # Execute the chain with input values
    output = await chain.ainvoke({
        "region_name": "Bay Area",
        "input": "How many people are living there?",
    })

    # Print the model output
    print(output.content)

if __name__ == "__main__":
    asyncio.run(main())

Example Output

The Bay Area has approximately 7.7 million people.

Release history

0.1.8 (Jun 26, 2025), this version
0.1.6 (Jun 23, 2025)
0.1.5 (Jun 22, 2025)
