A comprehensive LLM configuration package supporting multiple providers (OpenAI, VLLM, Gemini, Infinity) for chat assistants and embeddings

Maintainers

liux2

Project description

Langchain LLM Config

Yet another redundant Langchain abstraction: a comprehensive Python package for managing and using multiple LLM providers (OpenAI, VLLM, Gemini, Infinity) through a unified interface for both chat assistants and embeddings.

Features

🤖 Multiple Chat Providers: Support for OpenAI, VLLM, and Gemini
🔗 Multiple Embedding Providers: Support for OpenAI, VLLM, and Infinity
⚙️ Unified Configuration: Single YAML configuration file for all providers
🚀 Easy Setup: CLI tool for quick configuration initialization
🔄 Easy Context Concatenation: Simplified process for combining contexts into chat
🔒 Environment Variables: Secure API key management
📦 Self-Contained: No need to import specific paths
⚡ Async Support: Full async/await support for all operations
🌊 Streaming Chat: Real-time streaming responses for interactive experiences
🛠️ Enhanced CLI: Environment setup and validation commands

Installation

Using pip

pip install langchain-llm-config

Using uv (recommended)

uv add langchain-llm-config

Development installation

git clone https://github.com/liux2/Langchain-LLM-Config.git
cd Langchain-LLM-Config
uv sync --dev
uv run pip install -e .

Quick Start

1. Initialize Configuration

# Initialize config in current directory
llm-config init

# Or specify a custom location
llm-config init ~/.config/api.yaml

This creates an api.yaml file with all supported providers configured.

2. Set Up Environment Variables

# Set up environment variables and create .env file
llm-config setup-env

# Or with custom config path
llm-config setup-env --config-path ~/.config/.env

This creates a .env file with placeholders for your API keys.

3. Configure Your Providers

Edit the generated api.yaml file with your API keys and settings:

llm:
  openai:
    chat:
      api_base: "https://api.openai.com/v1"
      api_key: "${OPENAI_API_KEY}"
      model_name: "gpt-3.5-turbo"
      temperature: 0.7
      max_tokens: 8192
    embeddings:
      api_base: "https://api.openai.com/v1"
      api_key: "${OPENAI_API_KEY}"
      model_name: "text-embedding-ada-002"
  
  vllm:
    chat:
      api_base: "http://localhost:8000/v1"
      api_key: "${OPENAI_API_KEY}"
      model_name: "meta-llama/Llama-2-7b-chat-hf"
      temperature: 0.6
  
  default:
    chat_provider: "openai"
    embedding_provider: "openai"

4. Set Environment Variables

Edit the .env file with your actual API keys:

OPENAI_API_KEY=your-openai-api-key
GEMINI_API_KEY=your-gemini-api-key
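
If your application does not already load the .env file into the process environment, a small loader can do it before the configuration is read. The following is a minimal sketch using only the standard library; `load_env_file` is a hypothetical helper written for this example, not part of langchain-llm-config, and it handles only simple KEY=VALUE lines.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env", override: bool = False) -> dict:
    """Parse KEY=VALUE lines from a .env file and export them to os.environ."""
    loaded = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("\"'")  # drop optional surrounding quotes
        # Existing environment values win unless override is requested
        if override or key not in os.environ:
            os.environ[key] = value
        loaded[key] = value
    return loaded
```

Calling `load_env_file(".env")` at program start makes `OPENAI_API_KEY` and `GEMINI_API_KEY` available to the `${...}` placeholders in api.yaml. Variables already set in the shell are left untouched by default.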

5. Use in Your Code

Basic Usage (Synchronous)

from langchain_llm_config import create_assistant, create_embedding_provider
from pydantic import BaseModel, Field
from typing import List


# Define your response model
class ArticleAnalysis(BaseModel):
    summary: str = Field(..., description="Article summary")
    keywords: List[str] = Field(..., description="Key topics")
    sentiment: str = Field(..., description="Overall sentiment")


# Create an assistant without response model (raw text mode)
assistant = create_assistant(
    response_model=None,  # Explicitly set to None for raw text
    system_prompt="You are a helpful article analyzer.",
    provider="openai",  # or "vllm", "gemini"
    auto_apply_parser=False,
)

# Use the assistant for raw text output
print("=== Raw Text Mode ===")
result = assistant.ask("Analyze this article: ...")
print(result)

# Apply parser to the same assistant (modifies in place)
print("\n=== Applying Parser ===")
assistant.apply_parser(response_model=ArticleAnalysis)

# Now use the same assistant for structured output
print("\n=== Structured Mode ===")
result = assistant.ask("Analyze this article: ...")
print(result)

# Create an embedding provider
embedding_provider = create_embedding_provider(provider="openai")

# Get embeddings (synchronous)
texts = ["Hello world", "How are you?"]
embeddings = embedding_provider.embed_texts(texts)
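
A common next step with embeddings is comparing two texts by cosine similarity. The sketch below assumes that `embed_texts` returns one list of floats per input text; that return shape is an assumption of this example, so the code uses stand-in vectors rather than real provider output.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen in this sketch for zero vectors
    return dot / (norm_a * norm_b)


# Stand-in vectors; in practice these would be two entries of `embeddings`
print(cosine_similarity([1.0, 0.0], [1.0, 1.0]))
```

Values near 1 indicate vectors pointing in nearly the same direction, and values near 0 indicate unrelated directions.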

Advanced Usage (Asynchronous)

import asyncio


async def main():
    # Reuses `assistant`, `embedding_provider`, and `texts` from the synchronous example
    result = await assistant.ask_async("Analyze this article: ...")
    print(result["summary"])

    # Get embeddings (asynchronous)
    embeddings = await embedding_provider.embed_texts_async(texts)


asyncio.run(main())

Streaming Chat

import asyncio
from langchain_llm_config import create_chat_streaming


async def main():
    """Main async function to run the streaming chat example"""
    # Create a streaming chat assistant (this example uses the vLLM provider)
    streaming_chat = create_chat_streaming(
        provider="vllm", system_prompt="You are a helpful assistant."
    )

    print("🤖 Starting streaming chat...")
    print("Response: ", end="", flush=True)

    try:
        # Stream responses in real-time
        async for chunk in streaming_chat.chat_stream("Tell me a story"):
            if chunk["type"] == "stream":
                print(chunk["content"], end="", flush=True)
            elif chunk["type"] == "final":
                print(f"\n\nProcessing time: {chunk['processing_time']:.2f}s")
                print(f"Model used: {chunk['model_used']}")
    except Exception as e:
        print(f"\n❌ Error occurred: {e}")


if __name__ == "__main__":
    # Run the async function
    asyncio.run(main())
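
To exercise stream-handling logic without a running model server, the chunk format from the example above can be reproduced with a stub. This is a sketch under stated assumptions: `fake_chat_stream` and `collect_stream` are hypothetical helpers written for this example, and the stub only mimics the `type`, `content`, `processing_time`, and `model_used` keys shown above.

```python
import asyncio
from typing import Any, AsyncIterator, Dict


async def fake_chat_stream(text: str) -> AsyncIterator[Dict[str, Any]]:
    """Hypothetical stand-in that yields chunks shaped like the example above."""
    for word in text.split():
        await asyncio.sleep(0)  # hand control back to the event loop between chunks
        yield {"type": "stream", "content": word + " "}
    yield {"type": "final", "processing_time": 0.0, "model_used": "stub"}


async def collect_stream(stream: AsyncIterator[Dict[str, Any]]) -> Dict[str, Any]:
    """Join streamed content and keep the metadata from the final chunk."""
    parts = []
    final: Dict[str, Any] = {}
    async for chunk in stream:
        if chunk["type"] == "stream":
            parts.append(chunk["content"])
        elif chunk["type"] == "final":
            final = chunk
    return {"text": "".join(parts).strip(), "final": final}


result = asyncio.run(collect_stream(fake_chat_stream("Once upon a time")))
print(result["text"])
```

Because `collect_stream` accepts any async iterator of chunks, the same function should work with `streaming_chat.chat_stream(...)` in place of the stub, provided the real chunks carry the keys used here.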

Supported Providers

Chat Providers

Provider	Models	Features
OpenAI	GPT-3.5, GPT-4, etc.	Streaming, function calling, structured output
VLLM	Any HuggingFace model	Local deployment, high performance
Gemini	Gemini Pro, etc.	Google's latest models

Embedding Providers

Provider	Models	Features
OpenAI	text-embedding-ada-002, etc.	High quality, reliable
VLLM	BGE, sentence-transformers	Local deployment
Infinity	Various embedding models	Fast inference

CLI Commands

# Initialize a new configuration file
llm-config init [path]

# Set up environment variables and create .env file
llm-config setup-env [path] [--force]

# Validate existing configuration
llm-config validate [path]

# Show package information
llm-config info

Advanced Usage

Custom Configuration Path

from langchain_llm_config import create_assistant
from pydantic import BaseModel


class MyModel(BaseModel):  # placeholder response model
    answer: str


assistant = create_assistant(
    response_model=MyModel,
    config_path="/path/to/custom/api.yaml"
)

Context-Aware Conversations

# Add context to your queries (call from inside an async function)
result = await assistant.ask_async(
    query="What are the main points?",
    context="This is a research paper about machine learning...",
    extra_system_prompt="Focus on technical details."
)

Direct Provider Usage

from langchain_llm_config import VLLMAssistant, OpenAIEmbeddingProvider

# Use providers directly (MyModel is your own Pydantic response model)
vllm_assistant = VLLMAssistant(
    config={"api_base": "http://localhost:8000/v1", "model_name": "llama-2"},
    response_model=MyModel
)

openai_embeddings = OpenAIEmbeddingProvider(
    config={"api_key": "your-key", "model_name": "text-embedding-ada-002"}
)

Complete Example with Error Handling

import asyncio
from langchain_llm_config import create_assistant, create_embedding_provider
from pydantic import BaseModel, Field
from typing import List

class ChatResponse(BaseModel):
    message: str = Field(..., description="The assistant's response message")
    confidence: float = Field(..., description="Confidence score", ge=0.0, le=1.0)
    suggestions: List[str] = Field(default_factory=list, description="Follow-up questions")

async def main():
    try:
        # Create assistant
        assistant = create_assistant(
            response_model=ChatResponse,
            provider="openai",
            system_prompt="You are a helpful AI assistant."
        )
        
        # Chat conversation
        response = await assistant.ask_async("What is the capital of France?")
        print(f"Assistant: {response['message']}")
        print(f"Confidence: {response['confidence']:.2f}")
        
        # Create embedding provider
        embedding_provider = create_embedding_provider(provider="openai")
        
        # Get embeddings
        texts = ["Hello world", "How are you?"]
        embeddings = await embedding_provider.embed_texts_async(texts)
        print(f"Generated {len(embeddings)} embeddings")
        
    except Exception as e:
        print(f"Error: {e}")

# Run the example
asyncio.run(main())

Configuration Reference

Environment Variables

The package supports environment variable substitution in configuration:

api_key: "${OPENAI_API_KEY}"  # Will be replaced with actual value
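
The substitution idea can be illustrated with a few lines of standard-library code. This is a minimal sketch, not the package's actual implementation: `substitute_env` is a hypothetical helper, and raising an error for an unset variable is a choice made for this example, since the behaviour for missing variables is not described here.

```python
import os
import re

# Matches ${NAME} where NAME is a conventional environment variable name
_ENV_PATTERN = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def substitute_env(value, env=None):
    """Recursively replace ${NAME} placeholders in strings, lists, and dicts."""
    env = os.environ if env is None else env
    if isinstance(value, str):
        def _lookup(match):
            name = match.group(1)
            if name not in env:
                raise KeyError(f"environment variable {name} is not set")
            return env[name]
        return _ENV_PATTERN.sub(_lookup, value)
    if isinstance(value, dict):
        return {k: substitute_env(v, env) for k, v in value.items()}
    if isinstance(value, list):
        return [substitute_env(v, env) for v in value]
    return value  # numbers, booleans, and None pass through unchanged


config = {"api_key": "${OPENAI_API_KEY}", "temperature": 0.7}
print(substitute_env(config, env={"OPENAI_API_KEY": "sk-example"}))
```

Passing `env` explicitly keeps the function easy to test; omitting it falls back to the real process environment.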

Configuration Structure

llm:
  provider_name:
    chat:
      api_base: "https://api.example.com/v1"
      api_key: "${API_KEY}"
      model_name: "model-name"
      temperature: 0.7
      max_tokens: 8192
      top_p: 1.0
      connect_timeout: 60
      read_timeout: 60
      model_kwargs: {}
      # ... other parameters
    embeddings:
      api_base: "https://api.example.com/v1"
      api_key: "${API_KEY}"
      model_name: "embedding-model"
      # ... other parameters
  default:
    chat_provider: "provider_name"
    embedding_provider: "provider_name"
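
To make the role of the default block concrete, the sketch below picks the settings for a provider from a configuration that has already been parsed into a dictionary with the structure above. `resolve_provider_config` is a hypothetical helper for illustration only; how the package itself resolves defaults may differ.

```python
from typing import Optional


def resolve_provider_config(config: dict, kind: str, provider: Optional[str] = None) -> dict:
    """Return the settings block for `kind` ("chat" or "embeddings")."""
    llm = config["llm"]
    default_key = "chat_provider" if kind == "chat" else "embedding_provider"
    # An explicit provider wins; otherwise fall back to the default block
    name = provider or llm.get("default", {}).get(default_key)
    if name is None:
        raise ValueError(f"no provider given and no default {default_key} set")
    if name not in llm or kind not in llm[name]:
        raise KeyError(f"provider {name!r} has no {kind!r} section")
    return llm[name][kind]


example = {
    "llm": {
        "openai": {
            "chat": {"model_name": "gpt-3.5-turbo", "temperature": 0.7},
            "embeddings": {"model_name": "text-embedding-ada-002"},
        },
        "vllm": {"chat": {"model_name": "meta-llama/Llama-2-7b-chat-hf"}},
        "default": {"chat_provider": "openai", "embedding_provider": "openai"},
    }
}

print(resolve_provider_config(example, "chat")["model_name"])
print(resolve_provider_config(example, "chat", provider="vllm")["model_name"])
```

Requesting a section that a provider does not define, such as embeddings for the vllm entry in this example, raises a `KeyError` rather than silently falling back.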

Development

Running Tests

uv run pytest

Code Formatting

uv run black .
uv run isort .

Type Checking

uv run mypy .

Contributing

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Add tests
5. Submit a pull request

License

MIT License - see LICENSE file for details.

Release history

Version	Release date
0.3.4	Mar 16, 2026
0.3.3	Mar 16, 2026
0.3.2	Dec 22, 2025
0.3.1	Dec 20, 2025
0.2.0	Aug 8, 2025
0.1.6 (this version)	Jul 28, 2025
0.1.5	Jul 23, 2025
0.1.3	Jul 1, 2025
0.1.0	Jun 30, 2025

Download files

Source Distribution

langchain_llm_config-0.1.6.tar.gz (1.1 MB)

Uploaded Jul 28, 2025 Source

Built Distribution

langchain_llm_config-0.1.6-py3-none-any.whl (807.9 kB)

Uploaded Jul 28, 2025 Python 3

File details

Details for the file langchain_llm_config-0.1.6.tar.gz.

File metadata

Download URL: langchain_llm_config-0.1.6.tar.gz
Upload date: Jul 28, 2025
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for langchain_llm_config-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`63c7b172e626043353bd3e9c98d03544c7cd45f5d5c866812855703485caa9dd`
MD5	`702130e8e54444a22da448578beefb0c`
BLAKE2b-256	`5833507aea28e29b24f79d925331877fa291d3707c5dfcebf5d27e09394d2e29`

Provenance

The following attestation bundles were made for langchain_llm_config-0.1.6.tar.gz:

Publisher: python-publish.yml on liux2/Langchain-LLM-Config

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langchain_llm_config-0.1.6.tar.gz
- Subject digest: 63c7b172e626043353bd3e9c98d03544c7cd45f5d5c866812855703485caa9dd
- Sigstore transparency entry: 318948629
- Sigstore integration time: Jul 28, 2025
Source repository:
- Permalink: liux2/Langchain-LLM-Config@b64cf8c04459ad39001f834f2695e8efc129eea8
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/liux2
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@b64cf8c04459ad39001f834f2695e8efc129eea8
- Trigger Event: release

File details

Details for the file langchain_llm_config-0.1.6-py3-none-any.whl.

File metadata

Download URL: langchain_llm_config-0.1.6-py3-none-any.whl
Upload date: Jul 28, 2025
Size: 807.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for langchain_llm_config-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c630cf92564957fc69d00f6e5d8acd19a5ad7cb42baca3d1db0304a53c3b852a`
MD5	`bd0b58d5efb8e88b23bec342ad5a0f3f`
BLAKE2b-256	`e5899eba44bd2b7b645f53900c1a5599db55fa5c22395036eb2e8c696ade3e83`

Provenance

The following attestation bundles were made for langchain_llm_config-0.1.6-py3-none-any.whl:

Publisher: python-publish.yml on liux2/Langchain-LLM-Config

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langchain_llm_config-0.1.6-py3-none-any.whl
- Subject digest: c630cf92564957fc69d00f6e5d8acd19a5ad7cb42baca3d1db0304a53c3b852a
- Sigstore transparency entry: 318948652
- Sigstore integration time: Jul 28, 2025
Source repository:
- Permalink: liux2/Langchain-LLM-Config@b64cf8c04459ad39001f834f2695e8efc129eea8
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/liux2
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@b64cf8c04459ad39001f834f2695e8efc129eea8
- Trigger Event: release
