Generative AI tools from ToolForge-AI organization

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Project description

genai-forge

Lightweight, provider-agnostic utilities to call LLMs and parse their outputs with Pydantic. Build once, run on any LLM provider.

Features

🔄 Multi-Provider Support: OpenAI, Anthropic (Claude), Google (Gemini), Mistral AI, Cohere
🔗 Composable Chains: Pipe operators for elegant prompt → LLM → parser workflows
✅ Type-Safe Parsing: Validate LLM outputs with Pydantic models
📦 Minimal Core: Only install what you need with optional provider dependencies
🔧 Integration Ready: Seamless integration with prompting-forge for prompt versioning

Installation

Basic Installation (OpenAI only)

pip install genai-forge

With Specific Providers

# Anthropic (Claude)
pip install genai-forge[anthropic]

# Google (Gemini)
pip install genai-forge[google]

# Mistral AI
pip install genai-forge[mistral]

# Cohere
pip install genai-forge[cohere]

# All providers
pip install genai-forge[all]

Requirements

Python 3.10+
API keys for your chosen provider(s)

Quick Start

1. Set Up Environment Variables

Create a .env file in your project root:

# OpenAI
OPENAI_API_KEY=sk-...

# Anthropic (Claude)
ANTHROPIC_API_KEY=sk-ant-...

# Google (Gemini)
GOOGLE_API_KEY=AIza...

# Mistral AI
MISTRAL_API_KEY=...

# Cohere
COHERE_API_KEY=...

genai-forge automatically loads .env files via python-dotenv.

2. Basic Usage

from genai_forge import get_llm
from prompting_forge.prompting import PromptTemplate

# Create a prompt template
template = PromptTemplate(
    system="You are a concise expert assistant.",
    template="Generate one actionable tip.\nAudience: {audience}\nTime: {time}",
)

# Create an LLM (choose your provider)
llm = get_llm("openai:gpt-4o-mini", temperature=0.2)

# Chain: query | template | llm
query = "Provide a short productivity tip."
chain = query | template | llm
result = chain({"audience": "Backend Python developer", "time": "30 minutes"})
print(result)

Supported Providers & Models

OpenAI

# GPT-4o models
llm = get_llm("openai:gpt-4o", temperature=0.3)
llm = get_llm("openai:gpt-4o-mini", temperature=0.3)

# GPT-4 models
llm = get_llm("openai:gpt-4-turbo", temperature=0.3)
llm = get_llm("openai:gpt-4", temperature=0.3)

# GPT-3.5 models
llm = get_llm("openai:gpt-3.5-turbo", temperature=0.3)

Environment Variable: OPENAI_API_KEY

Anthropic (Claude)

# Claude 3.5 models
llm = get_llm("anthropic:claude-3-5-sonnet-20241022", temperature=0.3)
llm = get_llm("anthropic:claude-3-5-haiku-20241022", temperature=0.3)

# Claude 3 models
llm = get_llm("anthropic:claude-3-opus-20240229", temperature=0.3)
llm = get_llm("anthropic:claude-3-sonnet-20240229", temperature=0.3)
llm = get_llm("anthropic:claude-3-haiku-20240307", temperature=0.3)

Environment Variable: ANTHROPIC_API_KEY

Google (Gemini)

# Gemini 2.0 models
llm = get_llm("google:gemini-2.0-flash-exp", temperature=0.3)

# Gemini 1.5 models
llm = get_llm("google:gemini-1.5-pro", temperature=0.3)
llm = get_llm("google:gemini-1.5-flash", temperature=0.3)
llm = get_llm("google:gemini-1.5-flash-8b", temperature=0.3)

Environment Variable: GOOGLE_API_KEY

Mistral AI

# Mistral models
llm = get_llm("mistral:mistral-large-latest", temperature=0.3)
llm = get_llm("mistral:mistral-medium-latest", temperature=0.3)
llm = get_llm("mistral:mistral-small-latest", temperature=0.3)

# Open models
llm = get_llm("mistral:open-mistral-7b", temperature=0.3)
llm = get_llm("mistral:open-mixtral-8x7b", temperature=0.3)
llm = get_llm("mistral:open-mixtral-8x22b", temperature=0.3)

Environment Variable: MISTRAL_API_KEY

Cohere

# Command R models
llm = get_llm("cohere:command-r-plus", temperature=0.3)
llm = get_llm("cohere:command-r", temperature=0.3)

# Command models
llm = get_llm("cohere:command", temperature=0.3)
llm = get_llm("cohere:command-light", temperature=0.3)

Environment Variable: COHERE_API_KEY

Parsing Structured Outputs with Pydantic

Use PydanticOutputParser to have the LLM return valid JSON validated into a Pydantic model. Format instructions are automatically injected into your prompt.

from typing import List
from pydantic import BaseModel
from genai_forge import get_llm, PydanticOutputParser
from prompting_forge.prompting import PromptTemplate

class CityPlan(BaseModel):
    city: str
    attractions: List[str]
    days: int

template = PromptTemplate(
    system="You are a helpful travel planner.",
    template="Create a city plan.\nCity: {city}\nDays: {days}",
)

# Use any provider you want
llm = get_llm("anthropic:claude-3-5-haiku-20241022", temperature=0.1)
parser = PydanticOutputParser(CityPlan)

# Chain: query | template | llm | parser
query = "Create a 3-day city plan for Tokyo."
chain = query | template | llm | parser
result = chain({"city": "Tokyo", "days": 3})  # -> CityPlan instance
print(f"City: {result.city}")
print(f"Days: {result.days}")
print(f"Attractions: {', '.join(result.attractions)}")

How Parsing Works

PydanticOutputParser:

Accepts tolerant output formats (e.g., extra text or ```json fences)
Extracts JSON from the LLM response
Validates against your Pydantic model
Automatically injects format instructions when used in a chain

API surface:

from genai_forge import PydanticOutputParser, BaseOutputParser, OutputParserException

parser = PydanticOutputParser(YourModel)
instructions = parser.get_format_instructions()  # JSON schema for the LLM
validated_obj = parser.parse(llm_output_text)    # Parsed & validated model

Chaining with the Pipe Operator

The | operator builds elegant pipelines:

# Simple chain
chain = template | llm

# With parser
chain = template | llm | parser

# With query
chain = query | template | llm | parser

# Execute
result = chain(context_variables)

What happens:

query + template → renders prompt with context variables
llm → sends prompt to LLM provider
parser → validates and parses response (format instructions auto-injected)

Embedding Models

genai-forge also supports embedding models for generating vector representations of text.

Basic Embedding Usage

from genai_forge import get_embedding

# Create an embedding model
embedding = get_embedding("openai:text-embedding-3-small")

# Embed a single text
vector = embedding("Hello, world!")
print(f"Embedding dimension: {len(vector)}")

# Embed multiple texts
texts = ["First document", "Second document", "Third document"]
vectors = embedding(texts)
print(f"Number of vectors: {len(vectors)}")

Supported Embedding Models

OpenAI

# Latest V3 models
emb = get_embedding("openai:text-embedding-3-large")  # 3072 dimensions
emb = get_embedding("openai:text-embedding-3-small")  # 1536 dimensions

# Legacy V2 model
emb = get_embedding("openai:text-embedding-ada-002")  # 1536 dimensions

Environment Variable: OPENAI_API_KEY

Google (Gemini)

emb = get_embedding("google:text-embedding-004")  # 768 dimensions
emb = get_embedding("google:embedding-001")       # 768 dimensions (legacy)

Environment Variable: GOOGLE_API_KEY

Mistral AI

emb = get_embedding("mistral:mistral-embed")  # 1024 dimensions

Environment Variable: MISTRAL_API_KEY

Cohere

# Standard models
emb = get_embedding("cohere:embed-english-v3.0")       # 1024 dimensions
emb = get_embedding("cohere:embed-multilingual-v3.0")  # 1024 dimensions

# Lightweight models
emb = get_embedding("cohere:embed-english-light-v3.0")       # 384 dimensions
emb = get_embedding("cohere:embed-multilingual-light-v3.0")  # 384 dimensions

Environment Variable: COHERE_API_KEY

Embedding Use Cases

Semantic Search

from genai_forge import get_embedding
import numpy as np

# Initialize embedding model
embedding = get_embedding("openai:text-embedding-3-small")

# Documents to search
documents = [
    "Python is a programming language",
    "Machine learning uses algorithms",
    "Natural language processing analyzes text",
]

# Embed documents
doc_vectors = embedding(documents)

# Query
query = "What is NLP?"
query_vector = embedding(query)

# Compute cosine similarities
def cosine_similarity(a, b):
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

similarities = [cosine_similarity(query_vector, doc) for doc in doc_vectors]

# Find most similar document
best_idx = np.argmax(similarities)
print(f"Most relevant: {documents[best_idx]}")
print(f"Similarity: {similarities[best_idx]:.4f}")

Document Clustering

from genai_forge import get_embedding
from sklearn.cluster import KMeans

embedding = get_embedding("openai:text-embedding-3-small")

docs = [
    "Python programming tutorial",
    "Java development guide",
    "Cooking recipes for beginners",
    "Advanced Python techniques",
    "Italian cuisine recipes",
]

# Get embeddings
vectors = embedding(docs)

# Cluster
kmeans = KMeans(n_clusters=2, random_state=0)
labels = kmeans.fit_predict(vectors)

for doc, label in zip(docs, labels):
    print(f"Cluster {label}: {doc}")

Multi-Provider Comparison Example

Compare outputs from different providers on the same prompt:

from genai_forge import get_llm
from prompting_forge.prompting import PromptTemplate

template = PromptTemplate(
    system="You are a creative writer.",
    template="Write a haiku about {topic}.",
)

providers = [
    "openai:gpt-4o-mini",
    "anthropic:claude-3-5-haiku-20241022",
    "google:gemini-1.5-flash",
    "mistral:mistral-small-latest",
    "cohere:command-light",
]

context = {"topic": "artificial intelligence"}

for provider_model in providers:
    try:
        llm = get_llm(provider_model, temperature=0.7)
        chain = template | llm
        result = chain(context)
        print(f"\n{provider_model}:")
        print(result)
    except Exception as e:
        print(f"\n{provider_model}: ERROR - {e}")

Advanced Usage: LLMCall with Versioning

LLMCall provides advanced features like prompt versioning and call logging:

from genai_forge import get_llm
from genai_forge.llm import LLMCall
from prompting_forge.prompting import PromptTemplate

template = PromptTemplate(
    system="You are a helpful assistant.",
    template="Explain {concept} in simple terms.",
)

llm = get_llm("openai:gpt-4o-mini")

# Create an LLMCall with versioning
call = LLMCall(
    query="Explain the concept clearly",
    prompt_template=template,
    client=llm,
    name="explainer_assistant",
    enable_versioning=True,  # Saves call records to .llm_call/
)

# Execute
rendered_prompt, response = call.run({"concept": "quantum computing"})

print("Rendered:", rendered_prompt)
print("Response:", response)

Call records are saved to .llm_call/{name}/{timestamp}.json with full request/response details.

Integration with prompting-forge

genai-forge works seamlessly with prompting-forge for prompt versioning and synthesis:

from prompting_forge.prompting import PromptTemplate, FinalPromptTemplate
from genai_forge import get_llm
from genai_forge.llm import LLMCall

# Create versioned prompts
v1 = PromptTemplate(
    system="You are a helpful assistant.",
    template="Translate: {text}",
    instance_name="translator"
)

v2 = PromptTemplate(
    system="You are a professional translator.",
    template="Translate the following text to {language}:\n{text}",
    instance_name="translator"
)

# Synthesize final prompt from versions
llm = get_llm("openai:gpt-4o")
final = FinalPromptTemplate(
    instance_name="translator",
    variables=["text", "language"],
    llm_client=llm
)

# Use final prompt in production
call = LLMCall(
    query="Translate this text",
    prompt_template=final,
    client=llm,
    name="production_translator"
)

result = call.run({"text": "Hello, world!", "language": "Spanish"})

See the prompting-forge documentation for more details.

Provider Configuration

Default Provider

If you don't specify a provider, OpenAI is used by default:

llm = get_llm("gpt-4o-mini")  # Same as "openai:gpt-4o-mini"

Explicit Provider

Always recommended for clarity:

llm = get_llm("openai:gpt-4o-mini")
llm = get_llm("anthropic:claude-3-5-sonnet-20241022")

Override API Key

Pass the API key directly instead of using environment variables:

llm = get_llm(
    "anthropic:claude-3-5-haiku-20241022",
    api_key="sk-ant-your-key-here",
    temperature=0.2
)

Error Handling

from genai_forge import get_llm, OutputParserException

try:
    llm = get_llm("unknown:model")
except ValueError as e:
    print(f"Unknown provider: {e}")

try:
    result = parser.parse(invalid_json)
except OutputParserException as e:
    print(f"Parsing failed: {e}")

Running the Example

An example.py is included in the repository demonstrating:

Multiple provider usage
PromptTemplate with system prompts
PydanticOutputParser for structured outputs
Error handling

Ensure you have a .env with your API keys, then:

python example.py

Project Structure

genai_forge/
├── __init__.py              # Public API
├── llm/                     # LLM core
│   ├── base.py             # BaseLLM, LLM protocol
│   ├── registry.py         # Provider registry & factory
│   └── llm_call.py         # LLMCall with versioning
├── parsing/                # Output parsers
│   └── output_parser.py   # PydanticOutputParser
└── providers/              # LLM providers
    ├── openai.py          # OpenAI
    ├── anthropic.py       # Anthropic (Claude)
    ├── google.py          # Google (Gemini)
    ├── mistral.py         # Mistral AI
    └── cohere.py          # Cohere

Architecture

See ARCHITECTURE.md for detailed design documentation.

API Reference

Core Functions

get_llm(model: str, **kwargs) -> LLM

model: Provider and model name (e.g., "openai:gpt-4o-mini")
temperature: Sampling temperature (default: 0.3)
api_key: Optional API key override
provider: Optional explicit provider name
logger: Optional logger instance
Returns: LLM instance (callable)

get_embedding(model: str, **kwargs) -> Embedding

model: Provider and model name (e.g., "openai:text-embedding-3-small")
api_key: Optional API key override
provider: Optional explicit provider name
logger: Optional logger instance
Returns: Embedding instance (callable that takes text and returns vectors)

PydanticOutputParser(model: Type[T], strict: bool = True)

model: Pydantic model class
strict: Whether to enforce strict validation
Methods:
- get_format_instructions() -> str
- parse(text: str) -> T

LLMCall(query, prompt_template, client, **kwargs)

query: User query string
prompt_template: PromptTemplate instance
client: LLM instance
output_parser: Optional parser
name: Instance name for versioning
enable_versioning: Save call records
version_root: Root directory for versioning
Methods:
- run(context: dict) -> tuple[str, Any]

FAQ

Can I use multiple providers in the same application?

Yes! Each get_llm() call creates an independent LLM instance:

openai_llm = get_llm("openai:gpt-4o-mini")
claude_llm = get_llm("anthropic:claude-3-5-sonnet-20241022")
gemini_llm = get_llm("google:gemini-1.5-pro")

Do I need all provider packages installed?

No. Only install the providers you need:

pip install genai-forge[anthropic,google]  # Only Anthropic and Google

What if a provider API changes?

genai-forge abstracts provider differences. Update the library version, and your code should continue working.

How do I add a custom provider?

See ARCHITECTURE.md § Extensibility for a guide on implementing custom providers.

Can I use this with async code?

Not yet. Async support is planned for a future release.

Contributing

Contributions are welcome! Areas for improvement:

Additional providers (Hugging Face, AI21, etc.)
Async support
Streaming responses
Enhanced error handling
More examples

Changelog

0.2.0 (2025-11-12)

✨ Added multi-provider support: Anthropic, Google, Mistral, Cohere
📚 Comprehensive ARCHITECTURE.md documentation
🔧 Optional provider dependencies
📦 Improved package structure

0.1.17

🚀 Initial release with OpenAI support
✅ Pydantic output parsing
🔗 Chaining with pipe operator
📝 Prompt versioning with LLMCall

License

See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

daguilera

Release history Release notifications | RSS feed

0.2.5

Nov 13, 2025

0.2.4

Nov 12, 2025

This version

0.2.3

Nov 12, 2025

0.2.1

Nov 12, 2025

0.1.17

Nov 12, 2025

0.1.16

Nov 12, 2025

0.1.15

Nov 12, 2025

0.1.14

Nov 12, 2025

0.1.13

Nov 12, 2025

0.1.12

Nov 12, 2025

0.1.11

Nov 11, 2025

0.1.10

Nov 11, 2025

0.1.9

Nov 11, 2025

0.1.8

Nov 11, 2025

0.1.7

Nov 11, 2025

0.1.6

Nov 10, 2025

0.1.5

Nov 10, 2025

0.1.4

Nov 10, 2025

0.1.3

Nov 10, 2025

0.1.2

Nov 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genai_forge-0.2.3.tar.gz (69.3 kB view details)

Uploaded Nov 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

genai_forge-0.2.3-py3-none-any.whl (30.6 kB view details)

Uploaded Nov 12, 2025 Python 3

File details

Details for the file genai_forge-0.2.3.tar.gz.

File metadata

Download URL: genai_forge-0.2.3.tar.gz
Upload date: Nov 12, 2025
Size: 69.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for genai_forge-0.2.3.tar.gz
Algorithm	Hash digest
SHA256	`b51b516d637ee4aedd3a81645fc0b26112acf14920ccfd19bf384b09efe4215a`
MD5	`711f160e3929f08dee3dce4e89780d6c`
BLAKE2b-256	`d215dc8b6a3ad707559f4cfc90ccaf63116b9e9ec4dd9142f0955cdceb038bcd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for genai_forge-0.2.3.tar.gz:

Publisher: release.yml on ToolForge-AI/genai-forge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: genai_forge-0.2.3.tar.gz
- Subject digest: b51b516d637ee4aedd3a81645fc0b26112acf14920ccfd19bf384b09efe4215a
- Sigstore transparency entry: 697507819
- Sigstore integration time: Nov 12, 2025
Source repository:
- Permalink: ToolForge-AI/genai-forge@979df64460e8acec302674d627ac2343b5829f5b
- Branch / Tag: refs/tags/0.2.3
- Owner: https://github.com/ToolForge-AI
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@979df64460e8acec302674d627ac2343b5829f5b
- Trigger Event: push

File details

Details for the file genai_forge-0.2.3-py3-none-any.whl.

File metadata

Download URL: genai_forge-0.2.3-py3-none-any.whl
Upload date: Nov 12, 2025
Size: 30.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for genai_forge-0.2.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3190dd178d1d04fbb88b7d393e6d8a6e7bcdd8ec72163bc6c3d734ce1c32bb26`
MD5	`94aee188c6f89320da3a3649c0434ae8`
BLAKE2b-256	`979c3fc54fc01c895f9dacdccd5f9e022fba0a49f54dcf9c3eb785153197eed7`

See more details on using hashes here.

Provenance

The following attestation bundles were made for genai_forge-0.2.3-py3-none-any.whl:

Publisher: release.yml on ToolForge-AI/genai-forge

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: genai_forge-0.2.3-py3-none-any.whl
- Subject digest: 3190dd178d1d04fbb88b7d393e6d8a6e7bcdd8ec72163bc6c3d734ce1c32bb26
- Sigstore transparency entry: 697507864
- Sigstore integration time: Nov 12, 2025
Source repository:
- Permalink: ToolForge-AI/genai-forge@979df64460e8acec302674d627ac2343b5829f5b
- Branch / Tag: refs/tags/0.2.3
- Owner: https://github.com/ToolForge-AI
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@979df64460e8acec302674d627ac2343b5829f5b
- Trigger Event: push

genai-forge 0.2.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Project links

Meta

Project description

genai-forge

Features

Installation

Basic Installation (OpenAI only)

With Specific Providers

Requirements

Quick Start

1. Set Up Environment Variables

2. Basic Usage

Supported Providers & Models

OpenAI

Anthropic (Claude)

Google (Gemini)

Mistral AI

Cohere

Parsing Structured Outputs with Pydantic

How Parsing Works

Chaining with the Pipe Operator

Embedding Models

Basic Embedding Usage

Supported Embedding Models

OpenAI

Google (Gemini)

Mistral AI

Cohere

Embedding Use Cases

Semantic Search

Document Clustering

Multi-Provider Comparison Example

Advanced Usage: LLMCall with Versioning

Integration with prompting-forge

Provider Configuration

Default Provider

Explicit Provider

Override API Key

Error Handling

Running the Example

Project Structure

Architecture

API Reference

Core Functions

FAQ

Can I use multiple providers in the same application?

Do I need all provider packages installed?

What if a provider API changes?

How do I add a custom provider?

Can I use this with async code?

Contributing

Changelog

0.2.0 (2025-11-12)

0.1.17

License

Links

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes