A toolkit library for developers building LLM applications

Kerb

The complete toolkit for developers building LLM applications.

Built to power production ML systems at ApX Machine Learning (apxml.com), and available as open source.

Getting Started

Overview

Simple

Advanced LLM techniques made simple. Clean, easy-to-use interfaces for complex operations.

Lightweight

Only install what you need. Kerb is modular, with no unnecessary dependencies.

Compatible

Works with any LLM project. Kerb is a toolkit, not a framework. Use it alongside your existing stack.

Installation

# Install everything (extras are quoted so shells like zsh don't expand the brackets)
pip install "kerb[all]"

# Or install specific modules
pip install "kerb[generation]" "kerb[embeddings]" "kerb[evaluation]"

Quick Start

from kerb.generation import generate, ModelName, LLMProvider

# Generate with any provider; switching providers is a one-line config change.
response = generate(
    "Explain quantum computing",
    model=ModelName.GPT_4O_MINI,
    provider=LLMProvider.OPENAI
)

print(f"Response: {response.content}")
print(f"Tokens: {response.usage.total_tokens}")
print(f"Cost: ${response.cost:.6f}")
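The response object reports token usage and a cost figure. As an illustration of how such a figure is derived from token counts, here is a plain-Python sketch using hypothetical per-million-token prices (illustrative numbers only, not real pricing and not Kerb's implementation):

```python
# Hypothetical per-1M-token prices in USD (illustrative only).
PRICES = {"gpt-4o-mini": {"input": 0.15, "output": 0.60}}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from input/output token counts."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

print(f"${estimate_cost('gpt-4o-mini', 1200, 400):.6f}")  # $0.000420
```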

Modules

Everything you need to build LLM applications.

| Module | Description |
| --- | --- |
| Agent | Agent orchestration and execution patterns for multi-step reasoning. |
| Cache | Response and embedding caching to reduce costs and latency. |
| Chunk | Text chunking utilities for optimal context windows and retrieval. |
| Config | Configuration management for models, providers, and application settings. |
| Context | Context window management and token budget tracking. |
| Document | Document loading and processing for PDFs, web pages, and more. |
| Embedding | Embedding generation and similarity search helpers. |
| Evaluation | Metrics and benchmarking tools for LLM outputs. |
| Fine-Tuning | Model fine-tuning utilities and large dataset preparation. |
| Generation | Unified LLM generation with multi-provider support (OpenAI, Anthropic, Gemini). |
| Memory | Conversation memory and entity tracking for stateful applications. |
| Multimodal | Image, audio, and video processing for multimodal models. |
| Parsing | Output parsing and validation (JSON, structured data, function calls). |
| Preprocessing | Text cleaning and preprocessing for LLM inputs. |
| Prompt | Prompt engineering utilities, templates, and chain-of-thought patterns. |
| Retrieval | RAG and vector search utilities for semantic retrieval. |
| Safety | Content moderation and safety filters. |
| Testing | Testing utilities for LLM outputs and evaluation. |
| Tokenizer | Token counting and text splitting for any model. |

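Chunking splits text into overlapping windows so retrieved context fits a model's window. A minimal plain-Python sketch of fixed-size chunking with overlap (an illustration of the idea, not Kerb's actual implementation):

```python
def chunk_text_sketch(text: str, chunk_size: int = 512, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap by `overlap`."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap  # each window starts `step` chars after the last
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = chunk_text_sketch("a" * 1000, chunk_size=400, overlap=100)
print(len(chunks))  # 4 windows covering the full string
```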
Project Structure

kerb/
├── core/           # Shared types and interfaces
├── agent/          # Agent systems and reasoning
├── cache/          # Caching mechanisms
├── chunk/          # Text chunking utilities
├── config/         # Configuration management
├── context/        # Context window management
├── document/       # Document loading
├── embedding/      # Embedding generation
├── evaluation/     # Evaluation metrics
├── fine_tuning/    # Model fine-tuning
├── generation/     # LLM text generation
├── memory/         # Memory systems
├── multimodal/     # Multimodal processing
├── parsing/        # Output parsing
├── preprocessing/  # Text preprocessing
├── prompt/         # Prompt management
├── retrieval/      # RAG and retrieval
├── safety/         # Content safety
├── testing/        # Testing utilities
└── tokenizer/      # Token counting

Examples

RAG Pipeline

from kerb.document import load_document
from kerb.chunk import chunk_text
from kerb.embedding import embed, embed_batch
from kerb.retrieval import semantic_search, Document
from kerb.generation import generate, ModelName, LLMProvider

# Load and process document
doc = load_document("paper.pdf")
chunks = chunk_text(doc.content, chunk_size=512, overlap=50)

# Create embeddings
chunk_embeddings = embed_batch(chunks)

# Search for relevant chunks
query = "main findings"
query_embedding = embed(query)
documents = [Document(content=c) for c in chunks]
results = semantic_search(
    query_embedding=query_embedding,
    documents=documents,
    document_embeddings=chunk_embeddings,
    top_k=5
)

# Generate answer with context
context = "\n".join([r.document.content for r in results])
answer = generate(
    f"Based on: {context}\n\nQuestion: What are the main findings?",
    model=ModelName.GPT_4O_MINI,
    provider=LLMProvider.OPENAI
)
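Under the hood, semantic search ranks chunks by embedding similarity. A plain-Python sketch of cosine-similarity top-k ranking, purely illustrative and independent of Kerb's actual implementation:

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot product over the product of vector norms."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def top_k(query: list[float], docs: dict[str, list[float]], k: int = 2) -> list[str]:
    """Return the ids of the k documents most similar to the query embedding."""
    return sorted(docs, key=lambda d: cosine(query, docs[d]), reverse=True)[:k]

docs = {"a": [1.0, 0.0], "b": [0.7, 0.7], "c": [0.0, 1.0]}
print(top_k([1.0, 0.1], docs, k=2))  # ['a', 'b']
```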

LLM Caching

from kerb.cache import create_memory_cache, generate_prompt_key
from kerb.generation import generate, ModelName

cache = create_memory_cache(max_size=1000, default_ttl=3600)

def cached_generate(prompt, model=ModelName.GPT_4O_MINI, temperature=0.7):
    cache_key = generate_prompt_key(
        prompt, 
        model=model.value, 
        temperature=temperature
    )
    
    if cached := cache.get(cache_key):
        return cached['response']
    
    response = generate(prompt, model=model, temperature=temperature)
    cache.set(cache_key, {'response': response, 'cost': response.cost})
    return response

# First call
response1 = cached_generate("Explain Python decorators briefly")

# Second call returns the cached response (no API call, no cost)
response2 = cached_generate("Explain Python decorators briefly")
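create_memory_cache pairs a size limit with a time-to-live. A minimal plain-Python sketch of a TTL-expiring, size-bounded cache, shown as an illustration of the mechanism rather than Kerb's implementation:

```python
import time

class TTLCache:
    """Dict-backed cache whose entries expire after default_ttl seconds."""

    def __init__(self, max_size: int = 1000, default_ttl: float = 3600):
        self.max_size, self.default_ttl = max_size, default_ttl
        self._store: dict = {}

    def set(self, key, value):
        if key not in self._store and len(self._store) >= self.max_size:
            self._store.pop(next(iter(self._store)))  # evict oldest insertion
        self._store[key] = (value, time.monotonic() + self.default_ttl)

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires = entry
        if time.monotonic() > expires:
            del self._store[key]  # lazily drop expired entries on read
            return None
        return value

cache = TTLCache(max_size=2, default_ttl=60)
cache.set("k", "v")
print(cache.get("k"))  # v
```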

Agent Workflow

from kerb.agent.patterns import ReActAgent

def llm_function(prompt: str) -> str:
    """Your LLM function (OpenAI, Anthropic, etc.)"""
    # Implementation here
    return "agent response"

# Create a ReAct agent
agent = ReActAgent(
    name="ResearchAgent",
    llm_func=llm_function,
    max_iterations=5
)

# Execute multi-step task
result = agent.run("Research the latest AI papers and summarize key trends")

print(f"Status: {result.status.value}")
print(f"Output: {result.output}")
print(f"Steps taken: {len(result.steps)}")
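The ReAct pattern alternates reasoning and acting until the model signals a final answer or the iteration budget runs out. A compact plain-Python sketch of that loop (illustrative; the real ReActAgent handles tools, parsing, and state far more carefully, and the FINAL: convention here is an assumption for the sketch):

```python
def react_loop(llm_func, task: str, max_iterations: int = 5) -> tuple[str, int]:
    """Call the LLM repeatedly, stopping when it emits a 'FINAL:' answer."""
    scratchpad = f"Task: {task}"
    for step in range(1, max_iterations + 1):
        reply = llm_func(scratchpad)
        if reply.startswith("FINAL:"):
            return reply[len("FINAL:"):].strip(), step
        scratchpad += f"\nThought/Action {step}: {reply}"  # accumulate trajectory
    return scratchpad, max_iterations

# Stub LLM that "acts" once, then answers.
replies = iter(["search('AI papers 2024')", "FINAL: key trend is multimodality"])
output, steps = react_loop(lambda p: next(replies), "summarize AI trends")
print(output, steps)  # key trend is multimodality 2
```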

Custom Evaluation

from kerb.evaluation import (
    calculate_bleu,
    calculate_rouge,
    calculate_f1_score,
    calculate_semantic_similarity
)

# Evaluate translation quality
reference = "Hello, how are you?"
candidate = "Hi, how are you?"

# Calculate metrics
bleu_score = calculate_bleu(candidate, reference)
rouge_scores = calculate_rouge(candidate, reference, rouge_type="rouge-l")
f1 = calculate_f1_score(candidate, reference)

print(f"BLEU: {bleu_score:.3f}")
print(f"ROUGE-L F1: {rouge_scores['fmeasure']:.3f}")
print(f"F1 Score: {f1:.3f}")
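Token-level F1 treats reference and candidate as bags of tokens and takes the harmonic mean of precision and recall over their overlap. A plain-Python sketch of that computation, assuming simple whitespace tokenization (Kerb's calculate_f1_score may tokenize differently):

```python
from collections import Counter

def token_f1(candidate: str, reference: str) -> float:
    """F1 over whitespace tokens: harmonic mean of precision and recall."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # per-token minimum counts
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

print(f"{token_f1('Hi, how are you?', 'Hello, how are you?'):.3f}")  # 0.750
```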

Fine-Tuning Dataset Preparation

from kerb.fine_tuning import (
    write_jsonl,
    read_jsonl,
    TrainingExample,
    TrainingDataset,
    DatasetFormat,
    to_openai_format,
)
from kerb.fine_tuning.jsonl import (
    append_jsonl,
    merge_jsonl,
    validate_jsonl,
    count_jsonl_lines,
)

# Create training examples
examples = []
for i in range(10):
    examples.append(TrainingExample(
        messages=[
            {"role": "user", "content": f"How do I use Python feature {i}?"},
            {"role": "assistant", "content": f"Here's how to use feature {i}: example_code()"}
        ],
        metadata={"category": "coding", "index": i}
    ))

dataset = TrainingDataset(
    examples=examples,
    format=DatasetFormat.CHAT,
    metadata={"source": "coding_qa"}
)

# Convert to OpenAI format and write to JSONL
data = to_openai_format(dataset)
write_jsonl(data, "training_data.jsonl")

# Validate the JSONL file
result = validate_jsonl("training_data.jsonl")
print(f"Valid: {result.is_valid}, Examples: {result.total_examples}")

# Count lines efficiently
count = count_jsonl_lines("training_data.jsonl")
print(f"Total examples: {count}")
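JSONL is simply one JSON object per line. A stdlib-only sketch of the write-then-validate round trip that the helpers above perform, independent of Kerb's API:

```python
import json
import os
import tempfile

rows = [{"messages": [{"role": "user", "content": f"Q{i}"},
                      {"role": "assistant", "content": f"A{i}"}]} for i in range(3)]

path = os.path.join(tempfile.mkdtemp(), "train.jsonl")
with open(path, "w", encoding="utf-8") as f:
    for row in rows:
        f.write(json.dumps(row) + "\n")  # one JSON object per line

# Validate: every line must parse and contain a "messages" list.
with open(path, encoding="utf-8") as f:
    parsed = [json.loads(line) for line in f]
assert all(isinstance(r["messages"], list) for r in parsed)
print(len(parsed))  # 3
```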

License

Apache 2.0 License - see LICENSE for details.
