A Python library for typed prompts
Project description
schema-cat
A Python library for creating typed prompts for Large Language Models (LLMs). Schema-cat allows you to define the structure of LLM responses using Pydantic models, making it easy to get structured, typed data from LLM APIs.
Published by The Famous Cat.
Features
- Define response structures using Pydantic models
- Automatically convert Pydantic models to XML schemas
- Parse LLM responses back into Pydantic models
- Built-in retry mechanism with exponential backoff for handling rate limits and transient errors
- Support for multiple LLM providers:
- OpenAI
- Anthropic
- OpenRouter
Installation
pip install schema-cat
Usage
Basic Usage
from pydantic import BaseModel
from schema_cat import prompt_with_schema, Provider
import asyncio
# Define your response structure
class UserInfo(BaseModel):
name: str
age: int
is_student: bool
# Create a prompt
prompt = "Extract information about John Doe, who is 25 years old and not a student."
# Get a structured response
async def main():
result = await prompt_with_schema(
prompt=prompt,
schema=UserInfo,
model="gpt-4-turbo", # Use an appropriate model
provider=Provider.OPENAI
)
print(f"Name: {result.name}")
print(f"Age: {result.age}")
print(f"Is student: {result.is_student}")
asyncio.run(main())
Using Different Providers
# OpenAI
result = await prompt_with_schema(prompt, UserInfo, "gpt-4-turbo", Provider.OPENAI)
# Anthropic
result = await prompt_with_schema(prompt, UserInfo, "claude-3-haiku-20240307", Provider.ANTHROPIC)
# OpenRouter
result = await prompt_with_schema(prompt, UserInfo, "anthropic/claude-3-opus-20240229", Provider.OPENROUTER)
Working with Complex Schemas
from pydantic import BaseModel
from typing import List
from schema_cat import prompt_with_schema, Provider
class Address(BaseModel):
street: str
city: str
zip_code: str
class Person(BaseModel):
name: str
age: int
addresses: List[Address]
prompt = """
Extract information about Jane Smith, who is 30 years old.
She has two addresses:
1. 123 Main St, New York, 10001
2. 456 Park Ave, Boston, 02108
"""
async def main():
result = await prompt_with_schema(prompt, Person, "gpt-4o")
print(f"Name: {result.name}")
print(f"Age: {result.age}")
print(f"Addresses:")
for addr in result.addresses:
print(f" - {addr.street}, {addr.city}, {addr.zip_code}")
asyncio.run(main())
API Reference
prompt_with_schema(prompt: str, schema: Type[T], model: str, max_tokens: int = 8192, temperature: float = 0.0, sys_prompt: str = "", max_retries: int = 5, initial_delay: float = 1.0, max_delay: float = 60.0) -> T
Makes a request to an LLM provider with a prompt and schema, returning a structured response.
prompt: The prompt to send to the LLMschema: A Pydantic model class defining the expected response structuremodel: The LLM model to use (e.g., "gpt-4-turbo", "claude-3-haiku")max_tokens: Maximum number of tokens to generate (default: 8192)temperature: Sampling temperature (0.0 to 1.0, default: 0.0)sys_prompt: Optional system prompt to prepend (default: "")max_retries: Maximum number of retries for API calls (default: 5)initial_delay: Initial delay between retries in seconds (default: 1.0)max_delay: Maximum delay between retries in seconds (default: 60.0)
schema_to_xml(schema: Type[BaseModel]) -> ElementTree.XML
Converts a Pydantic model class to an XML representation.
xml_to_base_model(xml_tree: ElementTree.XML, schema: Type[T]) -> T
Converts an XML element to a Pydantic model instance.
xml_to_string(xml_tree: ElementTree.XML) -> str
Converts an XML element to a pretty-printed string.
Environment Variables
The library uses the following environment variables:
OPENAI_API_KEY: Required for OpenAI providerOPENAI_BASE_URL: Optional, defaults to "https://api.openai.com/v1"ANTHROPIC_API_KEY: Required for Anthropic providerOPENROUTER_API_KEY: Required for OpenRouter providerOPENROUTER_BASE_URL: Optional, defaults to "https://openrouter.ai/api/v1"OPENROUTER_HTTP_REFERER: Optional, defaults to "https://www.thefamouscat.com"OPENROUTER_X_TITLE: Optional, defaults to "SchemaCat"
Development
Install dependencies with Poetry:
poetry install
Retry Mechanism
Schema-cat includes a built-in retry mechanism with exponential backoff to handle rate limits and transient errors when making API calls to LLM providers. This helps improve the reliability of your application by automatically retrying failed requests.
How It Works
When an API call fails with a retryable error (such as a network error or rate limit), schema-cat will:
- Wait for a short delay
- Retry the API call
- If it fails again, wait for a longer delay (exponential backoff)
- Continue retrying until the call succeeds or the maximum number of retries is reached
Configuration
You can configure the retry behavior using the following parameters in prompt_with_schema:
result = await prompt_with_schema(
prompt="Your prompt",
schema=YourSchema,
model="gpt-4-turbo",
max_retries=5, # Maximum number of retry attempts
initial_delay=1.0, # Initial delay between retries in seconds
max_delay=60.0 # Maximum delay between retries in seconds
)
Retryable Errors
By default, the retry mechanism handles common errors such as:
- Network errors (ConnectionError, TimeoutError)
- HTTP client errors (httpx.ConnectError, httpx.ReadTimeout, etc.)
- Provider-specific errors (openai.RateLimitError, anthropic.RateLimitError, etc.)
Running Tests
pytest
For end-to-end tests that make actual API calls:
pytest -m slow
Publishing
To publish to PyPI:
poetry build
poetry publish
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file schema_cat-0.1.19.tar.gz.
File metadata
- Download URL: schema_cat-0.1.19.tar.gz
- Upload date:
- Size: 16.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.12.11 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5be7e2c2eaa347732c14eb3a49a8e3fbaf04abbd7fe9487984cdb02bc2ca68de
|
|
| MD5 |
2aced4f662e12529ce42210da5748987
|
|
| BLAKE2b-256 |
5d3b1b041a67ddf3d61a1f7a891d2574122836108c9a8914109c4c928e4ae6ea
|
File details
Details for the file schema_cat-0.1.19-py3-none-any.whl.
File metadata
- Download URL: schema_cat-0.1.19-py3-none-any.whl
- Upload date:
- Size: 20.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.12.11 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6b73e23c852589160c23d47afc68348839e1adf5986a3780f1ffa194ecf9d2e6
|
|
| MD5 |
dba066fe0c786a963d16647c71be639c
|
|
| BLAKE2b-256 |
3d0b977bcc182527655bc34b7a4dac78b2236ef9baf8e4aa857770c793980249
|