structured outputs for llm

These details have not been verified by PyPI

Project links

repository

Project description

Instructor: Structured Outputs for LLMs

Get reliable JSON from any LLM. Built on Pydantic for validation, type safety, and IDE support.

import instructor
from pydantic import BaseModel


# Define what you want
class User(BaseModel):
    name: str
    age: int


# Extract it from natural language
client = instructor.from_provider("openai/gpt-4o-mini")
user = client.chat.completions.create(
    response_model=User,
    messages=[{"role": "user", "content": "John is 25 years old"}],
)

print(user)  # User(name='John', age=25)

That's it. No JSON parsing, no error handling, no retries. Just define a model and get structured data.

Why Instructor?

Getting structured data from LLMs is hard. You need to:

Write complex JSON schemas
Handle validation errors
Retry failed extractions
Parse unstructured responses
Deal with different provider APIs

Instructor handles all of this with one simple interface:

Without Instructor

With Instructor

response = openai.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "..."}],
    tools=[
        {
            "type": "function",
            "function": {
                "name": "extract_user",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "name": {"type": "string"},
                        "age": {"type": "integer"},
                    },
                },
            },
        }
    ],
)

# Parse response
tool_call = response.choices[0].message.tool_calls[0]
user_data = json.loads(tool_call.function.arguments)

# Validate manually
if "name" not in user_data:
    # Handle error...
    pass

client = instructor.from_provider("openai/gpt-4")

user = client.chat.completions.create(
    response_model=User,
    messages=[{"role": "user", "content": "..."}],
)

# That's it! user is validated and typed

Install in seconds

pip install instructor

Or with your package manager:

uv add instructor
poetry add instructor

Works with every major provider

Use the same code with any LLM provider:

# OpenAI
client = instructor.from_provider("openai/gpt-4o")

# Anthropic
client = instructor.from_provider("anthropic/claude-3-5-sonnet")

# Google
client = instructor.from_provider("google/gemini-pro")

# Ollama (local)
client = instructor.from_provider("ollama/llama3.2")

# With API keys directly (no environment variables needed)
client = instructor.from_provider("openai/gpt-4o", api_key="sk-...")
client = instructor.from_provider("anthropic/claude-3-5-sonnet", api_key="sk-ant-...")
client = instructor.from_provider("groq/llama-3.1-8b-instant", api_key="gsk_...")

# All use the same API!
user = client.chat.completions.create(
    response_model=User,
    messages=[{"role": "user", "content": "..."}],
)

Production-ready features

Automatic retries

Failed validations are automatically retried with the error message:

from pydantic import BaseModel, field_validator


class User(BaseModel):
    name: str
    age: int

    @field_validator('age')
    def validate_age(cls, v):
        if v < 0:
            raise ValueError('Age must be positive')
        return v


# Instructor automatically retries when validation fails
user = client.chat.completions.create(
    response_model=User,
    messages=[{"role": "user", "content": "..."}],
    max_retries=3,
)

Streaming support

Stream partial objects as they're generated:

from instructor import Partial

for partial_user in client.chat.completions.create(
    response_model=Partial[User],
    messages=[{"role": "user", "content": "..."}],
    stream=True,
):
    print(partial_user)
    # User(name=None, age=None)
    # User(name="John", age=None)
    # User(name="John", age=25)

Nested objects

Extract complex, nested data structures:

from typing import List


class Address(BaseModel):
    street: str
    city: str
    country: str


class User(BaseModel):
    name: str
    age: int
    addresses: List[Address]


# Instructor handles nested objects automatically
user = client.chat.completions.create(
    response_model=User,
    messages=[{"role": "user", "content": "..."}],
)

Used in production by

Trusted by over 100,000 developers and companies building AI applications:

3M+ monthly downloads
10K+ GitHub stars
1000+ community contributors

Companies using Instructor include teams at OpenAI, Google, Microsoft, AWS, and many YC startups.

Get started

Basic extraction

Extract structured data from any text:

from pydantic import BaseModel
import instructor

client = instructor.from_provider("openai/gpt-4o-mini")


class Product(BaseModel):
    name: str
    price: float
    in_stock: bool


product = client.chat.completions.create(
    response_model=Product,
    messages=[{"role": "user", "content": "iPhone 15 Pro, $999, available now"}],
)

print(product)
# Product(name='iPhone 15 Pro', price=999.0, in_stock=True)

Multiple languages

Instructor's simple API is available in many languages:

Python - The original
TypeScript - Full TypeScript support
Ruby - Ruby implementation
Go - Go implementation
Elixir - Elixir implementation
Rust - Rust implementation

Learn more

Documentation - Comprehensive guides
Examples - Copy-paste recipes
Blog - Tutorials and best practices
Discord - Get help from the community

Why use Instructor over alternatives?

vs Raw JSON mode: Instructor provides automatic validation, retries, streaming, and nested object support. No manual schema writing.

vs LangChain/LlamaIndex: Instructor is focused on one thing - structured extraction. It's lighter, faster, and easier to debug.

vs Custom solutions: Battle-tested by thousands of developers. Handles edge cases you haven't thought of yet.

Contributing

We welcome contributions! Check out our good first issues to get started.

License

MIT License - see LICENSE for details.

Built by the Instructor community. Special thanks to Jason Liu and all contributors.

Project details

These details have not been verified by PyPI

Project links

repository

Release history Release notifications | RSS feed

This version

1.10.0

Jul 18, 2025

1.9.2

Jul 7, 2025

1.9.1

Jul 7, 2025

1.9.0

Jun 21, 2025

1.8.3

May 22, 2025

1.8.2

May 15, 2025

1.8.1

May 9, 2025

1.8.0

May 7, 2025

1.7.9

Apr 3, 2025

1.7.8

Mar 29, 2025

1.7.7

Mar 17, 2025

1.7.6

Mar 17, 2025

1.7.5

Mar 16, 2025

1.7.4

Mar 12, 2025

1.7.3

Mar 6, 2025

1.7.2

Dec 26, 2024

1.7.1

Dec 25, 2024

1.7.0

Nov 27, 2024

1.6.4

Nov 14, 2024

1.6.3

Oct 21, 2024

1.6.2

Oct 17, 2024

1.6.1

Oct 17, 2024

1.6.0

Oct 17, 2024

1.5.2

Oct 8, 2024

1.5.1

Oct 4, 2024

1.5.0

Sep 30, 2024

1.4.3

Sep 19, 2024

1.4.2

Sep 14, 2024

1.4.1

Sep 6, 2024

1.4.0

Aug 22, 2024

1.3.7

Jul 24, 2024

1.3.6

Jul 23, 2024

1.3.5

Jul 17, 2024

1.3.4

Jun 25, 2024

1.3.3

Jun 11, 2024

1.3.2

May 27, 2024

1.3.1

May 23, 2024

1.3.0

May 23, 2024

1.2.6

May 9, 2024

1.2.5

May 1, 2024

1.2.4

Apr 29, 2024

1.2.3

Apr 27, 2024

1.2.2

Apr 20, 2024

1.2.1

Apr 18, 2024

1.2.0

Apr 14, 2024

1.1.0

Apr 11, 2024

1.0.3

Apr 5, 2024

1.0.2

Apr 5, 2024

1.0.0

Apr 1, 2024

0.6.8

Mar 29, 2024

0.6.7

Mar 21, 2024

0.6.6

Mar 21, 2024

0.6.5

Mar 20, 2024

0.6.4

Mar 8, 2024

0.6.3

Mar 6, 2024

0.6.2

Mar 1, 2024

0.6.1

Feb 20, 2024

0.6.0

Feb 18, 2024

0.5.2

Feb 7, 2024

0.5.0

Feb 4, 2024

0.4.8

Jan 23, 2024

0.4.7

Jan 14, 2024

0.4.6

Jan 5, 2024

0.4.5

Dec 19, 2023

0.4.4

Dec 17, 2023

0.4.3

Dec 17, 2023

0.4.2

Dec 6, 2023

0.4.0

Nov 27, 2023

0.3.5

Nov 19, 2023

0.3.4

Nov 13, 2023

0.3.3

Nov 13, 2023

0.3.2

Nov 11, 2023

0.3.1

Nov 9, 2023

0.3.0

Nov 8, 2023

0.2.11

Nov 6, 2023

0.2.9

Oct 22, 2023

0.2.8

Sep 19, 2023

0.2.7

Sep 8, 2023

0.2.6

Sep 6, 2023

0.2.5

Aug 24, 2023

0.2.4

Aug 17, 2023

0.2.1

Jul 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

instructor-1.10.0.tar.gz (69.4 MB view details)

Uploaded Jul 18, 2025 Source

Built Distribution

instructor-1.10.0-py3-none-any.whl (119.5 kB view details)

Uploaded Jul 18, 2025 Python 3

File details

Details for the file instructor-1.10.0.tar.gz.

File metadata

Download URL: instructor-1.10.0.tar.gz
Upload date: Jul 18, 2025
Size: 69.4 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.0

File hashes

Hashes for instructor-1.10.0.tar.gz
Algorithm	Hash digest
SHA256	`887d33e058b913290dbf526b0096b1bb8d7ea1a07d75afecbf716161f959697b`
MD5	`367aa6ca185793ebdbdb8f4a50d9d949`
BLAKE2b-256	`a56763c4b4d2cc3c7b4238920ad3388a6f5d67265ab7c09ee34012d6b591130e`

See more details on using hashes here.

File details

Details for the file instructor-1.10.0-py3-none-any.whl.

File metadata

Download URL: instructor-1.10.0-py3-none-any.whl
Upload date: Jul 18, 2025
Size: 119.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.0

File hashes

Hashes for instructor-1.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9c789f0fce915d5498059afb5314530c8a5b22b0283302679148ddae98f732b0`
MD5	`89f20daec5a8c591a61c8a985e91e63a`
BLAKE2b-256	`2cfbffc1ade9779795a8dc8e2379b1bfb522161ee7df8df12722f50d348fb4ea`

See more details on using hashes here.

instructor 1.10.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Project description

Instructor: Structured Outputs for LLMs

Why Instructor?

Install in seconds

Works with every major provider

Production-ready features

Automatic retries

Streaming support

Nested objects

Used in production by

Get started

Basic extraction

Multiple languages

Learn more

Why use Instructor over alternatives?

Contributing

License

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes