structured outputs for llm

These details have not been verified by PyPI

Project links

Project description

Welcome to Instructor - Your Gateway to Structured Outputs with OpenAI

Pythonic Structured Outputs powered by LLM function calling and tool calling APIs. Designed for simplicity, transparency, and control.

Star us on Github!

Dive into the world of Python-based structured extraction, empowered by OpenAI's cutting-edge function calling API. Instructor stands out for its simplicity, transparency, and user-centric design. Whether you're a seasoned developer or just starting out, you'll find Instructor's approach intuitive and its results insightful.

Ports to other languages

Check out ports to other languages below:

If you want to port Instructor to another language, please reach out to us on Twitter we'd love to help you get started!

Get Started in Moments

Installing Instructor is a breeze. Simply run pip install instructor in your terminal and you're on your way to a smoother data handling experience!

How Instructor Enhances Your Workflow

Our instructor.patch for the OpenAI class introduces three key enhancements:

Response Mode: Specify a Pydantic model to streamline data extraction.
Max Retries: Set your desired number of retry attempts for requests.
Validation Context: Provide a context object for enhanced validator access. A Glimpse into Instructor's Capabilities.

Using Validators

To learn more about validators, checkout our blog post Good LLM validation is just good validation

Usage

With Instructor, your code becomes more efficient and readable. Here’s a quick peek:

import instructor
from openai import OpenAI
from pydantic import BaseModel

# Enables `response_model`
client = instructor.patch(OpenAI())


class UserDetail(BaseModel):
    name: str
    age: int


user = client.chat.completions.create(
    model="gpt-3.5-turbo",
    response_model=UserDetail,
    messages=[
        {"role": "user", "content": "Extract Jason is 25 years old"},
    ],
)

assert isinstance(user, UserDetail)
assert user.name == "Jason"
assert user.age == 25

Using async clients

For async clients you must use apatch vs. patch, as shown:

import instructor
import asyncio
import openai
from pydantic import BaseModel

aclient = instructor.apatch(openai.AsyncOpenAI())


class UserExtract(BaseModel):
    name: str
    age: int


task = aclient.chat.completions.create(
    model="gpt-3.5-turbo",
    response_model=UserExtract,
    messages=[
        {"role": "user", "content": "Extract jason is 25 years old"},
    ],
)


response = asyncio.run(task)
print(response.model_dump_json(indent=2))
"""
{
  "name": "Jason",
  "age": 25
}
"""

Step 1: Patch the client

First, import the required libraries and apply the patch function to the OpenAI module. This exposes new functionality with the response_model parameter.

import instructor
from openai import OpenAI

# This enables response_model keyword
# from client.chat.completions.create
client = instructor.patch(OpenAI())

Step 2: Define the Pydantic Model

Create a Pydantic model to define the structure of the data you want to extract. This model will map directly to the information in the prompt.

from pydantic import BaseModel


class UserDetail(BaseModel):
    name: str
    age: int

Step 3: Extract

Use the client.chat.completions.create method to send a prompt and extract the data into the Pydantic object. The response_model parameter specifies the Pydantic model to use for extraction. It is helpful to annotate the variable with the type of the response model which will help your IDE provide autocomplete and spell check.

import instructor
import openai
from pydantic import BaseModel

client = instructor.patch(openai.OpenAI())


class UserDetail(BaseModel):
    name: str
    age: int


user = client.chat.completions.create(
    model="gpt-3.5-turbo",
    response_model=UserDetail,
    messages=[
        {"role": "user", "content": "Extract Jason is 25 years old"},
    ],
)

assert isinstance(user, UserDetail)
assert user.name == "Jason"
assert user.age == 25
print(user.model_dump_json(indent=2))
"""
{
  "name": "Jason",
  "age": 25
}
"""

Pydantic Validation

Validation can also be plugged into the same Pydantic model.

In this example, if the answer attribute contains content that violates the rule "Do not say objectionable things", Pydantic will raise a validation error.

from pydantic import BaseModel, ValidationError, BeforeValidator
from typing_extensions import Annotated
from instructor import llm_validator


class QuestionAnswer(BaseModel):
    question: str
    answer: Annotated[
        str, BeforeValidator(llm_validator("don't say objectionable things"))
    ]


try:
    qa = QuestionAnswer(
        question="What is the meaning of life?",
        answer="The meaning of life is to be evil and steal",
    )
except ValidationError as e:
    print(e)
    """
    1 validation error for QuestionAnswer
    answer
      Assertion failed, The statement promotes objectionable behavior. [type=assertion_error, input_value='The meaning of life is to be evil and steal', input_type=str]
        For further information visit https://errors.pydantic.dev/2.6/v/assertion_error
    """

It is important to note here that the error message is generated by the LLM, not the code. Thus, it is helpful for re-asking the model.

1 validation error for QuestionAnswer
answer
   Assertion failed, The statement is objectionable. (type=assertion_error)

Re-ask on validation error

Here, the UserDetails model is passed as the response_model, and max_retries is set to 2.

import instructor

from openai import OpenAI
from pydantic import BaseModel, field_validator

# Apply the patch to the OpenAI client
client = instructor.patch(OpenAI())


class UserDetails(BaseModel):
    name: str
    age: int

    @field_validator("name")
    @classmethod
    def validate_name(cls, v):
        if v.upper() != v:
            raise ValueError("Name must be in uppercase.")
        return v


model = client.chat.completions.create(
    model="gpt-3.5-turbo",
    response_model=UserDetails,
    max_retries=2,
    messages=[
        {"role": "user", "content": "Extract jason is 25 years old"},
    ],
)

print(model.model_dump_json(indent=2))
"""
{
  "name": "JASON",
  "age": 25
}
"""

Evals

We invite you to contribute to evals in pytest as a way to monitor the quality of the OpenAI models and the instructor library. To get started check out the jxnl/instructor/tests/evals and contribute your own evals in the form of pytest tests. These evals will be run once a week and the results will be posted.

Contributing

If you want to help, checkout some of the issues marked as good-first-issue or help-wanted found here. They could be anything from code improvements, a guest blog post, or a new cookbook.

CLI

We also provide some added CLI functionality for easy convinience:

instructor jobs : This helps with the creation of fine-tuning jobs with OpenAI. Simple use instructor jobs create-from-file --help to get started creating your first fine-tuned GPT3.5 model
instructor files : Manage your uploaded files with ease. You'll be able to create, delete and upload files all from the command line
instructor usage : Instead of heading to the OpenAI site each time, you can monitor your usage from the cli and filter by date and time period. Note that usage often takes ~5-10 minutes to update from OpenAI's side

License

This project is licensed under the terms of the MIT License.

Contributors

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.14.5

Jan 29, 2026

1.14.4

Jan 16, 2026

1.14.3

Jan 13, 2026

1.14.2

Jan 13, 2026

1.14.1

Jan 8, 2026

1.13.0

Nov 6, 2025

1.12.0

Oct 27, 2025

1.11.3

Sep 9, 2025

1.11.2

Aug 27, 2025

1.11.0

Aug 27, 2025

1.10.0

Jul 18, 2025

1.9.2

Jul 7, 2025

1.9.1

Jul 7, 2025

1.9.0

Jun 21, 2025

1.8.3

May 22, 2025

1.8.2

May 15, 2025

1.8.1

May 9, 2025

1.8.0

May 7, 2025

1.7.9

Apr 3, 2025

1.7.8

Mar 29, 2025

1.7.7

Mar 17, 2025

1.7.6

Mar 17, 2025

1.7.5

Mar 16, 2025

1.7.4

Mar 12, 2025

1.7.3

Mar 6, 2025

1.7.2

Dec 26, 2024

1.7.1

Dec 25, 2024

1.7.0

Nov 27, 2024

1.6.4

Nov 14, 2024

1.6.3

Oct 21, 2024

1.6.2

Oct 17, 2024

1.6.1

Oct 17, 2024

1.6.0

Oct 17, 2024

1.5.2

Oct 8, 2024

1.5.1

Oct 4, 2024

1.5.0

Sep 30, 2024

1.4.3

Sep 19, 2024

1.4.2

Sep 14, 2024

1.4.1

Sep 6, 2024

1.4.0

Aug 22, 2024

1.3.7

Jul 24, 2024

1.3.6

Jul 23, 2024

1.3.5

Jul 17, 2024

1.3.4

Jun 25, 2024

1.3.3

Jun 11, 2024

1.3.2

May 27, 2024

1.3.1

May 23, 2024

1.3.0

May 23, 2024

1.2.6

May 9, 2024

1.2.5

May 1, 2024

1.2.4

Apr 29, 2024

1.2.3

Apr 27, 2024

1.2.2

Apr 20, 2024

1.2.1

Apr 18, 2024

1.2.0

Apr 14, 2024

1.1.0

Apr 11, 2024

1.0.3

Apr 5, 2024

1.0.2

Apr 5, 2024

1.0.0

Apr 1, 2024

0.6.8

Mar 29, 2024

0.6.7

Mar 21, 2024

0.6.6

Mar 21, 2024

0.6.5

Mar 20, 2024

0.6.4

Mar 8, 2024

0.6.3

Mar 6, 2024

0.6.2

Mar 1, 2024

0.6.1

Feb 20, 2024

0.6.0

Feb 18, 2024

This version

0.5.2

Feb 7, 2024

0.5.0

Feb 4, 2024

0.4.8

Jan 23, 2024

0.4.7

Jan 14, 2024

0.4.6

Jan 5, 2024

0.4.5

Dec 19, 2023

0.4.4

Dec 17, 2023

0.4.3

Dec 17, 2023

0.4.2

Dec 6, 2023

0.4.0

Nov 27, 2023

0.3.5

Nov 19, 2023

0.3.4

Nov 13, 2023

0.3.3

Nov 13, 2023

0.3.2

Nov 11, 2023

0.3.1

Nov 9, 2023

0.3.0

Nov 8, 2023

0.2.11

Nov 6, 2023

0.2.9

Oct 22, 2023

0.2.8

Sep 19, 2023

0.2.7

Sep 8, 2023

0.2.6

Sep 6, 2023

0.2.5

Aug 24, 2023

0.2.4

Aug 17, 2023

0.2.1

Jul 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

instructor-0.5.2.tar.gz (28.9 kB view details)

Uploaded Feb 7, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

instructor-0.5.2-py3-none-any.whl (33.3 kB view details)

Uploaded Feb 7, 2024 Python 3

File details

Details for the file instructor-0.5.2.tar.gz.

File metadata

Download URL: instructor-0.5.2.tar.gz
Upload date: Feb 7, 2024
Size: 28.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.10.13 Linux/6.2.0-1019-azure

File hashes

Hashes for instructor-0.5.2.tar.gz
Algorithm	Hash digest
SHA256	`d8d679eb4624254db615794aaab59840e506fa696bc0181d998ae4f9ded2706d`
MD5	`fbf2b6efb138f3a3c527ae0012d0dca0`
BLAKE2b-256	`a5f7195cc1890275876e954d2cb81f1ee1839fee3db2bdf3f44fa40cb3097d8a`

See more details on using hashes here.

File details

Details for the file instructor-0.5.2-py3-none-any.whl.

File metadata

Download URL: instructor-0.5.2-py3-none-any.whl
Upload date: Feb 7, 2024
Size: 33.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.10.13 Linux/6.2.0-1019-azure

File hashes

Hashes for instructor-0.5.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8c7c927f3cbf6cd863eeebceae3f021e27eaca2ceaf9e9f3c8204540a1126160`
MD5	`3a7aaed431ee8c295c36c2bde73ad4cb`
BLAKE2b-256	`3d5735e22db00463ac2a2f57e7b8823d0084c5e83f69d518313bdfbf61764463`

See more details on using hashes here.

instructor 0.5.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Welcome to Instructor - Your Gateway to Structured Outputs with OpenAI

Ports to other languages

Get Started in Moments

How Instructor Enhances Your Workflow

Using Validators

Usage

Using async clients

Step 1: Patch the client

Step 2: Define the Pydantic Model

Step 3: Extract

Pydantic Validation

Re-ask on validation error

Evals

Contributing

CLI

License

Contributors

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes