
Probabilistic Generative Model Programming



Outlines

Build reliable workflows based on interactions with generative models.

Prompting · Controlled generation · Agents · Sampling · Parallel execution · Examples

Outlines allows you to control and diagnose interactions with LLMs more effectively. Modern language models are powerful and versatile, but the way they interface with existing systems can be very brittle, their outputs can be unreliable, and complex workflows (agents) can introduce a lot of error-prone code duplication. Outlines provides robust prompting primitives that separate the prompting from the execution logic and lead to simple implementations of few-shot generations, ReAct, meta-prompting, agents, etc. Outlines helps developers control text generation and produce predictable outputs that make the interaction with user code more robust. Its sampling-first approach allows one to diagnose issues with model-generated output more easily, and implement more robust generation methods such as self-consistency or DiVeRSe.

Outlines is designed as a library that integrates well with the broader Python environment. Generation can be interleaved with control flow or custom function calls, and prompts can be imported from other modules or libraries.

Features

  • Simple and powerful prompting primitives based on the Jinja templating engine
  • Interleave completions with loops, conditionals, and custom Python functions
  • Caching of generations
  • Integration with OpenAI and HuggingFace models
  • Controlled generation, including multiple choice, type constraints and dynamic stopping
  • Sampling of multiple sequences
  • Vectorized execution

Installation

Outlines is available on PyPI:

pip install outlines

Prompting

Writing prompts by concatenating strings in pure Python quickly becomes cumbersome: the prompt-building logic gets entangled with the rest of the program, and the structure of the rendered prompt is obfuscated. Outlines makes it easier to write and manage prompts by encapsulating templates inside "template functions".

These functions make it possible to neatly separate the prompt logic from the general program logic; they can be imported from other modules and libraries.

Template functions require no superfluous abstraction; they use the Jinja2 templating engine to help build complex prompts in a concise manner:

import outlines.text as text
import outlines.models as models


examples = [
    ("The food was digusting", "Negative"),
    ("We had a fantastic night", "Positive"),
    ("Recommended", "Positive"),
    ("The waiter was rude", "Negative")
]

@text.prompt
def labelling(to_label, examples):
    """You are a sentiment-labelling assistant.

    {% for example in examples %}
    {{ example[0] }} // {{ example[1] }}
    {% endfor %}
    {{ to_label }} //
    """

model = models.text_completion.openai("text-davinci-003")
prompt = labelling("Just awesome", examples)
answer = model(prompt)

Chaining with loops and conditionals (example)

Outlines comes with very few abstractions, and is designed to blend into existing code and integrate with the rest of the ecosystem.

reviews = ["Just awesome", "Avoid", "Will come back"]

def send_notification(review):
    """This function sends a notification with the review's content."""
    ...

for review in reviews:
    prompt = labelling(review, examples)
    answer = model(prompt)
    if answer == "Positive":
        send_notification(review)

Agents (example)

Outlines makes building agents like AutoGPT, BabyAGI, ViperGPT or Transformers Agent easier by removing boilerplate prompting code.

Tools

We can teach language models to call external functions to get additional information or perform tasks by encoding the functions' descriptions in the prompt. To avoid duplicating information between the function definition and the description passed to the prompt, we define custom Jinja filters that can extract a function's name, description, signature and source:

from typing import Callable, List
import outlines.text as text


def google_search(query: str):
    """Google Search"""
    pass


def wikipedia_search(query: str):
    """Wikipedia Search"""
    pass


@text.prompt
def agent(tools: List[Callable]):
    """AVAILABLE COMMANDS:

    {% for tool in tools %}
    TOOL
    {{ tool | name }}, {{ tool | description }}, args: {{ tool | signature }}
    {{ tool | source }}
    {% endfor %}
    """


prompt = agent([google_search, wikipedia_search])

Response models

We can instruct models to return their output in a pre-defined format, often JSON. To avoid duplicating information between the response model's definition and the description passed to the prompt, we define a custom Jinja filter that can extract the expected response's schema:

from pydantic import BaseModel
import outlines.text as text


class Joke(BaseModel):
    joke: str
    explanation: str


@text.prompt
def joke_ppt(response_model):
    """Tell a joke and explain why the joke is funny.

    RESPONSE FORMAT:
    {{ response_model | schema }}
    """


joke_ppt(Joke)
# Tell a joke and explain why the joke is funny.
#
# RESPONSE FORMAT:
# {
#    "joke": "The joke"
#    "explanation": "The explanation of why the joke is funny"
#  }

Controlled generation

The first step towards reliability of systems that include large language models is to ensure that there is a well-defined interface between their output and user-defined code. Outlines provides ways to control the generation of language models to make their output more predictable.

You can stop the generation after a given sequence has been found:

answer = model("Tell me a one-sentence joke.", stop_at=["."])

You can reduce the completion to a choice between multiple possibilities:

prompt = labelling("Just awesome", examples)
answer = model(prompt, is_in=["Positive", "Negative"])

You can require the generated sequence to be an int or a float:

import outlines.models as models


model = models.text_completion.hf("sshleifer/tiny-gpt2")
answer = model("2 + 2 = ", type="int")
print(answer)
# 4

model = models.text_completion.hf("sshleifer/tiny-gpt2")
answer = model("1.7 + 3.2 = ", type="float")
print(answer)
# 4.9

Sampling (uncertainty, simulation-based inference)

Outlines is strictly sampling-based, and focuses on methods such as self-consistency, adaptive consistency, DiVeRSe, Tree of Thoughts, lattice sampling, etc. Several samples can be obtained using the num_samples keyword argument:

import outlines.models as models


model = models.text_completion.hf("sshleifer/tiny-gpt2")
answer = model("2 + 2 = ", num_samples=5)
print(answer)
# [4, 5, 4, 4, 4]

The focus on sampling allows us to explore different ideas, such as using the diversity of answers to evaluate the model's uncertainty, or simulation-based inference to optimize the prompt.
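
For illustration, here is a minimal self-consistency sketch built on the num_samples argument shown above: draw several completions for the same prompt and keep the most frequent answer. The majority vote is plain Python, not an Outlines API.

from collections import Counter

import outlines.models as models


model = models.text_completion.hf("sshleifer/tiny-gpt2")
# Draw several completions for the same prompt
samples = model("2 + 2 = ", num_samples=5)
# Keep the most frequent answer (majority vote over the samples)
answer, _ = Counter(samples).most_common(1)[0]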

Vectorization and parallel execution

You can pass prompts as a NumPy array (or a nested list) to Outlines models:

import numpy as np
import outlines.models as models

model = models.text_completion.openai("text-davinci-003")

prompts = [
    ["Translate 'Hello' in Italian", "Translate 'Hello' in French"],
    ["Translate 'Hello' in Spanish", "Translate 'Hello' in German"],
]
answers = model(prompts)

print(answers.shape)
# (2, 2)

Outlines also provides an outlines.vectorize decorator that will vectorize any function. If the function is async, the requests will be run concurrently:

import aiohttp
import numpy as np
import outlines

@outlines.vectorize
async def wikipedia_search(query):
    url = f"https://en.wikipedia.org/w/api.php?format=json&action=query&prop=extracts&exintro&explaintext&redirects=1&titles={query}&origin=*"
    async with aiohttp.ClientSession() as session:
        async with session.get(url) as response:
            return await response.text()

results = wikipedia_search([["Cat", "Dog"], ["Bird", "Horse"]])
print(results.shape)
# (2, 2)

This feature allows you to run multiple workflows in parallel: for instance, to evaluate a workflow on several examples while you iterate on it (and avoid overfitting to a single input), or to run a workflow over many different inputs in production.
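
As a sketch, one could wrap a whole workflow in a vectorized function and run it over an array of inputs (this reuses the labelling template, examples and model from the prompting section; since the function below is synchronous, outlines.vectorize simply broadcasts it over the inputs, while async functions run concurrently as described above):

import numpy as np
import outlines


@outlines.vectorize
def label_review(review):
    # One complete workflow per input: build the prompt, then constrain the answer
    prompt = labelling(review, examples)
    return model(prompt, is_in=["Positive", "Negative"])


reviews = np.array(["Just awesome", "Avoid", "Will come back"])
answers = label_review(reviews)
# answers has the same shape as reviews: one label per review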

Contributing

What contributions?

We currently only accept bug fixes and documentation contributions. If you have a feature request, please start a new discussion. The issue tracker is only intended for actionable items.

How to contribute?

Run pip install -e .[test] or conda env create -f environment.yml. To build the documentation, you will also need to run pip install -r requirements-doc.txt.

Before pushing your code to the repository, please run pre-commit run --all-files and pytest to make sure that the code is formatted correctly and that the tests pass.

Do not hesitate to open a draft PR before your contribution is ready, especially if you have questions and/or need feedback.

Examples
