Probabilistic programming with Large Language Models.

These details have not been verified by PyPI

Project description

LLaMPPL

LLaMPPL is a research prototype for language model probabilistic programming: specifying language generation tasks by writing probabilistic programs that combine calls to LLMs, symbolic program logic, and probabilistic conditioning. To solve these tasks, LLaMPPL uses a specialized sequential Monte Carlo inference algorithm. This technique, SMC steering, is described in our recent workshop abstract.

This library was formerly known as hfppl.

Installation

If you just want to try out LLaMPPL, check out our demo notebook on Colab, which performs a simple constrained generation task using GPT-2. (Larger models may require more RAM or GPU resources than Colab's free version provides.)

To get started on your own machine, you can install this library from PyPI:

pip install llamppl

For faster inference on Apple Silicon devices, you can install with MLX backend:

pip install llamppl[mlx]

Local installation

For local development, clone this repository and run pip install -e ".[dev,examples]" to install llamppl and its development dependencies.

git clone https://github.com/genlm/llamppl
cd llamppl
pip install -e ".[dev,examples]"

Then, try running an example. Note that this will cause the weights of a HuggingFace model to be downloaded.

python examples/hard_constraints.py

If everything is working, you should see the model generate political news using words that are at most five letters long (e.g., "Dr. Jill Biden may still be a year away from the White House but she is set to make her first trip to the U.N. today.").

Modeling with LLaMPPL

A LLaMPPL program is a subclass of the llamppl.Model class.

from llamppl import Model, LMContext, CachedCausalLM

# A LLaMPPL model subclasses the Model class
class MyModel(Model):

    # The __init__ method is used to process arguments
    # and initialize instance variables.
    def __init__(self, lm, prompt, forbidden_letter):
        super().__init__()

        # A stateful context object for the LLM, initialized with the prompt
        self.context = LMContext(lm, prompt)
        self.eos_token = lm.tokenizer.eos_token_id

        # The forbidden letter
        self.forbidden_tokens = set(i for (i, v) in enumerate(lm.vocab)
                                      if forbidden_letter in v)

    # The step method is used to perform a single 'step' of generation.
    # This might be a single token, a single phrase, or any other division.
    # Here, we generate one token at a time.
    async def step(self):
        # Condition on the next token *not* being a forbidden token.
        await self.observe(self.context.mask_dist(self.forbidden_tokens), False)

        # Sample the next token from the LLM -- automatically extends `self.context`.
        token = await self.sample(self.context.next_token())

        # Check for EOS or end of sentence
        if token.token_id == self.eos_token or str(token) in ['.', '!', '?']:
            # Finish generation
            self.finish()

    # To improve performance, a hint that `self.forbidden_tokens` is immutable
    def immutable_properties(self):
        return set(['forbidden_tokens'])

The Model class provides a number of useful methods for specifying a LLaMPPL program:

self.sample(dist[, proposal]) samples from the given distribution. Providing a proposal does not modify the task description, but can improve inference. Here, for example, we use a proposal that pre-emptively avoids the forbidden letter.
self.condition(cond) conditions on the given Boolean expression.
self.finish() indicates that generation is complete.
self.observe(dist, obs) performs a form of 'soft conditioning' on the given distribution. It is equivalent to (but more efficient than) sampling a value v from dist and then immediately running condition(v == obs).

To run inference, we use the smc_steer or smc_standard methods:

import asyncio
from llamppl import smc_steer

# Initialize the language model
lm = CachedCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

# Create a model instance
model = MyModel(lm, "The weather today is expected to be", "e")

# Run inference
particles = asyncio.run(smc_steer(model, 5, 3)) # number of particles N, and beam factor K

Sample output:

sunny.
sunny and cool.
34° (81°F) in Chicago with winds at 5mph.
34° (81°F) in Chicago with winds at 2-9 mph.
hot and humid with a possibility of rain, which is not uncommon for this part of Mississippi.

Further documentation can be found at https://genlm.github.io/llamppl.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.3

Apr 9, 2026

0.2.2

Nov 18, 2025

0.2.1

Jul 25, 2025

0.2.0

Apr 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llamppl-0.2.3.tar.gz (72.4 kB view details)

Uploaded Apr 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llamppl-0.2.3-py3-none-any.whl (29.9 kB view details)

Uploaded Apr 9, 2026 Python 3

File details

Details for the file llamppl-0.2.3.tar.gz.

File metadata

Download URL: llamppl-0.2.3.tar.gz
Upload date: Apr 9, 2026
Size: 72.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llamppl-0.2.3.tar.gz
Algorithm	Hash digest
SHA256	`15b247ad3c0efaa5b5beb411d6ea342d2affae000eb63f7e22f850d01f32aeb5`
MD5	`9d59fdb53f3e6a533dd7eae041a0213a`
BLAKE2b-256	`9ae171d4e2a35b2b1f863dd40153d5933900e94b1bfb3bc6da46913ecc92bf0e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llamppl-0.2.3.tar.gz:

Publisher: release.yml on genlm/llamppl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llamppl-0.2.3.tar.gz
- Subject digest: 15b247ad3c0efaa5b5beb411d6ea342d2affae000eb63f7e22f850d01f32aeb5
- Sigstore transparency entry: 1262788245
- Sigstore integration time: Apr 9, 2026
Source repository:
- Permalink: genlm/llamppl@1b143b049ec183a2534538bed5e6605869b2ba2b
- Branch / Tag: refs/tags/v0.2.3
- Owner: https://github.com/genlm
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@1b143b049ec183a2534538bed5e6605869b2ba2b
- Trigger Event: release

File details

Details for the file llamppl-0.2.3-py3-none-any.whl.

File metadata

Download URL: llamppl-0.2.3-py3-none-any.whl
Upload date: Apr 9, 2026
Size: 29.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llamppl-0.2.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0dac877afafcfbb5a90c8ca4f5a1d39a22538b5f98af32efaaf82dd3693b93ff`
MD5	`dba88d1b58662c299553c3bb4612fe55`
BLAKE2b-256	`5a7c6bc8d180c08584ed2c4e7001f649a6763cdf2d7e2400bee97f7e64b94131`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llamppl-0.2.3-py3-none-any.whl:

Publisher: release.yml on genlm/llamppl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llamppl-0.2.3-py3-none-any.whl
- Subject digest: 0dac877afafcfbb5a90c8ca4f5a1d39a22538b5f98af32efaaf82dd3693b93ff
- Sigstore transparency entry: 1262788255
- Sigstore integration time: Apr 9, 2026
Source repository:
- Permalink: genlm/llamppl@1b143b049ec183a2534538bed5e6605869b2ba2b
- Branch / Tag: refs/tags/v0.2.3
- Owner: https://github.com/genlm
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@1b143b049ec183a2534538bed5e6605869b2ba2b
- Trigger Event: release

llamppl 0.2.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LLaMPPL

Installation

Local installation

Modeling with LLaMPPL

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance