A lightweight library of verified inference pipelines for HuggingFace models.

These details have not been verified by PyPI

Project links

Project description

bolo logo

Bolo: Curated, Verified, and Ready-to-run Inference Pipelines for HuggingFace Models

Bolo is a lightweight Python library that gives you curated, verified, and ready-to-run inference pipelines for HuggingFace models with ZERO efforts.

📦 Curated templates: every supported model ships with a tested Jinja2 inference template maintained by the Illinois CreateLab team.

🔒 Isolated venvs: each model runs inside its own uv-managed virtual environment, so dependency conflicts between models are impossible.

⚡ One-call API: bolo.pipeline(repo_id, device="cuda", ...) conducts inference in one API call, simpler than HuggingFace two-stages pipeline API.

🖥️ CLI included: a bolo command lets you manage venvs and run inference directly from the shell.

🌐 Auto-fetching templates: template bundles are downloaded on first use from the bolo-templates GitHub release and cached locally — no manual setup needed.

Quick demo

Install bolo:

pip install pybolo

Run inference with two lines of Python:

import bolo

result = bolo.pipeline("<REPO_ID>", device="cuda:0")
print(result)

Before running inference you can inspect what parameters the template accepts:

bolo.list_params("<REPO_ID>")

And manage venvs explicitly:

python_bin = bolo.create_a_venv("<REPO_ID>")
# ... activate the created venv and do inference ...
bolo.remove_venv("<REPO_ID>")

CLI

The bolo command mirrors the Python API from your shell.

Create an isolated venv for a model:

bolo create-venv <REPO_ID>
bolo create-venv <REPO_ID> --venv-path /path/to/venv
# then activate the venv

Run inference:

bolo run <REPO_ID> device=cuda:0

Pre-download the templates cache (optional, useful on air-gapped machines):

bolo fetch-templates

How does bolo work?

bolo separates what to run (the Jinja2 template) from where to run it (the model's isolated venv).

flowchart LR
    A[User calls bolo.pipeline] --> B[Fetch / load templates]
    B --> C[Render Jinja2 template\nwith user params]
    C --> D[Execute rendered script\ninside model venv]
    D --> E[Return RESULT]

Templates — stored in the bolo-templates release bundle. Each model folder contains a template.j2 (the inference script template) and a requirements.txt (the model's exact dependencies). Templates are downloaded once and cached at ~/.cache/bolo/templates/.
Venvs — created with uv venv + uv pip install -r requirements.txt. Every model gets its own venv so you can safely use models with conflicting PyTorch or CUDA versions side-by-side.
Rendering — template parameters are collected from the leading {% set key = default %} blocks in each template.j2. bolo list_params shows you every knob with its type and default value.
Execution — the rendered script is executed and its RESULT variable is returned to the caller.

Install from source

git clone https://github.com/illinoisdata/Bolo.git
cd Bolo
pip install -e .

Custom templates directory

Set BOLO_TEMPLATES_DIR to point bolo at your own templates folder:

export BOLO_TEMPLATES_DIR=/path/to/my/templates

Platforms

bolo is developed and tested on Red Hat Enterprise Linux, with CUDA version 12.8. So all dependencies (torch-related) assume the cu128.

Contributing

Contributions are welcome! Please open an issue or pull request on GitHub.

License

MIT — see LICENSE for details.

Acknowledgement

Thanks Jojo and her sister for designing the mascot.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Apr 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pybolo-0.1.0.tar.gz (11.0 kB view details)

Uploaded Apr 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pybolo-0.1.0-py3-none-any.whl (9.4 kB view details)

Uploaded Apr 30, 2026 Python 3

File details

Details for the file pybolo-0.1.0.tar.gz.

File metadata

Download URL: pybolo-0.1.0.tar.gz
Upload date: Apr 30, 2026
Size: 11.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for pybolo-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`3c43557b875dadce5f7c4e9f3f06cd21e74761348e8e859c50d65a33e2562fb0`
MD5	`e795979be47883fb9b0b3de83612bf3a`
BLAKE2b-256	`02cdd732a939270692c8e736a7691be21f0b60154f1c7f1ef1c9cd7492621172`

See more details on using hashes here.

File details

Details for the file pybolo-0.1.0-py3-none-any.whl.

File metadata

Download URL: pybolo-0.1.0-py3-none-any.whl
Upload date: Apr 30, 2026
Size: 9.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for pybolo-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d2770fd90b7c72ee9330281f340f8dc2666e0c9accf97a63dd66ca8b75ae8a0d`
MD5	`7549302ce14acbb5fa1c4496cbe9707b`
BLAKE2b-256	`04bf39cf0b9c16349d19b0e76fe11af5305e6aa134a18361e6201fb89d87d3cc`

See more details on using hashes here.

pybolo 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Bolo: Curated, Verified, and Ready-to-run Inference Pipelines for HuggingFace Models

Quick demo

CLI

How does bolo work?

Install from source

Custom templates directory

Platforms

Contributing

License

Acknowledgement

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes