Weave batch LLM jobs across OpenAI, Anthropic, and Google.

Loom: LLM Batch Processing Made Easy

Weave LLM jobs across OpenAI, Anthropic, Google, and OpenRouter — in batch or live.

1. Introduction

Loom is a small Python CLI that runs a dataset of prompts (JSON, CSV, or Parquet) through an LLM and merges the responses with the original rows. It has two modes:

Batch (loom run, default): submits the dataset to the provider's batch API and persists the batch id locally; you later call loom fetch to download and merge the results. Cheaper (50% off on OpenAI and Anthropic) but asynchronous: a batch can take up to 24 hours.
Sequential (loom run --sync): calls the chat-completion endpoint per prompt with a concurrent worker pool, writes the output file immediately, and uses an on-disk response cache.

It also ships a loom tokens command that uses each provider's token-counting API where available.

Supported providers

Provider	Batch (`loom run`)	Sequential (`loom run --sync`)	Token counter (`loom tokens`)
OpenAI	✓	✓	✗ — no remote API
Anthropic	✓	✓	✓
Google (Gemini)	✓	✓	✓
OpenRouter	✗	✓	✗ — no remote API

2. Getting started

Installation

pip install loom-batch

The PyPI package is loom-batch (the name loom was taken); the CLI command is loom.

From source, for hacking or running tests:

git clone https://github.com/jnehring/loom
cd loom
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

Tip: if you create the venv with uv venv, pip is not installed inside it. Use uv pip install -e ".[dev]" instead, or recreate the venv with stdlib python -m venv (see Troubleshooting).

Preparing the data

Loom accepts three input formats: JSON, CSV, and Parquet. JSON and CSV may also be gzip-compressed (.json.gz, .csv.gz). Compressed inputs are decompressed transparently; JSON and CSV outputs are always written uncompressed (.json / .csv). Parquet inputs produce .parquet output.

JSON — a list of {id, prompt} objects. The id is reused as the row key in the merged output.

[
  {"id": "task-001", "prompt": "Summarize the plot of Hamlet in one sentence."},
  {"id": "task-002", "prompt": "Translate 'Good morning' to French."}
]
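If the prompts live in Python rather than in a file, a few lines of standard-library code can produce a file in this shape. This is only a sketch: write_prompts is a hypothetical helper, and the task-NNN id scheme and the prompts.json file name are illustrative choices rather than anything Loom requires.

```python
import json
from pathlib import Path

def write_prompts(prompts, path):
    """Write prompts as a JSON list of {id, prompt} objects."""
    records = [
        {"id": f"task-{i:03d}", "prompt": text}  # id scheme is an assumption
        for i, text in enumerate(prompts, start=1)
    ]
    Path(path).write_text(
        json.dumps(records, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return records

records = write_prompts(
    [
        "Summarize the plot of Hamlet in one sentence.",
        "Translate 'Good morning' to French.",
    ],
    "prompts.json",
)
```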

CSV — any schema; Loom reads the prompt from the text column by default (override with --col). All original columns are preserved; a new llm_response column is appended.

id,text,priority
1,"Explain quantum physics in one paragraph",low
2,"Write a haiku about rust",high
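Before submitting a large CSV, it can be worth checking locally that the prompt column exists and has no empty cells. The following is a standard-library sketch; check_prompt_column is a hypothetical helper and not part of Loom, and Loom's own validation may behave differently.

```python
import csv
import io

def check_prompt_column(csv_text, col="text"):
    """Return the number of data rows; raise if the prompt column is missing or blank."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if col not in (reader.fieldnames or []):
        raise ValueError(f"column {col!r} not found; have {reader.fieldnames}")
    rows = list(reader)
    # Data rows are numbered from 1, not counting the header.
    blank = [i for i, row in enumerate(rows, start=1) if not (row[col] or "").strip()]
    if blank:
        raise ValueError(f"empty prompt in data rows {blank}")
    return len(rows)

sample = (
    "id,text,priority\n"
    '1,"Explain quantum physics in one paragraph",low\n'
    '2,"Write a haiku about rust",high\n'
)
n_rows = check_prompt_column(sample)
```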

Parquet — same semantics as CSV: reads the text column by default (override with --col). All original columns are preserved; llm_response is appended. Output is written as .parquet.

Submit a batch request

Minimal end-to-end run, passing the API key inline (see Storing API keys for cleaner options):

loom run --file prompts.json \
         --provider openai \
         --model gpt-4o-mini \
         --api-key sk-...
# -> Batch submitted. id=batch_abc123 provider=openai

# ...minutes or hours later...
loom fetch              # --all is the default; fetches every pending batch
# -> Fabric complete. id=batch_abc123 -> prompts_results_openai_gpt-4o-mini.json

The output is written next to the input as <name>_results_<provider>_<model>.<ext>. Forward slashes and other unsafe characters in the model id are replaced with underscores (e.g. openai/gpt-4o-mini → openai_gpt-4o-mini). For gzipped inputs the .gz is dropped — data.csv.gz → data_results_<provider>_<model>.csv. Override the path entirely with --output.
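The naming rule can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Loom's actual code: default_output_path is a hypothetical name, and the set of characters treated as unsafe (anything other than letters, digits, dot, underscore, and hyphen) is a guess.

```python
import re
from pathlib import Path

def default_output_path(input_path, provider, model):
    """Derive <name>_results_<provider>_<model>.<ext> next to the input file."""
    p = Path(input_path)
    name = p.name
    if name.endswith(".gz"):
        name = name[:-3]  # data.csv.gz -> data.csv
    stem, ext = name.rsplit(".", 1)  # assumes the input has an extension
    # Assumption: which characters count as unsafe in a model id.
    safe_model = re.sub(r"[^A-Za-z0-9._-]", "_", model)
    return p.with_name(f"{stem}_results_{provider}_{safe_model}.{ext}")

plain = default_output_path("prompts.json", "openai", "gpt-4o-mini")
gzipped = default_output_path("data.csv.gz", "openrouter", "openai/gpt-4o-mini")
```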

3. Usage

Command-line reference

`loom run`

Submit a dataset as a batch job (default) or run it synchronously with --sync.

Flag	Default	Description
`--file`, `-f`	required	Input `.json`, `.csv`, `.parquet`, `.json.gz`, or `.csv.gz`.
`--provider`, `-p`	required	`openai`, `anthropic`, `google`, or `openrouter`.
`--model`, `-m`	required	Provider-specific model id (e.g. `gpt-4o-mini`, `claude-3-5-sonnet-latest`, `gemini-2.0-flash`, `openai/gpt-4o-mini`).
`--col`, `-c`	`text`	Prompt column name (CSV and Parquet).
`--api-key`	env / `.env`	Override the resolved API key for this run.
`--output`, `-o`	`<input>_results_<provider>_<model>.<ext>`	Custom output file path.
`--sync` / `--batch`	`--batch`	`--sync` calls the provider per prompt and writes the output immediately. `--batch` uses the provider's batch API.
`--workers`, `-w`	`8`	Concurrent workers in `--sync` mode.
`--no-cache`	off	Disable the on-disk response cache (`--sync` only).
`--force`	off	Overwrite an existing output file without prompting (`--sync` only).
`--with-meta`	off	Add `llm_provider` and `llm_model` columns (CSV/Parquet) or fields (JSON) to the output, alongside `llm_response`.

OpenRouter has no batch API; using --provider openrouter without --sync exits with a helpful error.

`loom fetch`

Poll the provider, download results, merge into the output file.

Flag	Default	Description
`--id`, `-i`	—	Fetch a single batch by id. If set, implies `--no-all`.
`--all` / `--no-all`, `-a`	`--all`	Process every pending batch under `~/.loom/batches/`. This is the default — `loom fetch` with no args walks all batches.
`--api-key`	env / `.env`	Override the resolved API key.
`--keep`, `-k`	off	Keep the metadata file in `~/.loom/batches/` after a successful fetch (default: delete it).
`--force`	off	Overwrite existing output files without prompting.

For pending batches, loom fetch prints the current status and a one-sentence explanation. The full set of possible statuses:

Status	Meaning
`validating`	Provider has accepted the batch and is queueing/preparing it; no work has started yet.
`in_progress`	Provider is actively running the prompts; check back later.
`completed`	All prompts finished and results were downloaded — the merged output file has been written.
`failed`	Provider reported the batch as failed; results are not available.
`expired`	Batch exceeded the provider's time limit (typically 24h) before completing.
`cancelled`	Batch was cancelled — either by you on the provider's dashboard, or by the provider itself.
`unknown`	The last fetch attempt raised an error (invalid id, auth failure, network glitch, or an API response Loom doesn't recognise). Re-run `loom fetch` to retry; if it persists, inspect the metadata file under `~/.loom/batches/`.

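validating and in_progress are the non-terminal states: loom fetch picks the batch up again on the next run. unknown is also retried on the next run, because it records a failed fetch attempt rather than an outcome reported by the provider. The remaining states are terminal: completed means the output file is on disk, and failed / expired / cancelled mean no merge happened.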
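For scripts that wrap loom fetch, the status table can be reduced to a small lookup. The sketch below is not taken from Loom: next_action and the action labels are hypothetical, unknown is treated as retryable as the table describes, and "resubmit" for failed, expired, and cancelled batches is a suggestion only.

```python
PENDING = {"validating", "in_progress"}          # provider still working
NO_RESULTS = {"failed", "expired", "cancelled"}  # terminal, nothing merged

def next_action(status):
    """Map a batch status to what a wrapper script might do next."""
    if status in PENDING:
        return "wait"       # run loom fetch again later
    if status == "unknown":
        return "retry"      # the fetch itself failed; try again
    if status == "completed":
        return "done"       # merged output file is on disk
    if status in NO_RESULTS:
        return "resubmit"   # assumption: start a new batch
    raise ValueError(f"unrecognised status: {status!r}")

actions = {s: next_action(s) for s in ("validating", "completed", "expired", "unknown")}
```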

`loom list`

List every batch known to Loom, with last-seen status, model, and source file. No flags.

`loom tokens`

Count input tokens for every prompt using the provider's token-counting API. See Token counter.

Flag	Default	Description
`--file`, `-f`	required	Input `.json`, `.csv`, `.parquet`, `.json.gz`, or `.csv.gz`.
`--provider`, `-p`	required	`openai`, `anthropic`, `google`, or `openrouter`.
`--model`, `-m`	required	Provider-specific model id.
`--col`, `-c`	`text`	Prompt column name (CSV and Parquet).
`--api-key`	env / `.env`	Override the resolved API key.
`--workers`, `-w`	`8`	Concurrent workers.

`loom cache clear`

Delete every cached response under ~/.loom/cache/. See Caching.

Flag	Default	Description
`--yes`, `-y`	off	Skip the confirmation prompt.

Batch vs sequential

	`loom run` (batch, default)	`loom run --sync` (sequential)
Latency	Up to 24h	Real-time
Pricing (OpenAI, Anthropic)	50% off	Standard
Steps	`run` → wait → `fetch`	Single command
Cache	n/a	On-disk, on by default
OpenRouter	✗	✓ (only mode)
State on disk	`~/.loom/batches/`	None (cache only)

Pick batch when you have a large dataset and don't care about wall-clock time. Pick sync when you want results now, or when the provider has no batch API (OpenRouter).
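The per-prompt worker pool of the sequential mode can be pictured with a short standard-library sketch. Whether Loom uses threads internally is not stated here, so treat this as an assumption; fake_generate is a stand-in for a real chat-completion call and run_sync is a hypothetical name.

```python
from concurrent.futures import ThreadPoolExecutor

def fake_generate(prompt):
    """Stand-in for one chat-completion call; a real run would hit the provider here."""
    return prompt.upper()

def run_sync(prompts, workers=8):
    """Run one call per prompt across a pool of workers, keeping input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in input order even if calls finish out of order.
        return list(pool.map(fake_generate, prompts))

responses = run_sync(["hello", "world"])
```

Because results come back in input order, they can be written straight into the llm_response column without any re-sorting.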

Storing API keys

Loom resolves keys in this order: --api-key flag → environment variable → .env file in the current working directory (loaded via python-dotenv, does not overwrite existing env vars).

Recognised environment variables:

OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=...
OPENROUTER_API_KEY=sk-or-...
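The resolution order can be expressed as a small function. This is a sketch under stated assumptions rather than Loom's implementation: resolve_api_key is a hypothetical name, and the .env contents are passed in as an already-parsed dict so that the example needs only the standard library.

```python
import os

ENV_VARS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "google": "GOOGLE_API_KEY",
    "openrouter": "OPENROUTER_API_KEY",
}

def resolve_api_key(provider, flag_value=None, environ=None, dotenv=None):
    """Return the first key found: --api-key flag, then environment, then .env."""
    environ = os.environ if environ is None else environ
    dotenv = {} if dotenv is None else dotenv  # assumption: parsed .env values
    var = ENV_VARS[provider]
    for candidate in (flag_value, environ.get(var), dotenv.get(var)):
        if candidate:
            return candidate
    raise KeyError(f"no API key for {provider}; set {var} or pass --api-key")

key = resolve_api_key(
    "openai",
    environ={"OPENAI_API_KEY": "sk-env"},
    dotenv={"OPENAI_API_KEY": "sk-file"},
)
```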

A .env in the working directory is the friction-free option for daily use; --api-key is handy for one-offs or shared workstations.

Caching

In --sync mode, Loom caches every response under ~/.loom/cache/. The cache key is sha256("<provider>|<model>|<prompt>"), so changing any of those misses the cache. There is no TTL or eviction — the cache grows monotonically until you clear it.

loom run --sync -p openai -m gpt-4o-mini -f data.csv -c text   # first run: API calls
loom run --sync -p openai -m gpt-4o-mini -f data.csv -c text   # second run: 100% cache hits
loom run --sync -p openai -m gpt-4o-mini -f data.csv -c text --no-cache  # bypass
loom cache clear                                                # wipe ~/.loom/cache/

loom run --sync reports cache hits live in its progress bar.
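To check by hand whether a given prompt would hit the cache, the key can be recomputed from the formula above. A minimal sketch, assuming the string is encoded as UTF-8 before hashing (the encoding is not stated here); cache_key is a hypothetical helper name.

```python
import hashlib

def cache_key(provider, model, prompt):
    """sha256 over "<provider>|<model>|<prompt>", returned as a hex digest."""
    raw = f"{provider}|{model}|{prompt}"
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()  # UTF-8 is an assumption

key = cache_key("openai", "gpt-4o-mini", "Write a haiku about rust")
other = cache_key("openai", "gpt-4o", "Write a haiku about rust")
```

The two keys differ because the model differs, which is why switching models triggers fresh API calls.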

Token counter

loom tokens --file prompts.json --provider anthropic --model claude-3-5-sonnet-latest
# Counting tokens ████████░░░░  340/1000  est_total≈36,210  errors=0  0:01:12  eta 0:02:35
# -> Total input tokens: 12,345 across 100 prompts (provider=anthropic, model=claude-3-5-sonnet-latest, errors=0)

loom tokens calls each provider's official count-tokens endpoint, one prompt at a time, with a concurrent worker pool. The live progress bar shows:

done/total prompts processed,
est_total — running estimate of the final input-token count, computed as the mean tokens-per-prompt-so-far multiplied by total (refines as more prompts complete),
errors,
elapsed time and eta (estimated time remaining, based on the current rate).
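The est_total figure described above is simple arithmetic: the mean token count of the prompts counted so far, scaled to the full dataset. A sketch with made-up numbers; estimate_total is a hypothetical name and the rounding is an assumption.

```python
def estimate_total(tokens_so_far, total_prompts):
    """Mean tokens per completed prompt, multiplied by the total number of prompts."""
    if not tokens_so_far:
        return 0  # nothing counted yet, so no estimate
    mean = sum(tokens_so_far) / len(tokens_so_far)
    return round(mean * total_prompts)

# Three prompts counted so far (100, 120, 110 tokens) out of 1000 in total.
estimate = estimate_total([100, 120, 110], total_prompts=1000)  # 110 * 1000 = 110000
```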

Provider	Endpoint	Available
Anthropic	`client.messages.count_tokens(...)` → `input_tokens`	✓
Google	`client.models.count_tokens(...)` → `total_tokens`	✓
OpenAI	—	✗ (no remote API; use `tiktoken` locally)
OpenRouter	—	✗

For unsupported providers, loom tokens prints "Token counting not available: ..." and exits with code 2.

Where Loom stores state

~/.loom/
├── batches/        # one <provider>_<batch_id>.json per pending or kept batch
└── cache/          # one <sha256>.json per cached --sync response

~/.loom/batches/<provider>_<safe_id>.json is created by loom run (batch mode) and contains batch_id, provider, model, original_file_path, file_type, the prompt column, an id_map mapping internal custom_id → original row id, created_at, and the last-seen status. loom fetch updates status, downloads results, and (unless --keep is passed) deletes the file on success.
~/.loom/cache/<sha256>.json is the response cache used by --sync. Each file holds {provider, model, response, created_at}.

Both directories can be deleted by hand, with different consequences: cache/ simply rebuilds itself on the next --sync run, whereas deleting batches/ orphans any in-flight batch jobs (they still complete on the provider's side, you just lose Loom's view of them).
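Before deleting anything under batches/, the metadata files can be inspected with a short script. This is a sketch: list_batches is a hypothetical helper, the batch_id, provider, and model field names come from the description above, and the "status" key name is an assumption.

```python
import json
from pathlib import Path

def list_batches(batches_dir=None):
    """Return (batch_id, provider, model, status) for each metadata file."""
    d = Path(batches_dir) if batches_dir else Path.home() / ".loom" / "batches"
    if not d.is_dir():
        return []
    rows = []
    for path in sorted(d.glob("*.json")):
        meta = json.loads(path.read_text(encoding="utf-8"))
        rows.append((
            meta.get("batch_id"),
            meta.get("provider"),
            meta.get("model"),
            meta.get("status"),  # assumption: key name for the last-seen status
        ))
    return rows

for row in list_batches():
    print(*row, sep="\t")
```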

Troubleshooting

ModuleNotFoundError: No module named 'pandas' right after pip install -e ".[dev]"
Your .venv was probably created with uv venv, which doesn't install pip inside it, so pip install ran against the system or conda pip and installed the packages elsewhere. Fix with uv pip install -e ".[dev]", or recreate the venv with python -m venv .venv && source .venv/bin/activate && pip install -e ".[dev]".

which loom shows /opt/miniconda3/bin/loom even after source .venv/bin/activate
Conda's bin directory is being prepended to PATH after the venv is activated, so it wins the lookup. Either reorder your shell init, or call the venv binary directly: ./.venv/bin/loom <cmd>.

Google batch results mapped to the wrong rows
Google's batch API can return inlined responses out of submission order (especially for 100+ requests). Loom matches each response using metadata.custom_id from the request, not list position. If you see mismatched results from an older Loom version, upgrade (pip install -U loom-batch) and re-submit the batch. Requires google-genai>=1.61.0, which restores metadata on batch responses.
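The difference between matching by id and matching by list position can be shown with a toy merge. This is a sketch only: merge_by_custom_id is a hypothetical helper, and the response shape (dicts with custom_id and text keys) is invented for illustration and is not the Google SDK's actual response type.

```python
def merge_by_custom_id(rows, id_map, responses):
    """Attach each response to its row via custom_id, ignoring response order."""
    # id_map: internal custom_id -> original row id
    by_row_id = {id_map[r["custom_id"]]: r["text"] for r in responses}
    return [{**row, "llm_response": by_row_id.get(row["id"])} for row in rows]

rows = [{"id": "task-001", "prompt": "a"}, {"id": "task-002", "prompt": "b"}]
id_map = {"req-0": "task-001", "req-1": "task-002"}
# Responses arrive in reverse order, as an out-of-order batch might.
responses = [
    {"custom_id": "req-1", "text": "answer to b"},
    {"custom_id": "req-0", "text": "answer to a"},
]
merged = merge_by_custom_id(rows, id_map, responses)
```

A positional merge would have paired task-001 with "answer to b"; the id-based merge does not.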

Google batch Invalid batch job name: jqpem7...
The stored batch_id is missing the required batches/ prefix (an older Loom version stripped it). Edit ~/.loom/batches/google_<id>.json, change "batch_id": "<id>" to "batch_id": "batches/<id>", and rename the file to google_batches_<id>.json so the on-disk filename and the in-file id stay consistent.
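If several metadata files are affected, the manual edit can be scripted. The helper below is a hypothetical sketch of the two steps described above (prefix the id, rename the file); it is not shipped with Loom, and it is worth backing up ~/.loom/batches/ before running anything like it.

```python
import json
from pathlib import Path

def add_batches_prefix(meta_path):
    """Prefix batch_id with "batches/" and rename the file to match."""
    meta_path = Path(meta_path)
    meta = json.loads(meta_path.read_text(encoding="utf-8"))
    batch_id = meta["batch_id"]
    if batch_id.startswith("batches/"):
        return meta_path  # already in the expected form, nothing to do
    meta["batch_id"] = f"batches/{batch_id}"
    new_path = meta_path.with_name(f"google_batches_{batch_id}.json")
    new_path.write_text(json.dumps(meta, indent=2), encoding="utf-8")
    meta_path.unlink()  # remove the old file only after the new one is written
    return new_path
```

Calling it a second time on the returned path is a no-op, so it is safe to run over every google_*.json file in the directory.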

status=unknown in loom fetch
The previous fetch attempt raised an exception (bad id, network blip, expired key, or an API response Loom doesn't recognise). Re-running loom fetch retries; if it persists, run with the provider's SDK directly to surface the underlying error.

Error: OpenRouter has no batch API
OpenRouter doesn't offer batch processing. Re-run with --sync.

4. Developer instructions

Repository layout

loom/
  main.py                       # CLI entry point (Typer commands)
  core/
    orchestrator.py             # run_batch, fetch_batch, generate_sync, count_tokens
    models.py                   # Pydantic models, ProviderName, BatchStatus
  eval/
    eval_providers.py           # Provider evaluation script (init / fetch)
  providers/
    base.py                     # Batch provider ABC (submit/check_status/download)
    sync_base.py                # Sync provider ABC (generate/count_tokens)
    openai.py, anthropic.py,
    google.py                   # Batch implementations
    openai_sync.py, anthropic_sync.py,
    google_sync.py, openrouter_sync.py   # Sync implementations
  utils/
    converters.py               # Load / merge JSON, CSV & Parquet
    storage.py                  # ~/.loom/batches/ persistence
    cache.py                    # ~/.loom/cache/ response cache
    keys.py                     # API-key resolution
tests/                          # pytest suite
.github/workflows/              # CI: test.yml, publish.yml
pyproject.toml                  # Dependencies and package metadata

Running unit tests

pip install -e ".[dev]"
pytest                 # quiet
pytest -v              # verbose
pytest tests/test_converters.py     # one file
pytest tests/test_storage.py::test_save_and_load_roundtrip   # one test

Running provider evaluations

Loom includes a provider-level evaluation script that tests both the synchronous and batch APIs for all supported providers, using a small dataset of 3 prompts with predictable single-word outputs. It needs configured API keys and incurs real API costs.

# 1. Initialize evaluation: test sync APIs and submit batch jobs (default: all providers)
python -m loom.eval.eval_providers init

# Alternatively, initialize for a single provider (e.g. google, openai, or anthropic)
python -m loom.eval.eval_providers init google

# 2. Fetch evaluation results: check batch statuses and download/validate results (default: all providers)
python -m loom.eval.eval_providers fetch

# Alternatively, fetch for a single provider only
python -m loom.eval.eval_providers fetch google

This runs against live provider APIs. Configure your API keys (e.g. OPENAI_API_KEY, GOOGLE_API_KEY, ANTHROPIC_API_KEY) in your environment or a .env file before running. Any provider without a configured API key will be skipped automatically.

GitHub Actions

.github/workflows/test.yml — runs on every push and PR, with a matrix over Python 3.10 / 3.11 / 3.12. Installs the project with pip install -e ".[dev]" and runs pytest -v.
.github/workflows/publish.yml — manual release workflow (workflow_dispatch). Pick patch, minor, or major, and it bumps pyproject.toml + loom/__init__.py, runs tests, builds an sdist + wheel, commits and tags the release, creates a GitHub Release, and uploads to PyPI via OIDC Trusted Publishing — no PyPI token is stored in repo secrets.

Releasing

Open Actions → publish → Run workflow on main.
Choose patch, minor, or major (e.g. 0.1.0 → 0.1.1 / 0.2.0 / 1.0.0).
The workflow bumps the version, runs tests, builds, commits Release vX.Y.Z, pushes the tag, creates a GitHub Release, and publishes to PyPI.

The version bump is only pushed if tests and the build succeed.

5. License

MIT — see LICENSE.

Release history

0.3.0 (this version): Jun 29, 2026
0.2.1: Jun 16, 2026
0.2: Jun 13, 2026
0.1.0: May 24, 2026

Download files

Source Distribution: loom_batch-0.3.0.tar.gz (40.6 kB), uploaded Jun 29, 2026
Built Distribution: loom_batch-0.3.0-py3-none-any.whl (36.5 kB), uploaded Jun 29, 2026, Python 3

File details

Details for the file loom_batch-0.3.0.tar.gz.

File metadata

Download URL: loom_batch-0.3.0.tar.gz
Upload date: Jun 29, 2026
Size: 40.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loom_batch-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`3d1906a01967a60e3a55cdf68a87ef386cb4afd3b2f551191fc7ed11b4d723ad`
MD5	`a34e93da108cabd002bb9974d900fc26`
BLAKE2b-256	`307be5d0fa7c75310a9c4a3cd1f1359016df679be8a00b3748e4fd20fb8b5797`

Provenance

The following attestation bundles were made for loom_batch-0.3.0.tar.gz:

Publisher: publish.yml on jnehring/loom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: loom_batch-0.3.0.tar.gz
- Subject digest: 3d1906a01967a60e3a55cdf68a87ef386cb4afd3b2f551191fc7ed11b4d723ad
- Sigstore transparency entry: 2006965412
- Sigstore integration time: Jun 29, 2026
Source repository:
- Permalink: jnehring/loom@53f65de02f799a8beda89d5a53079bdef3f82b0b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jnehring
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@53f65de02f799a8beda89d5a53079bdef3f82b0b
- Trigger Event: workflow_dispatch

File details

Details for the file loom_batch-0.3.0-py3-none-any.whl.

File metadata

Download URL: loom_batch-0.3.0-py3-none-any.whl
Upload date: Jun 29, 2026
Size: 36.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loom_batch-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f934577620b6d6d538867e63da667ece0e2d44ddb10561389c40193615e7638c`
MD5	`f6abeb21fd393f7606e12357333add3a`
BLAKE2b-256	`d36b2093d8476453a0bbffb70b1ae92ec7ed89682513e63c1da7b5a5b5269cb8`

Provenance

The following attestation bundles were made for loom_batch-0.3.0-py3-none-any.whl:

Publisher: publish.yml on jnehring/loom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: loom_batch-0.3.0-py3-none-any.whl
- Subject digest: f934577620b6d6d538867e63da667ece0e2d44ddb10561389c40193615e7638c
- Sigstore transparency entry: 2006965501
- Sigstore integration time: Jun 29, 2026
Source repository:
- Permalink: jnehring/loom@53f65de02f799a8beda89d5a53079bdef3f82b0b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jnehring
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@53f65de02f799a8beda89d5a53079bdef3f82b0b
- Trigger Event: workflow_dispatch
