Lightweight local-first job queue manager and Ollama wrapper for resource-constrained environments.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Hoglah

Hoglah is a lightweight, local-first job queue manager and Ollama wrapper designed for resource-constrained environments.

It lets applications submit LLM inference requests (generate or chat) asynchronously, receive a job ID immediately, monitor progress, retrieve full results, and receive completion callbacks — even when the underlying hardware can only run one (or very few) model inferences at a time.

Named after one of the daughters of Zelophehad (Numbers 26/27/36, Joshua 17), continuing the Old Testament women's names pattern used by sister projects in the domains family (Mahalath, Tirzah, etc.).

Core Value Proposition

Simple Python-native interface for internal/library use
Reliable queuing with durable persistence (survives restarts)
Smart handling of context windows and model capabilities
Fire-and-forget + callback patterns for workflow orchestration
Fully local, privacy-focused, zero-cloud dependency
Extensible foundation (web API / webhooks / distributed backends planned for later versions)

Target users: Developers building multi-agent systems, background task processors, or local AI tooling that needs to safely queue and manage LLM calls.

Goals (V1)

Clean, reliable abstraction over Ollama for queuing
Configurable concurrency (default: 1 for low-resource setups)
Model discovery, context calibration, and basic resource awareness
Easy integration into existing Python applications
Persistent job state across process restarts
Keep V1 simple, focused, and production-ready for local use

Non-Goals (V1)

Full distributed orchestration or high-availability clustering
Built-in web UI (deferred to V2)
Advanced authentication / multi-tenancy
Non-Ollama backends
Real-time streaming UI surfaces (file + callback sufficient)

Installation

From PyPI (recommended)

pip install hoglah
# With the CLI
pip install "hoglah[cli]"

Hoglah is published on PyPI: https://pypi.org/project/hoglah/

From GitHub Releases (no PyPI needed)

Every vX.Y.Z tag publishes a GitHub Release with a wheel + sdist:

# Latest wheel
pip install "hoglah[cli] @ https://github.com/gellsmore-svg/hoglah/releases/latest/download/hoglah-0.2.2-py3-none-any.whl"

# Or a specific version
pip install "hoglah[cli] @ https://github.com/gellsmore-svg/hoglah/releases/download/v0.2.2/hoglah-0.2.2-py3-none-any.whl"

From source (for development)

git clone https://github.com/gellsmore-svg/hoglah
cd hoglah
python -m venv .venv
.venv/bin/pip install -e ".[dev,cli]"

Maintainers: releasing

Releases are automated. Pushing a vX.Y.Z tag runs .github/workflows/release.yml, which builds the wheel + sdist, creates the GitHub Release, and publishes to PyPI via OIDC trusted publishing (no API token stored). PyPI trusted publishing is already configured for this repo (publisher: gellsmore-svg/hoglah, workflow release.yml, no environment). So a release is just:

# bump version in pyproject.toml + update CHANGELOG, commit, then:
git tag vX.Y.Z && git push origin vX.Y.Z

Quick Start (Planned)

Once implemented:

git clone https://github.com/gellsmore-svg/hoglah
cd hoglah
python -m venv .venv && .venv/bin/pip install -e ".[dev,cli]"

from hoglah import Hoglah

h = Hoglah()  # or Hoglah(config_path="...")

job_id = h.submit(
    prompt="Explain the significance of Hoglah in the biblical land allotment.",
    model="gemma3:1b",
    tags=["research", "bible"],
    callback=lambda result: print("Done:", result.job_id, result.output[:100]),
)

print("Submitted:", job_id)
print(h.status(job_id))

result = h.wait(job_id, timeout=120)
print(result.output)

# Recommended: context manager for auto cleanup of the background worker
with Hoglah() as h:
    job_id = h.submit(prompt="...", model="gemma3:1b")
    print(h.wait(job_id).output)

CLI:

hoglah submit "Explain Hoglah" --model gemma3:1b --wait
hoglah list --status completed
hoglah ps --json                 # alias for list, machine-readable
hoglah stats --json              # queue overview (counts by status)
hoglah info --json               # config + adapter + log_level + stats snapshot
hoglah show gemma3:1b --json     # model details (context, template, etc.)
hoglah clear --status completed --older-than 7 --yes  # prune old jobs
hoglah rm <job-id> --yes  # remove specific job
hoglah wait <job-id> --timeout 60 --json  # block until done, machine readable
hoglah doctor --real  # diagnose setup and real Ollama/llama.cpp connectivity
hoglah status <job-id> --json

## V1 Scope

Hoglah 0.2.1 implements the full V1 specification from `docs/requirements-v1.0.md` and `docs/project-brief.md`.

**Included (V1):**
- Submit (prompt or messages/chat), immediate UUID.
- Status, get result (with output, usage, timings, metadata, parent, **truncated** reporting + effective_num_ctx).
- List (status, tags, **parent_job_id** filters; rich human + --json with preview).
- Cancel (best-effort).
- Wait (standalone or via submit --wait).
- rm / clear (per-job or bulk by status/age).
- info / stats (config, adapter, queue overview).
- Models: list + show (details, context size, template, family).
- pull (auto on real submit, or explicit).
- run (foreground worker).
- In-process callbacks (direct + named registry for restart re-delivery).
- Restart recovery (interrupted jobs + callback re-delivery).
- Pluggable adapters (safe Stub default + real Ollama with auto-pull, model-aware context, truncation via done_reason).
- Configurable concurrency (default 1), log_level, db, ollama host.
- Full submit surface (temperature, top_p/k, num_ctx, format, keep_alive, metadata, parent, etc.).
- Persistence (SQLite), context manager, --json everywhere.

**Explicitly not in V1 (per non-goals):**
- Web UI / HTTP server (V2).
- Webhooks / callback_url.
- Distributed / multi-node.
- Non-Ollama backends.
- Complex dependency graph execution (parent_job_id is for traceability only; no automatic waiting/fan-out).
- Real-time streaming UI (polling wait + final callbacks sufficient).

See the full requirements review and V1 completeness note in `.restart.md`.

You can also run the packaged install smoke test after installing the wheel:
```bash
python scripts/test_packaged_install.py

To validate with your working local Ollama (full real adapter paths including show, pull, context auto-detect):

RUN_OLLAMA_TESTS=1 python scripts/test_packaged_install.py
# or
HOGLAH_USE_REAL_ADAPTER=1 python scripts/test_packaged_install.py

Real Ollama / llama.cpp: Opt-in via use_real=True / HOGLAH_USE_REAL_ADAPTER=1 / --real. The "real" adapter talks to Ollama (which uses llama.cpp for inference).

Real-Ollama validation status: v0.2.2 has been validated end-to-end against a live Ollama (submit → worker → real inference, plus the gated integration test and the packaged-wheel smoke test in real mode). To reproduce on your machine:

python3 -m venv /tmp/hoglah-validate
/tmp/hoglah-validate/bin/pip install "hoglah[cli]"            # from PyPI
RUN_OLLAMA_TESTS=1 /tmp/hoglah-validate/bin/python scripts/test_packaged_install.py

# Or the gated integration test from a source checkout
RUN_OLLAMA_TESTS=1 python -m pytest tests/test_worker_execution.py::test_real_ollama_adapter_end_to_end -q -s

WSL2 note: if Ollama runs as the Windows binary and your code runs in WSL, the daemon is not reachable at localhost over HTTP. Set OLLAMA_HOST=0.0.0.0 on the Windows side (setx OLLAMA_HOST "0.0.0.0", then restart Ollama) and point the client at the WSL2 gateway IP, e.g.:

OLLAMA_HOST="http://$(ip route show default | awk '{print $3}'):11434" \
  RUN_OLLAMA_TESTS=1 python scripts/test_packaged_install.py

hoglah cancel hoglah models hoglah run --real # foreground worker using real Ollama


By default `hoglah` and `Hoglah()` use the safe stub adapter (no LLM calls). Use `--real` (CLI) or pass `adapter=OllamaAdapter(...)` (library) when you want actual inference.

`hoglah --version` / `-V` and `hoglah version` are supported. Use `with Hoglah(...) as h:` for automatic cleanup.

CLI now also includes `hoglah ps` (list alias) and `--json` output on list/ps/status/models. `hoglah submit` supports `--metadata` (JSON) and `--parent-job-id`. Real integration tests are gated behind `RUN_OLLAMA_TESTS=1`.

See `docs/requirements-v1.0.md` for the full initial specification.

## Submit API (Initial Draft)

```python
job_id = hoglah.submit(
    prompt: str | None = None,                    # or messages for chat
    messages: list[dict] | None = None,           # OpenAI-style chat history
    model: str,                                   # e.g. "gemma:7b", "mistral"
    system_prompt: str | None = None,
    num_ctx: int | None = None,                   # Context window size
    options: dict | None = None,                  # Passthrough for llama.cpp params
    callback: Callable[[JobResult], None] | None = None,  # Python callable
    callback_url: str | None = None,              # V2: HTTP webhook
    tags: list[str] | None = None,
    priority: int = 0,                            # Higher = earlier
    timeout_seconds: int | None = None,
    max_retries: int = 2,
    metadata: dict | None = None,                 # User-defined data
    parent_job_id: str | None = None,             # For chaining/dependencies
    temperature: float | None = None,
    top_p: float | None = None,
    top_k: int | None = None,
    repeat_penalty: float | None = None,
    seed: int | None = None,                      # Reproducibility
    stop: list[str] | None = None,                # Stop sequences
    num_predict: int | None = None,               # Max output tokens
    format: str | None = None,                    # e.g. "json"
    keep_alive: str | int | None = None,
    # ... full options dict covers the rest
)

Current Status

2026-06-12 (updated): Core implementation complete (Chunks 1-3 + follow-on polish).

Full durable queue + background asyncio worker (concurrency=1 default)
Pluggable adapters: StubAdapter (default, safe) + OllamaAdapter (real, opt-in via use_real=True or --real)
Hoglah(use_real=True) convenience + HOGLAH_USE_REAL_ADAPTER env var
Submit (prompt or messages/chat), rich generation params, status, get, list, cancel, wait, named+direct callbacks
Restart recovery (interrupted jobs + callback re-delivery)
Truncation metadata always surfaced (never fails the job)
CLI: list, status, cancel, submit (with --messages, --temperature, --num-ctx etc.), run, models, version
examples/basic_usage.py demonstrating the common patterns
26 passing tests (+1 gated real-Ollama test that passes against a live server); the default suite needs no Ollama (stub adapter).

See docs/requirements-v1.0.md, docs/architecture-decisions.md, and .restart.md for history and how to continue.

See sister domains for style and quality references:

Architecture Sketch (Early)

Client library (Hoglah or similar) for submit / status / wait / list / cancel
SQLite-backed job store (jobs table + results / events)
Worker loop (thread or task) with concurrency semaphore
Ollama adapter (generate + chat paths, model info)
In-process callback dispatch after completion
CLI entrypoint for inspection and operations
Config via constructor + env + small config file

Full details will evolve in docs/architecture-decisions.md and implementation docs.

License

Apache 2.0 — see LICENSE.

Contributing

See CONTRIBUTING.md.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gellsmore

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.0

Jun 15, 2026

0.6.0

Jun 15, 2026

0.5.1

Jun 15, 2026

0.5.0

Jun 15, 2026

0.4.1

Jun 15, 2026

0.4.0

Jun 15, 2026

This version

0.3.3

Jun 15, 2026

0.3.1

Jun 14, 2026

0.3.0

Jun 13, 2026

0.2.2

Jun 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hoglah-0.3.3.tar.gz (58.2 kB view details)

Uploaded Jun 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hoglah-0.3.3-py3-none-any.whl (45.0 kB view details)

Uploaded Jun 15, 2026 Python 3

File details

Details for the file hoglah-0.3.3.tar.gz.

File metadata

Download URL: hoglah-0.3.3.tar.gz
Upload date: Jun 15, 2026
Size: 58.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for hoglah-0.3.3.tar.gz
Algorithm	Hash digest
SHA256	`20cc00c21eb6144890e8c061068246d2b83f5c1e5f10defdf0f106ef75ba423d`
MD5	`b07d9b14146fb07309194ec740ba9fef`
BLAKE2b-256	`76f1ce041e7059a6cd146bed41d3f05c5801331bbab6f5511ed6601ed420ddbd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hoglah-0.3.3.tar.gz:

Publisher: release.yml on gellsmore-svg/hoglah

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hoglah-0.3.3.tar.gz
- Subject digest: 20cc00c21eb6144890e8c061068246d2b83f5c1e5f10defdf0f106ef75ba423d
- Sigstore transparency entry: 1823165931
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: gellsmore-svg/hoglah@cb726cf66553403098b85433f01424d16a246fb6
- Branch / Tag: refs/tags/v0.3.3
- Owner: https://github.com/gellsmore-svg
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@cb726cf66553403098b85433f01424d16a246fb6
- Trigger Event: push

File details

Details for the file hoglah-0.3.3-py3-none-any.whl.

File metadata

Download URL: hoglah-0.3.3-py3-none-any.whl
Upload date: Jun 15, 2026
Size: 45.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for hoglah-0.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`723c1011a4c16f7f7fe52864334e5b100e86bc4b877f5ad10158b2a34a2e77b0`
MD5	`295470c2955e3078be57c96c40713fa6`
BLAKE2b-256	`18ff077a2541feeaa536b49cf519ca6e0911dd586f0b9c4a7cab848bc2ad30a2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hoglah-0.3.3-py3-none-any.whl:

Publisher: release.yml on gellsmore-svg/hoglah

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hoglah-0.3.3-py3-none-any.whl
- Subject digest: 723c1011a4c16f7f7fe52864334e5b100e86bc4b877f5ad10158b2a34a2e77b0
- Sigstore transparency entry: 1823165968
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: gellsmore-svg/hoglah@cb726cf66553403098b85433f01424d16a246fb6
- Branch / Tag: refs/tags/v0.3.3
- Owner: https://github.com/gellsmore-svg
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@cb726cf66553403098b85433f01424d16a246fb6
- Trigger Event: push

hoglah 0.3.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Hoglah

Core Value Proposition

Goals (V1)

Non-Goals (V1)

Installation

From PyPI (recommended)

From GitHub Releases (no PyPI needed)

From source (for development)

Maintainers: releasing

Quick Start (Planned)

Current Status

Architecture Sketch (Early)

License

Contributing

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance