
Model distiller automator — recursively drives an LLM with seed prompts and stores compressed outputs in SQLite

Project description

llm-distiller recursively drives an LLM with seed prompts, deduplicates generated text with a Bloom filter, and stores compressed outputs in a SQLite database.

Running

usage: main.py [-h] [--prompt PROMPT] [--model MODEL] [--db DB] [--max-depth MAX_DEPTH] [--max-tokens MAX_TOKENS] [--compression-level {1,2,3,4,5,6,7,8,9}] [--seed SEED]
               [--bloom-size BLOOM_SIZE] [--bloom-hash-count BLOOM_HASH_COUNT] [--max-ngrams MAX_NGRAMS] [--no-color] [--retrieve-to-bloom] [--use-unsloth] [--api-url API_URL]
               [--api-key API_KEY] [--system-prompt SYSTEM_PROMPT] [--threads THREADS] [--secrets-file SECRETS_FILE] [--load-prompts-from-file LOAD_PROMPTS_FROM_FILE]
               [--api-hf-provider API_HF_PROVIDER] [--compression-algo COMPRESSION_ALGO] [--prompt-prefixes PROMPT_PREFIXES [PROMPT_PREFIXES ...]] [--batch-size BATCH_SIZE]
               [--remove-prompt] [--ngram-mode] [--min-tfidf-score MIN_TFIDF_SCORE] [--save-to-textfile SAVE_TO_TEXTFILE] [--q-mode] [--randomize-prompts] [--randomize-model-retry]
               [--randomize-remote-endpoint] [--strip-think-tag-form-prompt] [--exp-backoff] [--stream]

LLM Distiller with Bloom filter and SQLite storage.

options:
  -h, --help            show this help message and exit
  --prompt PROMPT       Root word or prompt to distill.
  --model MODEL         Hugging Face model name (default: distilgpt2).
  --db DB               Path to SQLite database (default: words/data.db).
  --max-depth MAX_DEPTH
                        Max recursion depth (default: 10).
  --max-tokens MAX_TOKENS
                        Max tokens (default: 1024).
  --compression-level {1,2,3,4,5,6,7,8,9}
                        Zlib compression level (1-9, default: 6).
  --seed SEED           Torch manual seed (optional).
  --bloom-size BLOOM_SIZE
                        Bloom filter size (default: 100,000,000).
  --bloom-hash-count BLOOM_HASH_COUNT
                        Bloom filter hash count (default: 6).
  --max-ngrams MAX_NGRAMS
                        Max ngrams (default: 10).
  --no-color            Disable colored output.
  --retrieve-to-bloom   Retrieve words from the database to the Bloom filter.
  --use-unsloth         Use Unsloth.
  --api-url API_URL     OpenAI-compatible API URL.
  --api-key API_KEY     API key for auth.
  --system-prompt SYSTEM_PROMPT
                        System prompt.
  --threads THREADS     Number of CPU threads for PyTorch (default: auto)
  --secrets-file SECRETS_FILE
                        Specify the secrets JSON file.
  --load-prompts-from-file LOAD_PROMPTS_FROM_FILE
                        Specify the prompts file.
  --api-hf-provider API_HF_PROVIDER
                        Specify the Hugging Face inference provider.
  --compression-algo COMPRESSION_ALGO
                        Specify the compression algorithm to use.
  --prompt-prefixes PROMPT_PREFIXES [PROMPT_PREFIXES ...]
                        List of prompt prefixes; each may contain spaces.
  --batch-size BATCH_SIZE
                        Number of prompts to process in parallel (default: 1)
  --remove-prompt       Remove the prompt from generation.
  --ngram-mode          Enable n-gram mode for generation.
  --min-tfidf-score MIN_TFIDF_SCORE
                        Specify the minimum TF-IDF score.
  --save-to-textfile SAVE_TO_TEXTFILE
                        Specify a text file to save generated text.
  --q-mode              Enable Q-mode.
  --randomize-prompts   Randomize prompts when read from file.
  --randomize-model-retry
                        Randomize the model on retry.
  --randomize-remote-endpoint
                        Randomize remote endpoint.
  --strip-think-tag-form-prompt
                        Strip the <think> and </think> tags from prompts.
  --exp-backoff         Enable exponential backoff.
  --stream              Enable streaming output.
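The --bloom-size and --bloom-hash-count options point at the classic Bloom-filter dedup scheme: each seen item sets hash-count bits in a large bit array, so membership tests are fast and memory-bounded at the cost of rare false positives. A minimal sketch of the idea, not the project's actual implementation (class name and salted-SHA-256 hashing are illustrative):

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: hash_count bit positions per item."""

    def __init__(self, size: int, hash_count: int):
        self.size = size
        self.hash_count = hash_count
        self.bits = bytearray((size + 7) // 8)

    def _positions(self, item: str):
        # Derive hash_count independent positions by salting one strong hash.
        for salt in range(self.hash_count):
            digest = hashlib.sha256(f"{salt}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(item))

# Skip words whose n-grams were already generated.
seen = BloomFilter(size=100_000, hash_count=6)
seen.add("alan turing")
print("alan turing" in seen)   # True
print("ada lovelace" in seen)  # False (with overwhelming probability here)
```

With the default size of 100,000,000 bits and 6 hashes, the false-positive rate stays negligible until many millions of items have been inserted.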

With a local endpoint:

#!/bin/bash
set -x

PROMPT='make a list of the most important people in history'
MODEL=meta-llama/llama-4-scout-17b-16e-instruct

python main.py "$PROMPT" \
     --compression-level 9 \
     --max-tokens=2048 \
     --max-depth 100 \
     --seed=0 \
     --model="$MODEL" \
     --use-unsloth \
     --db /content/drive/MyDrive/IA/data.db \
     --prompt-prefixes 'please explain' 'please elaborate' 'think about' 'formulate a theory about' 'demonstrate that' \
     --batch-size 8 \
     --remove-prompt \
     --min-tfidf-score=0.1
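--compression-level maps to zlib's 1-9 levels before outputs land in the SQLite file given by --db. A rough sketch of that compress-then-store round trip; the table schema here is hypothetical, not the project's actual schema:

```python
import sqlite3
import zlib

# Hypothetical schema -- the real llm-distiller schema may differ.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE outputs (prompt TEXT, body BLOB)")

text = "Generated answer text. " * 200
blob = zlib.compress(text.encode("utf-8"), level=9)  # cf. --compression-level 9
conn.execute("INSERT INTO outputs VALUES (?, ?)", ("seed prompt", blob))
conn.commit()

(stored,) = conn.execute("SELECT body FROM outputs").fetchone()
restored = zlib.decompress(stored).decode("utf-8")
assert restored == text
print(f"{len(text.encode('utf-8'))} bytes -> {len(blob)} bytes compressed")
```

Level 9 trades CPU time for the smallest blobs, which suits an archive that is written once and read rarely.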

With a remote inference endpoint:

#!/bin/bash
set -x

PROMPT='make a list of the most important people in history'
PROVIDER=https://api.groq.com/openai/v1/
MODEL=meta-llama/llama-4-scout-17b-16e-instruct

python main.py "$PROMPT" \
  --api-url="$PROVIDER" \
  --model="$MODEL" \
  --max-tokens=4096 \
  --prompt-prefixes "please explain" "please elaborate" "think about" "formulate a theory about" "demonstrate that" \
  --remove-prompt \
  --secrets-file=.secrets.json \
  --min-tfidf-score=0.1
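--min-tfidf-score suggests generated terms are scored with TF-IDF and low scorers dropped before storage. A self-contained sketch of that kind of filter; the tokenization and the exact scoring formula are assumptions, not the project's code:

```python
import math
from collections import Counter

def tfidf_keep(docs, min_score):
    """Return terms whose TF-IDF score in at least one doc meets min_score."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many docs does each term appear?
    df = Counter(term for doc in tokenized for term in set(doc))
    kept = set()
    for doc in tokenized:
        tf = Counter(doc)
        for term, count in tf.items():
            score = (count / len(doc)) * math.log(n_docs / df[term])
            if score >= min_score:
                kept.add(term)
    return kept

docs = ["the cat sat", "the dog ran", "the cat ran"]
print(sorted(tfidf_keep(docs, min_score=0.1)))  # ['cat', 'dog', 'ran', 'sat']
```

Terms that appear in every document (here "the") get an IDF of zero and are filtered out, which is exactly the behavior wanted when pruning filler words from LLM output.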

The license is MIT.



Download files

Download the file for your platform.

Source Distribution

llm_distiller-0.1.0.tar.gz (11.5 kB)


Built Distribution


llm_distiller-0.1.0-py3-none-any.whl (14.5 kB)


File details

Details for the file llm_distiller-0.1.0.tar.gz.

File metadata

  • Download URL: llm_distiller-0.1.0.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_distiller-0.1.0.tar.gz:

  • SHA256: e9bd52a7536a2c0b84448b9ae863770e96bf92c771e480fd5ba299a69638c38f
  • MD5: 970f7950032ee8be321e01081842390a
  • BLAKE2b-256: 8183ddab39525082cf96d2a797f772ae8b65c02b45973cf3d1c519aeb1ebdcab

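To check a downloaded file against the SHA256 digest listed above, a small helper like the following works (the helper itself is illustrative; the digest is taken from this listing):

```python
import hashlib

def sha256_of(path: str) -> str:
    """Stream a file through SHA-256 and return the hex digest."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            h.update(chunk)
    return h.hexdigest()

EXPECTED = "e9bd52a7536a2c0b84448b9ae863770e96bf92c771e480fd5ba299a69638c38f"
# After downloading the sdist:
# assert sha256_of("llm_distiller-0.1.0.tar.gz") == EXPECTED
```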

Provenance

The following attestation bundles were made for llm_distiller-0.1.0.tar.gz:

Publisher: pypi-publish.yml on daedalus/llm-distiller


File details

Details for the file llm_distiller-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: llm_distiller-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 14.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_distiller-0.1.0-py3-none-any.whl:

  • SHA256: b6bd7898651eaf2c197258e9f0bf3bbb6674c61f1d5ffd71a85ac417c5264771
  • MD5: ac04931466bd2ca88328456c6ec5c7aa
  • BLAKE2b-256: 095307e7a645b7bc52af101e0b0366e040e7109979d28da06e1a721b7a7ebdf9


Provenance

The following attestation bundles were made for llm_distiller-0.1.0-py3-none-any.whl:

Publisher: pypi-publish.yml on daedalus/llm-distiller

