Run models distributed as GGUF files

These details have not been verified by PyPI

Project links

Project description

llm-gguf

Run models distributed as GGUF files using LLM

Installation

Install this plugin in the same environment as LLM:

llm install llm-gguf

Usage

This plugin runs models that have been distributed as GGUF files.

You can either ask the plugin to download these directly, or you can register models you have already downloaded.

To download the LM Studio GGUF of Llama 3.1 8B Instruct, run the following command:

llm gguf download-model \
  https://huggingface.co/lmstudio-community/Meta-Llama-3.1-8B-Instruct-GGUF/resolve/main/Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf \
  --alias llama-3.1-8b-instruct --alias l31i

The --alias options set aliases for that model, you can omit them if you don't want to set any.

This command will download the 4.92GB file and store it in the directory revealed by running llm gguf models-dir - on macOS this will be ~/Library/Application Support/io.datasette.llm/gguf/models.

Run llm models to confirm that the model has been installed.

You can then run prompts through that model like this:

llm -m gguf/Meta-Llama-3.1-8B-Instruct-Q4_K_M 'Five great names for a pet lemur'

Or using one of the aliases that you set like this:

llm -m l31i 'Five great names for a pet lemur'

You can start a persistent chat session with the model using llm chat - this will avoid having to load the model into memory for each prompt:

llm chat -m l31i

Chatting with gguf/Meta-Llama-3.1-8B-Instruct-Q4_K_M
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> tell me a joke about a walrus, a pelican and a lemur getting lunch
Here's one: Why did the walrus, the pelican, and the lemur go to the cafeteria for lunch? ...

If you have downloaded the model already you can register it with the plugin while keeping the file in its current location like this:

llm gguf register-model \
  ~/Downloads/Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf \
  --alias llama-3.1-8b-instruct --alias l31i

This plugin currently only works with chat models - these are usually distributed in files with the prefix -Instruct or -Chat or similar.

For non-chat models you may have better luck with the older llm-llama-cpp plugin.

Development

To set up this plugin locally, first checkout the code. Then create a new virtual environment:

cd llm-gguf
python3 -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

llm install -e '.[test]'

To run the tests:

pytest

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1a0 pre-release

Jul 23, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_gguf-0.1a0.tar.gz (8.5 kB view hashes)

Uploaded Jul 23, 2024 Source

Built Distribution

llm_gguf-0.1a0-py3-none-any.whl (8.8 kB view hashes)

Uploaded Jul 23, 2024 Python 3

Hashes for llm_gguf-0.1a0.tar.gz

Hashes for llm_gguf-0.1a0.tar.gz
Algorithm	Hash digest
SHA256	`4c5ed29f9fbafc5bb917b5b93115650d707b24fefcc83039c8bf0e6dbe120fb4`
MD5	`d2f6f22bfb25d1325ae51ec963a8dbe6`
BLAKE2b-256	`24f64502d240c03bcd16839bbf8921d9963f2a8fd18e28ee225c782087a44665`

Hashes for llm_gguf-0.1a0-py3-none-any.whl

Hashes for llm_gguf-0.1a0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8ec81023b844f35b6280cd8c868279ca04e5a1da6978e9c03b23bb2f6b6f8b09`
MD5	`04471f7b9b85939dbe42a2b3fa015dd0`
BLAKE2b-256	`6ae48ad223859b52207f0c507a288884de4893bb09838b8a7c3a9596e10a16d8`