Zero-friction GPU bootstrap for llama-cpp-python on Windows.
Project description
easyllama
Small helper package so your llama_cpp scripts can stay short while still using GPU.
What it does
- Adds CUDA DLL folders to runtime search path automatically
- Detects pip-installed NVIDIA runtime DLL folders inside the active venv
- Forces dedicated GPU usage defaults for
llama_cpp.Llama(unless you override)
Install (new laptop / fresh Windows)
- Install CUDA-enabled
llama-cpp-pythonin your venv:
python -m pip install --upgrade --force-reinstall --no-cache-dir llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu124
- Install this package:
cd "C:\Users\divid\Desktop\Data analytics\easyllama"
python -m pip install -e .
Use
from easyllama import Llama
llm = Llama(
model_path=r"C:\AI\Model\Qwen3-4B.Q4_K_M.gguf",
n_ctx=2048,
)
n_gpu_layers=-1, main_gpu=0, and offload_kqv=True are auto-set unless you pass your own values.
Also, logs are quiet by default (verbose=False, no_perf=True) so CUDA graph spam does not flood your terminal.
If you want debug logs back:
$env:LLAMA_VERBOSE="1"
$env:LLAMA_NO_PERF="0"
Quick check
python -m easyllama
PyPI Trusted Publishing (GitHub Actions)
This repo includes:
.github/workflows/publish.yml
How to enable:
- Push this project to GitHub.
- On PyPI, configure Trusted Publishing:
- If
easyllamadoes not exist yet: use account settings ->Publishing-> add a pending publisher foreasyllama. - If
easyllamaalready exists: open project settings ->Publishing-> add a publisher.
- If
- Fill with your GitHub details:
- Owner: your GitHub username/org
- Repository: your repo name
- Workflow name:
publish.yml - Environment name:
pypi
- Create and push a version tag like
v1.0.0. - GitHub Actions publishes automatically to PyPI using OIDC (no API token needed).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file easyllama_divi845-1.0.0.tar.gz.
File metadata
- Download URL: easyllama_divi845-1.0.0.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3b07ed9b96c254a95bba02a016424a0867a99cc99e90cf0114be002bc7509ecd
|
|
| MD5 |
bdbe72f373dfbfbffd91ef1f75b412e2
|
|
| BLAKE2b-256 |
53a2efacdceaa4310b1f7df5ae6b5bc0d5aa7614c93200cb660d0e5f092ff17b
|
Provenance
The following attestation bundles were made for easyllama_divi845-1.0.0.tar.gz:
Publisher:
publish.yml on Divi845/Python-Libraries
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
easyllama_divi845-1.0.0.tar.gz -
Subject digest:
3b07ed9b96c254a95bba02a016424a0867a99cc99e90cf0114be002bc7509ecd - Sigstore transparency entry: 1497791216
- Sigstore integration time:
-
Permalink:
Divi845/Python-Libraries@204e3709bb3872ea8a4ab8808b213a708d86bc80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/Divi845
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@204e3709bb3872ea8a4ab8808b213a708d86bc80 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file easyllama_divi845-1.0.0-py3-none-any.whl.
File metadata
- Download URL: easyllama_divi845-1.0.0-py3-none-any.whl
- Upload date:
- Size: 4.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9cff25937de22d046674797995c5bbe737e018e4922471cc19859c15ff36a1ab
|
|
| MD5 |
ca5a56358c935f75f753ab9537cec1bf
|
|
| BLAKE2b-256 |
720fc5dfd339e64fa47d680b87fb37ca546d72f7114ebf9be1488bec00324953
|
Provenance
The following attestation bundles were made for easyllama_divi845-1.0.0-py3-none-any.whl:
Publisher:
publish.yml on Divi845/Python-Libraries
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
easyllama_divi845-1.0.0-py3-none-any.whl -
Subject digest:
9cff25937de22d046674797995c5bbe737e018e4922471cc19859c15ff36a1ab - Sigstore transparency entry: 1497791316
- Sigstore integration time:
-
Permalink:
Divi845/Python-Libraries@204e3709bb3872ea8a4ab8808b213a708d86bc80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/Divi845
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@204e3709bb3872ea8a4ab8808b213a708d86bc80 -
Trigger Event:
workflow_dispatch
-
Statement type: