
llama.cpp server binary built from source

Project description

llama-cpp-bin

Pre-built llama.cpp server binaries distributed as a Python package. Install a wheel for your platform and run it.

Install

Pre-built wheels (recommended)

pip install --extra-index-url https://vladlearns.github.io/llama-cpp-bin/simple llama-cpp-bin

pip automatically selects the correct wheel for your platform (CPU, CUDA, ROCm, Vulkan, or Metal).

PyPI (builds from source)

If no pre-built wheel matches your platform, pip falls back to building from the sdist on PyPI:

pip install llama-cpp-bin

You will need CMake, a C++ compiler, and the llama.cpp source submodule.
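On Debian/Ubuntu, for example, the build prerequisites could be installed with something like the following (illustrative setup commands only; package names vary by distribution):

```shell
# CMake and a C++ toolchain for building llama.cpp from the sdist
sudo apt-get install cmake build-essential

# If you are working from a git checkout cloned without submodules,
# fetch the llama.cpp sources explicitly:
git submodule update --init --recursive
```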

Dev

git clone --recurse-submodules https://github.com/vladlearns/llama-cpp-bin
cd llama-cpp-bin
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -v .
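Other GGML backends can be enabled the same way. The flag names below track upstream llama.cpp's CMake options and have changed between versions, so verify them against the llama.cpp build documentation for the pinned submodule revision (build configuration sketch, not run here):

```shell
# Metal (macOS; typically enabled by default on Apple Silicon)
CMAKE_ARGS="-DGGML_METAL=ON" pip install -v .

# Vulkan
CMAKE_ARGS="-DGGML_VULKAN=ON" pip install -v .

# ROCm/HIP
CMAKE_ARGS="-DGGML_HIP=ON" pip install -v .
```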

Run

CLI:

llama-cpp-server -m your-model.gguf --port 8080

Python:

from llama_cpp_bin import run_server
proc = run_server("your-model.gguf", port=8080)
proc.wait()
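run_server returns a process handle, but the server may take a while to load the model before it accepts requests. A small readiness check against the llama.cpp server's /health endpoint can bridge that gap. This is a sketch: wait_for_health is a hypothetical helper written here for illustration, not part of the package.

```python
import time
import urllib.error
import urllib.request


def wait_for_health(port: int, timeout: float = 30.0) -> bool:
    """Poll the llama.cpp server's /health endpoint until it answers 200."""
    deadline = time.monotonic() + timeout
    url = f"http://127.0.0.1:{port}/health"
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=2) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            # Server not up yet (connection refused) or still loading
            time.sleep(0.25)
    return False


# Usage with run_server (assumes the package API shown above):
# from llama_cpp_bin import run_server
# proc = run_server("your-model.gguf", port=8080)
# if wait_for_health(8080):
#     ...  # server is ready to accept requests
# proc.terminate()
```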

Or get the binary path and run it yourself:

import subprocess
import llama_cpp_bin

# Locate the bundled server binary and launch it directly
binary = llama_cpp_bin.get_binary_path()
proc = subprocess.Popen([binary, "--model", "your-model.gguf"])
# ... interact with the server, then shut it down cleanly ...
proc.terminate()
proc.wait()

Download files

Download the file for your platform.

Source Distribution

llama_cpp_bin-9093.0.0.tar.gz (4.1 MB)

File details

Details for the file llama_cpp_bin-9093.0.0.tar.gz.

File metadata

  • Download URL: llama_cpp_bin-9093.0.0.tar.gz
  • Upload date:
  • Size: 4.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llama_cpp_bin-9093.0.0.tar.gz
Algorithm Hash digest
SHA256 cf7b3d717dbe3cd4fe409e0434fb889d740874c363cef5bc7855c2fba6bf7bfc
MD5 1903e4ed451ddd45c67da02619ae85f5
BLAKE2b-256 f27dff03345ec6774ff425cc850a3b74f95595230fc77b2bc6fd041071fec149


Provenance

The following attestation bundles were made for llama_cpp_bin-9093.0.0.tar.gz:

Publisher: build-everything.yml on vladlearns/llama-cpp-bin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
