llama.cpp server binary built from source

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vladlearns

Project description

llama-cpp-bin

Pre-built llama.cpp server binaries as a py package. Install a wheel for your platform and run it.

Install

Pre-built wheels (recommended)

pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/cpu llama-cpp-bin
pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/cu124 llama-cpp-bin
pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/cu131 llama-cpp-bin
pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/rocm llama-cpp-bin
pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/vulkan llama-cpp-bin

Pin to a specific version:

pip install --index-url https://vladlearns.github.io/llama-cpp-bin/whl/cu124 llama-cpp-bin==9095.0.0

PyPI (builds from source)

If no pre-built wheel matches your platform, pip falls back to building from the sdist on PyPI:

pip install llama-cpp-bin

You will need CMake, a c++ compiler, and the llama.cpp source submodule.

Dev

git clone --recurse-submodules https://github.com/vladlearns/llama-cpp-bin
cd llama-cpp-bin
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -v .

Run

CLI:

llama-cpp-server -m your-model.gguf --port 8080

Python:

from llama_cpp_bin import run_server
proc = run_server("your-model.gguf", port=8080)
proc.wait()

Or get the binary path and run it yourself:

import llama_cpp_bin
import subprocess
binary = llama_cpp_bin.get_binary_path()
subprocess.Popen([binary, "--model", "your-model.gguf"])

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vladlearns

Release history Release notifications | RSS feed

9856.0.0

Jul 1, 2026

9851.0.0

Jul 1, 2026

9847.0.0

Jun 30, 2026

9843.0.0

Jun 30, 2026

9840.0.0

Jun 29, 2026

9837.0.0

Jun 29, 2026

9830.0.0

Jun 28, 2026

9828.0.0

Jun 28, 2026

9827.0.0

Jun 27, 2026

9821.0.0

Jun 27, 2026

9811.0.0

Jun 26, 2026

This version

9802.0.0

Jun 26, 2026

9789.0.0

Jun 25, 2026

9785.0.0

Jun 25, 2026

9780.0.0

Jun 24, 2026

9775.0.0

Jun 24, 2026

9771.0.0

Jun 23, 2026

9763.0.0

Jun 23, 2026

9760.0.0

Jun 22, 2026

9754.0.0

Jun 22, 2026

9745.0.0

Jun 21, 2026

9743.0.0

Jun 21, 2026

9737.0.0

Jun 20, 2026

9733.0.0

Jun 20, 2026

9725.0.0

Jun 19, 2026

9713.0.0

Jun 19, 2026

9702.0.0

Jun 18, 2026

9692.0.0

Jun 18, 2026

9682.0.0

Jun 17, 2026

9672.0.0

Jun 17, 2026

9670.0.0

Jun 16, 2026

9660.0.0

Jun 16, 2026

9649.0.0

Jun 15, 2026

9637.0.0

Jun 15, 2026

9631.0.0

Jun 14, 2026

9628.0.0

Jun 14, 2026

9621.0.0

Jun 13, 2026

9616.0.0

Jun 13, 2026

9611.0.0

Jun 12, 2026

9601.0.0

Jun 12, 2026

9596.0.0

Jun 11, 2026

9592.0.0

Jun 11, 2026

9589.0.0

Jun 10, 2026

9585.0.0

Jun 10, 2026

9581.0.0

Jun 9, 2026

9568.0.0

Jun 9, 2026

9563.0.0

Jun 8, 2026

9553.0.0

Jun 8, 2026

9548.0.0

Jun 7, 2026

9544.0.0

Jun 7, 2026

9542.0.0

Jun 6, 2026

9536.0.0

Jun 6, 2026

9528.0.0

Jun 5, 2026

9518.0.0

Jun 5, 2026

9509.0.0

Jun 4, 2026

9496.0.0

Jun 4, 2026

9493.0.0

Jun 3, 2026

9484.0.0

Jun 3, 2026

9479.0.0

Jun 2, 2026

9464.0.0

Jun 2, 2026

9453.0.0

Jun 1, 2026

9444.0.0

Jun 1, 2026

9442.0.0

May 31, 2026

9437.0.0

May 31, 2026

9432.0.0

May 30, 2026

9415.0.0

May 30, 2026

9409.0.0

May 29, 2026

9388.0.0

May 29, 2026

9374.0.0

May 28, 2026

9371.0.0

May 28, 2026

9360.0.0

May 27, 2026

9352.0.0

May 27, 2026

9341.0.0

May 26, 2026

9326.0.0

May 26, 2026

9310.0.0

May 25, 2026

9305.0.0

May 24, 2026

9297.0.0

May 24, 2026

9295.0.0

May 23, 2026

9294.0.0

May 23, 2026

9283.0.0

May 22, 2026

9279.0.0

May 22, 2026

9265.0.0

May 21, 2026

9254.0.0

May 21, 2026

9247.0.0

May 20, 2026

9246.0.0

May 20, 2026

9113.0.0

May 12, 2026

9101.0.0

May 11, 2026

9097.0.0

May 10, 2026

9095.0.0

May 10, 2026

9094.0.0

May 10, 2026

9093.0.0

May 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_cpp_bin-9802.0.0.tar.gz (34.5 MB view details)

Uploaded Jun 26, 2026 Source

File details

Details for the file llama_cpp_bin-9802.0.0.tar.gz.

File metadata

Download URL: llama_cpp_bin-9802.0.0.tar.gz
Upload date: Jun 26, 2026
Size: 34.5 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llama_cpp_bin-9802.0.0.tar.gz
Algorithm	Hash digest
SHA256	`f6a49a118f1dc18a9b57e3d8437fe0a1af8c68017a367792fc4d016378aca5cb`
MD5	`ce1c5975488951f77628b91eb55b379b`
BLAKE2b-256	`1b95a35203706b0ec367190ef685359c311f67ed76fd6b6a7ec4f0b26ad44e14`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llama_cpp_bin-9802.0.0.tar.gz:

Publisher: build-everything.yml on vladlearns/llama-cpp-bin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llama_cpp_bin-9802.0.0.tar.gz
- Subject digest: f6a49a118f1dc18a9b57e3d8437fe0a1af8c68017a367792fc4d016378aca5cb
- Sigstore transparency entry: 1960993751
- Sigstore integration time: Jun 26, 2026
Source repository:
- Permalink: vladlearns/llama-cpp-bin@1662f978942bdb22ba0b5ab482f403919147c5f8
- Branch / Tag: refs/heads/main
- Owner: https://github.com/vladlearns
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: build-everything.yml@1662f978942bdb22ba0b5ab482f403919147c5f8
- Trigger Event: workflow_dispatch

llama-cpp-bin 9802.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

llama-cpp-bin

Install

Pre-built wheels (recommended)

PyPI (builds from source)

Dev

Run

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

Provenance