A Python wrapper for llama.cpp

These details have not been verified by PyPI

Project description

🦙 Python Bindings for `llama.cpp`

Simple Python bindings for @ggerganov's llama.cpp library. This package provides:

Low-level access to C API via ctypes interface.
High-level Python API for text completion
- OpenAI-like API
- LangChain compatibility

Installation

Install from PyPI (requires a c compiler):

pip install llama-cpp-python

The above command will attempt to install the package and build build llama.cpp from source. This is the recommended installation method as it ensures that llama.cpp is built with the available optimizations for your system.

This method defaults to using make to build llama.cpp on Linux / MacOS and cmake on Windows. You can force the use of cmake on Linux / MacOS setting the FORCE_CMAKE=1 environment variable before installing.

High-level API

>>> from llama_cpp import Llama
>>> llm = Llama(model_path="./models/7B/ggml-model.bin")
>>> output = llm("Q: Name the planets in the solar system? A: ", max_tokens=32, stop=["Q:", "\n"], echo=True)
>>> print(output)
{
  "id": "cmpl-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx",
  "object": "text_completion",
  "created": 1679561337,
  "model": "./models/7B/ggml-model.bin",
  "choices": [
    {
      "text": "Q: Name the planets in the solar system? A: Mercury, Venus, Earth, Mars, Jupiter, Saturn, Uranus, Neptune and Pluto.",
      "index": 0,
      "logprobs": None,
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 14,
    "completion_tokens": 28,
    "total_tokens": 42
  }
}

Web Server

llama-cpp-python offers a web server which aims to act as a drop-in replacement for the OpenAI API. This allows you to use llama.cpp compatible models with any OpenAI compatible client (language libraries, services, etc).

To install the server package and get started:

pip install llama-cpp-python[server]
export MODEL=./models/7B/ggml-model.bin
python3 -m llama_cpp.server

Navigate to http://localhost:8000/docs to see the OpenAPI documentation.

Low-level API

The low-level API is a direct ctypes binding to the C API provided by llama.cpp. The entire API can be found in llama_cpp/llama_cpp.py and should mirror llama.h.

Documentation

Documentation is available at https://abetlen.github.io/llama-cpp-python. If you find any issues with the documentation, please open an issue or submit a PR.

Development

This package is under active development and I welcome any contributions.

To get started, clone the repository and install the package in development mode:

git clone git@github.com:abetlen/llama-cpp-python.git
git submodule update --init --recursive
# Will need to be re-run any time vendor/llama.cpp is updated
python3 setup.py develop

How does this compare to other Python bindings of `llama.cpp`?

I originally wrote this package for my own use with two goals in mind:

Provide a simple process to install llama.cpp and access the full C API in llama.h from Python
Provide a high-level Python API that can be used as a drop-in replacement for the OpenAI API so existing apps can be easily ported to use llama.cpp

Any contributions and changes to this package will be made with these goals in mind.

License

This project is licensed under the terms of the MIT license.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.16

Aug 15, 2025

0.3.15

Aug 7, 2025

0.3.14

Jul 18, 2025

0.3.13

Jul 15, 2025

0.3.12

Jul 6, 2025

0.3.11

Jul 5, 2025

0.3.10

Jul 3, 2025

0.3.9

May 8, 2025

0.3.8

Mar 12, 2025

0.3.7

Jan 29, 2025

0.3.6

Jan 8, 2025

0.3.5

Dec 10, 2024

0.3.4

Dec 9, 2024

0.3.3

Dec 9, 2024

0.3.2

Nov 16, 2024

0.3.1

Sep 29, 2024

0.3.0

Sep 25, 2024

0.2.90

Aug 29, 2024

0.2.89

Aug 21, 2024

0.2.88

Aug 13, 2024

0.2.87

Aug 7, 2024

0.2.86

Aug 7, 2024

0.2.85

Jul 31, 2024

0.2.84

Jul 28, 2024

0.2.83

Jul 22, 2024

0.2.82

Jul 9, 2024

0.2.81

Jul 2, 2024

0.2.80

Jul 2, 2024

0.2.79

Jun 19, 2024

0.2.78

Jun 10, 2024

0.2.77

Jun 4, 2024

0.2.76

May 24, 2024

0.2.75

May 16, 2024

0.2.74

May 12, 2024

0.2.73

May 10, 2024

0.2.72

May 10, 2024

0.2.71

May 9, 2024

0.2.70

May 8, 2024

0.2.69

May 2, 2024

0.2.68

Apr 30, 2024

0.2.67

Apr 30, 2024

0.2.66

Apr 30, 2024

0.2.65

Apr 26, 2024

0.2.64

Apr 23, 2024

0.2.63

Apr 20, 2024

0.2.62

Apr 18, 2024

0.2.61

Apr 10, 2024

0.2.60

Apr 6, 2024

0.2.59

Apr 3, 2024

0.2.58

Apr 1, 2024

0.2.57

Mar 18, 2024

0.2.56

Mar 9, 2024

0.2.55

Mar 3, 2024

0.2.54

Mar 1, 2024

0.2.53

Feb 28, 2024

0.2.52

Feb 26, 2024

0.2.51

Feb 26, 2024

0.2.50

Feb 23, 2024

0.2.49

Feb 23, 2024

0.2.48

Feb 23, 2024

0.2.47

Feb 22, 2024

0.2.46

Feb 21, 2024

0.2.45

Feb 21, 2024

0.2.44

Feb 16, 2024

0.2.43

Feb 14, 2024

0.2.42

Feb 13, 2024

0.2.41

Feb 13, 2024

0.2.40

Feb 12, 2024

0.2.39

Feb 6, 2024

0.2.38

Jan 31, 2024

0.2.37

Jan 30, 2024

0.2.36

Jan 29, 2024

0.2.35

Jan 29, 2024

0.2.34

Jan 27, 2024

0.2.33

Jan 25, 2024

0.2.32

Jan 22, 2024

0.2.31

Jan 19, 2024

0.2.30

Jan 19, 2024

0.2.29

Jan 15, 2024

0.2.28

Jan 10, 2024

0.2.27

Jan 4, 2024

0.2.26

Dec 27, 2023

0.2.25

Dec 22, 2023

0.2.24

Dec 18, 2023

0.2.23

Dec 14, 2023

0.2.22

Dec 11, 2023

0.2.20

Nov 28, 2023

0.2.19

Nov 21, 2023

0.2.18

Nov 14, 2023

0.2.17

Nov 10, 2023

0.2.16

Nov 10, 2023

0.2.15

Nov 8, 2023

0.2.14

Nov 6, 2023

0.2.13

Nov 2, 2023

0.2.12

Nov 1, 2023

0.2.11

Sep 30, 2023

0.2.10

Sep 30, 2023

0.2.9

Sep 30, 2023

0.2.8 yanked

Sep 30, 2023

Reason this release was yanked:

Broken build

0.2.7

Sep 25, 2023

0.2.6

Sep 15, 2023

0.2.5

Sep 14, 2023

0.2.4

Sep 14, 2023

0.2.3

Sep 13, 2023

0.2.2

Sep 13, 2023

0.2.1

Sep 13, 2023

0.2.0

Sep 12, 2023

0.1.85

Sep 12, 2023

0.1.84

Sep 9, 2023

0.1.83

Aug 29, 2023

0.1.82

Aug 28, 2023

0.1.81

Aug 27, 2023

0.1.80

Aug 27, 2023

0.1.79

Aug 25, 2023

0.1.78

Aug 18, 2023

0.1.77

Jul 24, 2023

0.1.76

Jul 24, 2023

0.1.74

Jul 20, 2023

0.1.73

Jul 18, 2023

0.1.72

Jul 15, 2023

0.1.71

Jul 14, 2023

0.1.70

Jul 9, 2023

0.1.69

Jul 9, 2023

0.1.68

Jul 5, 2023

0.1.67

Jun 29, 2023

0.1.66

Jun 26, 2023

0.1.65

Jun 20, 2023

0.1.64

Jun 18, 2023

0.1.63

Jun 15, 2023

0.1.62

Jun 10, 2023

0.1.61

Jun 10, 2023

0.1.59

Jun 8, 2023

0.1.57

Jun 1, 2023

0.1.56

May 30, 2023

0.1.55

May 26, 2023

0.1.54

May 23, 2023

0.1.53

May 21, 2023

0.1.52

May 20, 2023

0.1.51

May 19, 2023

0.1.50

May 14, 2023

0.1.49

May 12, 2023

0.1.48

May 8, 2023

0.1.47

May 8, 2023

0.1.46

May 8, 2023

0.1.45

May 8, 2023

0.1.44

May 7, 2023

0.1.43

May 5, 2023

0.1.42

May 4, 2023

0.1.41

May 2, 2023

This version

0.1.40

May 1, 2023

0.1.39

Apr 28, 2023

0.1.38

Apr 25, 2023

0.1.37

Apr 25, 2023

0.1.36

Apr 22, 2023

0.1.35

Apr 20, 2023

0.1.34

Apr 16, 2023

0.1.33

Apr 13, 2023

0.1.32

Apr 10, 2023

0.1.31

Apr 10, 2023

0.1.30

Apr 10, 2023

0.1.29

Apr 10, 2023

0.1.28

Apr 10, 2023

0.1.27

Apr 8, 2023

0.1.26

Apr 8, 2023

0.1.25

Apr 7, 2023

0.1.24

Apr 7, 2023

0.1.23

Apr 5, 2023

0.1.22

Apr 5, 2023

0.1.21

Apr 5, 2023

0.1.20

Apr 4, 2023

0.1.19

Apr 4, 2023

0.1.18

Apr 3, 2023

0.1.17

Apr 3, 2023

0.1.16

Apr 2, 2023

0.1.15

Apr 2, 2023

0.1.14

Apr 2, 2023

0.1.13

Apr 1, 2023

0.1.12

Apr 1, 2023

0.1.11

Apr 1, 2023

0.1.10

Mar 29, 2023

0.1.9

Mar 28, 2023

0.1.8

Mar 28, 2023

0.1.7

Mar 26, 2023

0.1.6

Mar 25, 2023

0.1.5

Mar 25, 2023

0.1.4

Mar 24, 2023

0.1.3

Mar 24, 2023

0.1.2

Mar 24, 2023

0.1.1

Mar 23, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_cpp_python-0.1.40.tar.gz (1.1 MB view details)

Uploaded May 1, 2023 Source

File details

Details for the file llama_cpp_python-0.1.40.tar.gz.

File metadata

Download URL: llama_cpp_python-0.1.40.tar.gz
Upload date: May 1, 2023
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for llama_cpp_python-0.1.40.tar.gz
Algorithm	Hash digest
SHA256	`4f4391e88458a0a234d03e1a6b6b1285d29ca1030e7f5e76ad9b50a1dc940fef`
MD5	`7bb24b0c547b412f1ebf664900d6bc58`
BLAKE2b-256	`fc2c62c5ce16f88348f928320565cf6c0dfe8220a03615bff14e47e4f3b4e439`

See more details on using hashes here.

llama-cpp-python 0.1.40

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

🦙 Python Bindings for `llama.cpp`

Installation

High-level API

Web Server

Low-level API

Documentation

Development

How does this compare to other Python bindings of `llama.cpp`?

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

llama-cpp-python 0.1.40

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

🦙 Python Bindings for llama.cpp

Installation

High-level API

Web Server

Low-level API

Documentation

Development

How does this compare to other Python bindings of llama.cpp?

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

🦙 Python Bindings for `llama.cpp`

How does this compare to other Python bindings of `llama.cpp`?