OpenLLM: REST/gRPC API server for running any open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, Custom

These details have not been verified by PyPI

Project links

Project description

OpenLLM

REST/gRPC API server for running any Open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, and more
Powered by BentoML 🍱 + HuggingFace 🤗

To get started, simply install OpenLLM with pip:

pip install openllm

To start a LLM server, openllm start allows you to start any supported LLM with a single command. For example, to start a dolly-v2 server:

😌 tl;dr?

openllm start dolly-v2

# Starting LLM Server for 'dolly_v2'
#
# 2023-05-27T04:55:36-0700 [INFO] [cli] Environ for worker 0: set CPU thread coun t to 10
# 2023-05-27T04:55:36-0700 [INFO] [cli] Prometheus metrics for HTTP BentoServer f rom "_service.py:svc" can be accessed at http://localhost:3000/metrics.
# 2023-05-27T04:55:36-0700 [INFO] [cli] Starting production HTTP BentoServer from "_service.py:svc" listening on http://0.0.0.0:3000 (Press CTRL+C to quit)

To see a list of supported LLMs, run openllm start --help.

On a different terminal window, open a IPython session and create a client to start interacting with the model:

>>> import openllm
>>> client = openllm.client.HTTPClient('http://localhost:3000')
>>> client.query('Explain to me the difference between "further" and "farther"')

To package the LLM into a Bento, simply use openllm build:

openllm build dolly-v2

NOTE: To build OpenLLM from git source, pass in OPENLLM_DEV_BUILD=True to include the generated wheels into the bundle.

To fine-tune your own LLM, either use LLM.tuning():

>>> import openllm
>>> flan_t5 = openllm.LLM.from_pretrained("flan-t5")
>>> def fine_tuning():
...     fined_tune = flan_t5.tuning(method=openllm.tune.LORA | openllm.tune.P_TUNING, dataset='wikitext-2', ...)
...     fined_tune.save_pretrained('./fine-tuned-flan-t5', version='wikitext')
...     return fined_tune.path  # get the path of the pretrained
>>> finetune_path = fine_tuning()
>>> fined_tune_flan_t5 = openllm.LLM.from_pretrained('flan-t5', pretrained=finetune_path)
>>> fined_tune_flan_t5.generate('Explain to me the difference between "further" and "farther"')

📚 Features

🚂 SOTA LLMs: One-click stop-and-go supports for state-of-the-art LLMs, including StableLM, Llama, Alpaca, Dolly, Flan-T5, ChatGLM, Falcon, and more.

📦 Fine-tuning your own LLM: Easily fine-tune any LLM with LLM.tuning().

🔥 BentoML 🤝 HuggingFace: Built on top of BentoML and HuggingFace's ecosystem (transformers, optimum, peft, accelerate, datasets), provides similar APIs for ease-of-use.

⛓️ Interoperability: First class support for LangChain and 🤗 Hub allows you to easily chain LLMs together.

🎯 Streamline production deployment: Easily deploy any LLM via openllm bundle with the following:

☁️ BentoML Cloud: the fastest way to deploy your bento, simple and at scale
🦄️ Yatai: Model Deployment at scale on Kubernetes
🚀 bentoctl: Fast model deployment on AWS SageMaker, Lambda, ECE, GCP, Azure, Heroku, and more!

🍇 Telemetry

OpenLLM collects usage data that helps the team to improve the product. Only OpenLLM's internal API calls are being reported. We strip out as much potentially sensitive information as possible, and we will never collect user code, model data, or stack traces. Here's the code for usage tracking. You can opt-out of usage tracking by the --do-not-track CLI option:

openllm [command] --do-not-track

Or by setting environment variable OPENLLM_DO_NOT_TRACK=True:

export OPENLLM_DO_NOT_TRACK=True

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.30

Apr 21, 2025

0.6.29

Apr 16, 2025

0.6.28

Apr 16, 2025

0.6.27

Apr 16, 2025

0.6.26

Apr 16, 2025

0.6.25

Apr 11, 2025

0.6.24

Apr 10, 2025

0.6.23

Apr 2, 2025

0.6.22

Apr 1, 2025

0.6.21

Apr 1, 2025

0.6.20

Mar 12, 2025

0.6.19

Feb 15, 2025

0.6.18

Feb 7, 2025

0.6.17

Jan 13, 2025

0.6.16

Dec 19, 2024

0.6.15

Dec 3, 2024

0.6.14

Oct 29, 2024

0.6.13

Oct 17, 2024

0.6.12

Oct 17, 2024

0.6.11

Sep 30, 2024

0.6.10

Aug 19, 2024

0.6.9

Aug 12, 2024

0.6.8

Aug 12, 2024

0.6.7

Aug 2, 2024

0.6.6

Aug 1, 2024

0.6.5

Jul 15, 2024

0.6.4

Jul 12, 2024

0.6.3

Jul 11, 2024

0.6.2

Jul 11, 2024

0.6.1

Jul 10, 2024

0.6.0

Jul 10, 2024

0.5.7

Jun 14, 2024

0.5.6

Jun 11, 2024

0.5.5

Jun 3, 2024

0.5.4

Jun 1, 2024

0.5.3

May 30, 2024

0.5.2

May 29, 2024

0.5.1

May 29, 2024

0.5.0 yanked

May 27, 2024

Reason this release was yanked:

bug with prompt_token_ids

0.5.0a15 pre-release

May 27, 2024

0.5.0a14 pre-release

May 23, 2024

0.5.0a13 pre-release

May 22, 2024

0.5.0a12 pre-release

May 14, 2024

0.5.0a11 pre-release

May 12, 2024

0.5.0a10 pre-release

May 9, 2024

0.5.0a9 pre-release

May 9, 2024

0.5.0a8 pre-release

May 9, 2024

0.5.0a7 pre-release

May 9, 2024

0.5.0a6 pre-release

May 9, 2024

0.5.0a5 pre-release

May 8, 2024

0.5.0a4 pre-release

May 8, 2024

0.5.0a3 pre-release

Apr 2, 2024

0.5.0a2 pre-release

Apr 2, 2024

0.5.0a1 pre-release

Mar 21, 2024

0.5.0a0 pre-release

Mar 15, 2024

0.4.44

Feb 6, 2024

0.4.43

Feb 5, 2024

0.4.42

Feb 2, 2024

0.4.41

Dec 18, 2023

0.4.40

Dec 15, 2023

0.4.39

Dec 14, 2023

0.4.38

Dec 13, 2023

0.4.37

Dec 13, 2023

0.4.36

Dec 12, 2023

0.4.35

Dec 7, 2023

0.4.34

Nov 30, 2023

0.4.33

Nov 29, 2023

0.4.32

Nov 29, 2023

0.4.31

Nov 26, 2023

0.4.30

Nov 26, 2023

0.4.29

Nov 26, 2023

0.4.28

Nov 24, 2023

0.4.27

Nov 24, 2023

0.4.26

Nov 22, 2023

0.4.25

Nov 22, 2023

0.4.24

Nov 22, 2023

0.4.23

Nov 22, 2023

0.4.22

Nov 21, 2023

0.4.21

Nov 20, 2023

0.4.20

Nov 20, 2023

0.4.19

Nov 20, 2023

0.4.18

Nov 20, 2023

0.4.17

Nov 20, 2023

0.4.16

Nov 19, 2023

0.4.15

Nov 19, 2023

0.4.14

Nov 17, 2023

0.4.13

Nov 17, 2023

0.4.12

Nov 17, 2023

0.4.11

Nov 17, 2023

0.4.10

Nov 17, 2023

0.4.9

Nov 15, 2023

0.4.8

Nov 15, 2023

0.4.7

Nov 15, 2023

0.4.6

Nov 14, 2023

0.4.5

Nov 13, 2023

0.4.4

Nov 12, 2023

0.4.3

Nov 12, 2023

0.4.2

Nov 12, 2023

0.4.1

Nov 8, 2023

0.4.0

Nov 7, 2023

0.3.14

Nov 4, 2023

0.3.13

Oct 31, 2023

0.3.12

Oct 30, 2023

0.3.10

Oct 30, 2023

0.3.9

Oct 17, 2023

0.3.8

Oct 16, 2023

0.3.7

Oct 12, 2023

0.3.6

Sep 19, 2023

0.3.5

Sep 18, 2023

0.3.4

Sep 14, 2023

0.3.3

Sep 7, 2023

0.3.2

Sep 6, 2023

0.3.1

Sep 6, 2023

0.3.0

Sep 4, 2023

0.2.27

Aug 25, 2023

0.2.26

Aug 17, 2023

0.2.25

Aug 16, 2023

0.2.24

Aug 15, 2023

0.2.23

Aug 15, 2023

0.2.22

Aug 11, 2023

0.2.21 yanked

Aug 11, 2023

Reason this release was yanked:

broken client

0.2.20

Aug 10, 2023

0.2.19 yanked

Aug 10, 2023

Reason this release was yanked:

broken imports from compiled init

0.2.18

Aug 9, 2023

0.2.17

Aug 8, 2023

0.2.16

Aug 4, 2023

0.2.15 yanked

Aug 4, 2023

Reason this release was yanked:

include a regression with vllm

0.2.14 yanked

Aug 4, 2023

Reason this release was yanked:

include a regression with vllm

0.2.13

Aug 3, 2023

0.2.12

Aug 1, 2023

0.2.11

Jul 28, 2023

0.2.10

Jul 25, 2023

0.2.9

Jul 24, 2023

0.2.8

Jul 24, 2023

0.2.7

Jul 23, 2023

0.2.6

Jul 22, 2023

0.2.5

Jul 21, 2023

0.2.4

Jul 21, 2023

0.2.3

Jul 21, 2023

0.2.2

Jul 21, 2023

0.2.1 yanked

Jul 20, 2023

Reason this release was yanked:

Broken installation with openllm[llama]

0.2.0

Jul 20, 2023

0.1.20

Jul 5, 2023

0.1.19

Jun 29, 2023

0.1.18

Jun 29, 2023

0.1.17

Jun 27, 2023

0.1.16

Jun 27, 2023

0.1.15

Jun 26, 2023

0.1.14

Jun 25, 2023

0.1.13

Jun 24, 2023

0.1.12

Jun 24, 2023

0.1.11

Jun 23, 2023

0.1.10

Jun 21, 2023

0.1.9

Jun 21, 2023

0.1.8

Jun 19, 2023

0.1.7

Jun 19, 2023

0.1.6

Jun 17, 2023

0.1.5

Jun 15, 2023

0.1.4

Jun 14, 2023

0.1.3

Jun 14, 2023

0.1.2

Jun 13, 2023

0.1.1

Jun 12, 2023

0.1.0

Jun 12, 2023

0.0.34

Jun 11, 2023

0.0.33

Jun 10, 2023

0.0.32

Jun 9, 2023

0.0.31

Jun 8, 2023

0.0.30

Jun 8, 2023

0.0.29

Jun 8, 2023

0.0.28

Jun 8, 2023

0.0.27

Jun 8, 2023

0.0.26

Jun 7, 2023

0.0.25

Jun 6, 2023

0.0.24

Jun 6, 2023

0.0.23

Jun 6, 2023

0.0.22

Jun 6, 2023

0.0.21

Jun 4, 2023

0.0.19

Jun 4, 2023

0.0.18

Jun 4, 2023

This version

0.0.17

Jun 4, 2023

0.0.16

Jun 2, 2023

0.0.15

Jun 1, 2023

0.0.14

May 31, 2023

0.0.13

May 30, 2023

0.0.12

May 30, 2023

0.0.11

May 30, 2023

0.0.10

May 29, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openllm-0.0.17.tar.gz (12.0 MB view details)

Uploaded Jun 4, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openllm-0.0.17-py3-none-any.whl (105.5 kB view details)

Uploaded Jun 4, 2023 Python 3

File details

Details for the file openllm-0.0.17.tar.gz.

File metadata

Download URL: openllm-0.0.17.tar.gz
Upload date: Jun 4, 2023
Size: 12.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.24.0

File hashes

Hashes for openllm-0.0.17.tar.gz
Algorithm	Hash digest
SHA256	`a5aca48bf0b95937dfc4f44c0b885ac5760ecda169bce90fdf41ecf8f212c30c`
MD5	`d16edd4ef17ba8fcdb141b519aaea491`
BLAKE2b-256	`430bcb768f566a1efc10423da9a7a2e829bd2b0896ebc836235e224eeccbd724`

See more details on using hashes here.

File details

Details for the file openllm-0.0.17-py3-none-any.whl.

File metadata

Download URL: openllm-0.0.17-py3-none-any.whl
Upload date: Jun 4, 2023
Size: 105.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.24.0

File hashes

Hashes for openllm-0.0.17-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a50d84a041bbef6704a6e80799c3394b5b18b8116e82522c22dde42bfe898705`
MD5	`6a04bb923dce86389d654a54d5391143`
BLAKE2b-256	`276ac0d87a974d0a1851a696f1cc50f50b270358111e455bde0475c1720c4558`

See more details on using hashes here.

openllm 0.0.17

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

OpenLLM

😌 tl;dr?

📚 Features

🍇 Telemetry

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes