OpenLLM: REST/gRPC API server for running any open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, Custom

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

OpenLLM

REST/gRPC API server for running any Open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, and more
Powered by BentoML 🍱 + HuggingFace 🤗

To get started, simply install OpenLLM with pip:

pip install openllm

To start a LLM server, openllm start allows you to start any supported LLM with a single command. For example, to start a dolly-v2 server:

😌 tl;dr?

openllm start dolly-v2

# Starting LLM Server for 'dolly_v2'
#
# 2023-05-27T04:55:36-0700 [INFO] [cli] Environ for worker 0: set CPU thread coun t to 10
# 2023-05-27T04:55:36-0700 [INFO] [cli] Prometheus metrics for HTTP BentoServer f rom "_service.py:svc" can be accessed at http://localhost:3000/metrics.
# 2023-05-27T04:55:36-0700 [INFO] [cli] Starting production HTTP BentoServer from "_service.py:svc" listening on http://0.0.0.0:3000 (Press CTRL+C to quit)

To see a list of supported LLMs, run openllm start --help.

On a different terminal window, open a IPython session and create a client to start interacting with the model:

>>> import openllm
>>> client = openllm.client.HTTPClient('http://localhost:3000')
>>> client.query('Explain to me the difference between "further" and "farther"')

To package the LLM into a Bento, simply use openllm build:

openllm build dolly-v2

NOTE: To build OpenLLM from git source, pass in OPENLLM_DEV_BUILD=True to include the generated wheels into the bundle.

To fine-tune your own LLM, either use LLM.tuning():

>>> import openllm
>>> flan_t5 = openllm.LLM.from_pretrained("flan-t5")
>>> def fine_tuning():
...     fined_tune = flan_t5.tuning(method=openllm.tune.LORA | openllm.tune.P_TUNING, dataset='wikitext-2', ...)
...     fined_tune.save_pretrained('./fine-tuned-flan-t5', version='wikitext')
...     return fined_tune.path  # get the path of the pretrained
>>> finetune_path = fine_tuning()
>>> fined_tune_flan_t5 = openllm.LLM.from_pretrained('flan-t5', pretrained=finetune_path)
>>> fined_tune_flan_t5.generate('Explain to me the difference between "further" and "farther"')

📚 Features

🚂 SOTA LLMs: One-click stop-and-go supports for state-of-the-art LLMs, including StableLM, Llama, Alpaca, Dolly, Flan-T5, ChatGLM, Falcon, and more.

📦 Fine-tuning your own LLM: Easily fine-tune any LLM with LLM.tuning().

🔥 BentoML 🤝 HuggingFace: Built on top of BentoML and HuggingFace's ecosystem (transformers, optimum, peft, accelerate, datasets), provides similar APIs for ease-of-use.

⛓️ Interoperability: First class support for LangChain and 🤗 Hub allows you to easily chain LLMs together.

🎯 Streamline production deployment: Easily deploy any LLM via openllm bundle with the following:

☁️ BentoML Cloud: the fastest way to deploy your bento, simple and at scale
🦄️ Yatai: Model Deployment at scale on Kubernetes
🚀 bentoctl: Fast model deployment on AWS SageMaker, Lambda, ECE, GCP, Azure, Heroku, and more!

🍇 Telemetry

OpenLLM collects usage data that helps the team to improve the product. Only OpenLLM's internal API calls are being reported. We strip out as much potentially sensitive information as possible, and we will never collect user code, model data, or stack traces. Here's the code for usage tracking. You can opt-out of usage tracking by the --do-not-track CLI option:

openllm [command] --do-not-track

Or by setting environment variable OPENLLM_DO_NOT_TRACK=True:

export OPENLLM_DO_NOT_TRACK=True

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.5.0a12 pre-release

May 14, 2024

0.5.0a11 pre-release

May 12, 2024

0.5.0a10 pre-release

May 9, 2024

0.5.0a9 pre-release

May 9, 2024

0.5.0a8 pre-release

May 9, 2024

0.5.0a7 pre-release

May 9, 2024

0.5.0a6 pre-release

May 9, 2024

0.5.0a5 pre-release

May 8, 2024

0.5.0a4 pre-release

May 8, 2024

0.5.0a3 pre-release

Apr 2, 2024

0.5.0a2 pre-release

Apr 2, 2024

0.5.0a1 pre-release

Mar 21, 2024

0.5.0a0 pre-release

Mar 15, 2024

0.4.44

Feb 6, 2024

0.4.43

Feb 5, 2024

0.4.42

Feb 2, 2024

0.4.41

Dec 18, 2023

0.4.40

Dec 15, 2023

0.4.39

Dec 14, 2023

0.4.38

Dec 13, 2023

0.4.37

Dec 13, 2023

0.4.36

Dec 12, 2023

0.4.35

Dec 7, 2023

0.4.34

Nov 30, 2023

0.4.33

Nov 29, 2023

0.4.32

Nov 29, 2023

0.4.31

Nov 26, 2023

0.4.30

Nov 26, 2023

0.4.29

Nov 26, 2023

0.4.28

Nov 24, 2023

0.4.27

Nov 24, 2023

0.4.26

Nov 22, 2023

0.4.25

Nov 22, 2023

0.4.24

Nov 22, 2023

0.4.23

Nov 22, 2023

0.4.22

Nov 21, 2023

0.4.21

Nov 20, 2023

0.4.20

Nov 20, 2023

0.4.19

Nov 20, 2023

0.4.18

Nov 20, 2023

0.4.17

Nov 20, 2023

0.4.16

Nov 19, 2023

0.4.15

Nov 19, 2023

0.4.14

Nov 17, 2023

0.4.13

Nov 17, 2023

0.4.12

Nov 17, 2023

0.4.11

Nov 17, 2023

0.4.10

Nov 17, 2023

0.4.9

Nov 15, 2023

0.4.8

Nov 15, 2023

0.4.7

Nov 15, 2023

0.4.6

Nov 14, 2023

0.4.5

Nov 13, 2023

0.4.4

Nov 12, 2023

0.4.3

Nov 12, 2023

0.4.2

Nov 12, 2023

0.4.1

Nov 8, 2023

0.4.0

Nov 7, 2023

0.3.14

Nov 4, 2023

0.3.13

Oct 31, 2023

0.3.12

Oct 30, 2023

0.3.10

Oct 30, 2023

0.3.9

Oct 17, 2023

0.3.8

Oct 16, 2023

0.3.7

Oct 12, 2023

0.3.6

Sep 19, 2023

0.3.5

Sep 18, 2023

0.3.4

Sep 14, 2023

0.3.3

Sep 7, 2023

0.3.2

Sep 6, 2023

0.3.1

Sep 6, 2023

0.3.0

Sep 4, 2023

0.2.27

Aug 25, 2023

0.2.26

Aug 17, 2023

0.2.25

Aug 16, 2023

0.2.24

Aug 15, 2023

0.2.23

Aug 15, 2023

0.2.22

Aug 11, 2023

0.2.21 yanked

Aug 11, 2023

Reason this release was yanked:

broken client

0.2.20

Aug 10, 2023

0.2.19 yanked

Aug 10, 2023

Reason this release was yanked:

broken imports from compiled init

0.2.18

Aug 9, 2023

0.2.17

Aug 8, 2023

0.2.16

Aug 4, 2023

0.2.15 yanked

Aug 4, 2023

Reason this release was yanked:

include a regression with vllm

0.2.14 yanked

Aug 4, 2023

Reason this release was yanked:

include a regression with vllm

0.2.13

Aug 3, 2023

0.2.12

Aug 1, 2023

0.2.11

Jul 28, 2023

0.2.10

Jul 25, 2023

0.2.9

Jul 24, 2023

0.2.8

Jul 24, 2023

0.2.7

Jul 23, 2023

0.2.6

Jul 22, 2023

0.2.5

Jul 21, 2023

0.2.4

Jul 21, 2023

0.2.3

Jul 21, 2023

0.2.2

Jul 21, 2023

0.2.1 yanked

Jul 20, 2023

Reason this release was yanked:

Broken installation with openllm[llama]

0.2.0

Jul 20, 2023

0.1.20

Jul 5, 2023

0.1.19

Jun 29, 2023

0.1.18

Jun 29, 2023

0.1.17

Jun 27, 2023

0.1.16

Jun 27, 2023

0.1.15

Jun 26, 2023

0.1.14

Jun 25, 2023

0.1.13

Jun 24, 2023

0.1.12

Jun 24, 2023

0.1.11

Jun 23, 2023

0.1.10

Jun 21, 2023

0.1.9

Jun 21, 2023

0.1.8

Jun 19, 2023

0.1.7

Jun 19, 2023

0.1.6

Jun 17, 2023

0.1.5

Jun 15, 2023

0.1.4

Jun 14, 2023

0.1.3

Jun 14, 2023

0.1.2

Jun 13, 2023

0.1.1

Jun 12, 2023

0.1.0

Jun 12, 2023

0.0.34

Jun 11, 2023

0.0.33

Jun 10, 2023

0.0.32

Jun 9, 2023

0.0.31

Jun 8, 2023

0.0.30

Jun 8, 2023

0.0.29

Jun 8, 2023

0.0.28

Jun 8, 2023

0.0.27

Jun 8, 2023

0.0.26

Jun 7, 2023

0.0.25

Jun 6, 2023

0.0.24

Jun 6, 2023

0.0.23

Jun 6, 2023

0.0.22

Jun 6, 2023

0.0.21

Jun 4, 2023

0.0.19

Jun 4, 2023

This version

0.0.18

Jun 4, 2023

0.0.17

Jun 4, 2023

0.0.16

Jun 2, 2023

0.0.15

Jun 1, 2023

0.0.14

May 31, 2023

0.0.13

May 30, 2023

0.0.12

May 30, 2023

0.0.11

May 30, 2023

0.0.10

May 29, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openllm-0.0.18.tar.gz (80.0 kB view hashes)

Uploaded Jun 4, 2023 Source

Built Distribution

openllm-0.0.18-py3-none-any.whl (105.1 kB view hashes)

Uploaded Jun 4, 2023 Python 3

Hashes for openllm-0.0.18.tar.gz

Hashes for openllm-0.0.18.tar.gz
Algorithm	Hash digest
SHA256	`ef13dc510c8790ee5769c9916a633242eeb318c8c53c2d2d639a1e2f4f26bc3e`
MD5	`67d14bf785427591bf7cf9f2e652de28`
BLAKE2b-256	`fd97c028738189af37e31451a104c0ccb9e64f48cbb10c99a3f4fe6c142686ea`

Hashes for openllm-0.0.18-py3-none-any.whl

Hashes for openllm-0.0.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf35666286ae14c938228a3262dce05c1e84b4832c4aec3ab2e80924c4afc5f6`
MD5	`43cdca65c755e957c9be01115c90502b`
BLAKE2b-256	`044849183151bd6d2b1d95a3e0d7954010dd02f8dfed2c87451fef3aa84e03bd`