
Byzer-LLM

Byzer-LLM is a full-lifecycle LLM solution built on Ray, covering pretraining, fine-tuning, deployment, and serving.

Byzer-LLM differs from other LLM solutions in two ways. First, it supports Byzer-SQL, a SQL dialect for managing the LLM lifecycle, whereas most other solutions offer only a Python API. The available interfaces are:

  1. Python (alpha)
  2. Byzer-SQL (stable)
  3. Rest API (todo...)

Second, Byzer-LLM is built entirely on Ray, so you can deploy multiple LLM models on a single machine or across a cluster, which is essential for large-scale deployment. Byzer-LLM also supports vLLM, DeepSpeed, and Transformers transparently as inference backends.

Versions

  • 0.1.13: support shutdown cluster for byzer-retrieval
  • 0.1.12: Support Python API (alpha)
  • 0.1.5: Support python wrapper for byzer-retrieval

Installation

pip install -r requirements.txt
pip install -U byzerllm
ray start --head

Usage (Python)

import ray
from byzerllm.utils.client import ByzerLLM, LLMRequest, InferBackend

# Connect to the running Ray cluster
ray.init(address="auto", namespace="default", ignore_reinit_error=True)

llm = ByzerLLM()

# One worker with 4 GPUs, using the Transformers backend
llm.setup_gpus_per_worker(4).setup_num_workers(1)
llm.setup_infer_backend(InferBackend.transformers)

# Deploy the model as a UDF named "llama2_chat"
llm.deploy(model_path="/home/byzerllm/models/openbuddy-llama-13b-v5-fp16",
           pretrained_model_type="custom/llama2",
           udf_name="llama2_chat",
           infer_params={})

llm.chat("llama2_chat", LLMRequest(instruction="hello world"))[0].output

The code above deploys a llama2 model and then uses it to answer the input text. The Python API is simple to use and convenient for exploring LLM models.
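In practice a chat call can fail transiently (model still loading, worker restarting), so a small retry wrapper is often useful. A minimal, library-agnostic sketch; the chat_fn callable and the retry policy are assumptions here, not part of the Byzer-LLM API:

```python
import time

def chat_with_retry(chat_fn, prompt, retries=3, backoff=0.5):
    """Call chat_fn(prompt), retrying on failure with exponential backoff.

    chat_fn is any callable returning the model output as a string; with
    Byzer-LLM it could wrap llm.chat(...)[0].output (hypothetical usage).
    """
    last_err = None
    for attempt in range(retries):
        try:
            return chat_fn(prompt)
        except Exception as err:  # narrow the exception type in real code
            last_err = err
            time.sleep(backoff * (2 ** attempt))
    raise RuntimeError(f"chat failed after {retries} attempts") from last_err

# Usage with a stand-in chat function:
print(chat_with_retry(lambda p: p.upper(), "hello"))  # prints "HELLO"
```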

Usage (Byzer-SQL)

The following code has the same effect as the Python code above.

!byzerllm setup single;
!byzerllm setup "num_gpus=4";
!byzerllm setup "maxConcurrency=1";
!byzerllm setup "infer_backend=transformers";

run command as LLM.`` where 
action="infer"
and pretrainedModelType="custom/llama2"
and localModelDir="/home/byzerllm/models/openbuddy-llama-13b-v5-fp16"
and reconnect="false"
and udfName="llama2_chat"
and modelTable="command";

select 
llama2_chat(llm_param(map(
              "user_role","User",
              "assistant_role","Assistant",
              "system_msg",'You are a helpful assistant. Think it over and answer the user question correctly.',
              "instruction",llm_prompt('
Please remember my name: {0}              
',array("Zhu William"))
))) as q as q1;

Once you deploy the model with run command as LLM, you can use it as a SQL function. This is useful for data scientists who want to use LLMs in their data analysis, and for data engineers who want to use LLMs in their data pipelines.
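The llm_param/llm_prompt pair above assembles a role-tagged prompt with positional placeholder substitution. A hypothetical pure-Python equivalent of that assembly (the exact template Byzer-SQL produces is an assumption; this only illustrates the mechanics):

```python
def llm_prompt(template, args):
    """Substitute positional placeholders {0}, {1}, ... as llm_prompt does."""
    return template.format(*args).strip()

def build_prompt(user_role, assistant_role, system_msg, instruction):
    """Assemble a role-tagged chat prompt from llm_param-style fields."""
    return f"{system_msg}\n{user_role}: {instruction}\n{assistant_role}:"

instruction = llm_prompt("Please remember my name: {0}", ["Zhu William"])
print(build_prompt("User", "Assistant",
                   "You are a helpful assistant.", instruction))
```

The final string is what the deployed model actually sees as its input, which is why the role names must match what the model was trained with.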

Cooperate with Byzer-Retrieval

Byzer-LLM can cooperate with Byzer-Retrieval to build a RAG application. The following code shows how to use Byzer-LLM and Byzer-Retrieval together.

The first step is to connect to the Ray cluster:

# Make the byzer-retrieval jars and JDK 21 visible to the Ray workers
code_search_path = ["/home/byzerllm/softwares/byzer-retrieval-lib/"]
env_vars = {"JAVA_HOME": "/home/byzerllm/softwares/jdk-21",
            "PATH": "/home/byzerllm/softwares/jdk-21/bin:/home/byzerllm/.rvm/gems/ruby-3.2.2/bin:/home/byzerllm/.rvm/gems/ruby-3.2.2@global/bin:/home/byzerllm/.rvm/rubies/ruby-3.2.2/bin:/home/byzerllm/.rbenv/shims:/home/byzerllm/.rbenv/bin:/home/byzerllm/softwares/byzer-lang-all-in-one-linux-amd64-3.3.0-2.3.7/jdk8/bin:/usr/local/cuda/bin:/usr/local/cuda/bin:/home/byzerllm/.rbenv/shims:/home/byzerllm/.rbenv/bin:/home/byzerllm/miniconda3/envs/byzerllm-dev/bin:/home/byzerllm/miniconda3/condabin:/home/byzerllm/.local/bin:/home/byzerllm/bin:/usr/local/cuda/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/home/byzerllm/.rvm/bin:/home/byzerllm/.rvm/bin"}

import ray

ray.init(address="auto", namespace="default",
         job_config=ray.job_config.JobConfig(code_search_path=code_search_path,
                                             runtime_env={"env_vars": env_vars}))

The second step is to create ByzerLLM/ByzerRetrieval:

from byzerllm.utils.retrieval import ByzerRetrieval
from byzerllm.utils.client import ByzerLLM,LLMRequest,LLMResponse,LLMHistoryItem,LLMRequestExtra
from byzerllm.records import SearchQuery

retrieval = ByzerRetrieval()
retrieval.launch_gateway()  # start the retrieval gateway on the Ray cluster

llm = ByzerLLM()

Now you can combine llm.chat and retrieval.search to build a RAG application.
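The retrieve-then-generate flow is: search for relevant documents, concatenate the hits into a context block, and pass that block to the chat call. A minimal sketch of the pattern with stand-in search/chat callables (the real ByzerRetrieval and ByzerLLM signatures differ; this only illustrates the control flow):

```python
def rag_answer(question, search_fn, chat_fn, top_k=3):
    """Retrieve top_k documents, stuff them into the prompt, then generate."""
    docs = search_fn(question)[:top_k]
    context = "\n".join(f"- {d}" for d in docs)
    prompt = (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )
    return chat_fn(prompt)

# Stand-in backends for illustration only
corpus = ["Ray is a distributed compute framework.",
          "Byzer-LLM runs on top of Ray."]
search = lambda q: [d for d in corpus if any(w in d for w in q.split())]
chat = lambda p: p.splitlines()[-1]  # echoes the question line

print(rag_answer("What is Ray", search, chat))  # prints "Question: What is Ray"
```

In a real application, search_fn would wrap retrieval.search and chat_fn would wrap llm.chat against the deployed UDF.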
