rwkv · PyPI

The RWKV Language Model

Project description

The RWKV Language Model

https://github.com/BlinkDL/ChatRWKV

https://github.com/BlinkDL/RWKV-LM

import os

# Set these before importing RWKV
os.environ['RWKV_JIT_ON'] = '1'
os.environ["RWKV_CUDA_ON"] = '0' # if '1', compile the CUDA kernel for seq mode (much faster)

########################################################################################################
#
# Use '/' in model path, instead of '\'. Use ctx4096 models if you need long ctx.
#
# fp16 = good for GPU (!!! DOES NOT support CPU !!!)
# fp32 = good for CPU
# bf16 = worse accuracy, supports CPU
#
# Strategy examples: (device = cpu/cuda/cuda:0/cuda:1/...)
# Here we consider [ln_out+head] to be an extra layer, so L12-D768 model has "13" layers, L24-D2048 model has "25" layers, etc.
#
# 'cpu fp32' = everything on cpu fp32
# 'cuda fp16' = everything on cuda fp16
#
# 'cuda fp16 *6 -> cpu fp32' = first 6 layers on cuda fp16, then on cpu fp32
# 'cuda:0 fp16 *10 -> cuda:1 fp16 *8 -> cpu fp32' = first 10 layers on cuda:0 fp16, then 8 layers on cuda:1 fp16, then on cpu fp32
#
# Use '+' for STREAM mode (do it on your fastest GPU), requires some VRAM to store streamed layers
# 'cuda fp16 *6+' = first 6 layers on cuda fp16, then stream the rest on it
# (for best speed: try *1+ *2+ *3+ ... until you run out of VRAM)
#
# Extreme STREAM: 3G VRAM is enough to run RWKV 14B (slow. will be faster in future)
# 'cuda fp16 *0+ -> cpu fp32 *1' = stream all layers on cuda fp16, then [ln_out+head] on cpu fp32
#
# ########################################################################################################

from rwkv.model import RWKV
from rwkv.utils import PIPELINE, PIPELINE_ARGS

# download models: https://huggingface.co/BlinkDL
model = RWKV(model='/fsx/BlinkDL/HF-MODEL/rwkv-4-pile-169m/RWKV-4-Pile-169M-20220807-8023', strategy='cpu fp32')

# the model must exist before the pipeline is built; find the tokenizer file in https://github.com/BlinkDL/ChatRWKV
pipeline = PIPELINE(model, "20B_tokenizer.json")

ctx = "\nIn a shocking finding, scientists discovered a herd of dragons living in a remote, previously unexplored valley, in Tibet. Even more surprising to the researchers was the fact that the dragons spoke perfect Chinese."
print(ctx, end='')

def my_print(s):
    print(s, end='', flush=True)

# For alpha_frequency and alpha_presence, see "Frequency and presence penalties":
# https://platform.openai.com/docs/api-reference/parameter-details

args = PIPELINE_ARGS(temperature = 1.0, top_p = 0.7,
                     alpha_frequency = 0.25,
                     alpha_presence = 0.25,
                     token_ban = [0], # ban the generation of some tokens
                     token_stop = []) # stop generation whenever you see any token here

pipeline.generate(ctx, token_count=512, args=args, callback=my_print)
print('\n')

out, state = model.forward([187, 510, 1563, 310, 247], None)
print(out.detach().cpu().numpy())                   # get logits
out, state = model.forward([187, 510], None)
out, state = model.forward([1563], state)           # RNN has state (use deepcopy if you want to clone it)
out, state = model.forward([310, 247], state)
print(out.detach().cpu().numpy())                   # same result as above
print('\n')
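
The strategy strings described in the comments above can be read as a per-layer placement plan. The following is a minimal sketch of that reading, not the rwkv package's own parser: `layer_placement` is a hypothetical helper, and it assumes only the forms shown in the examples (`device dtype`, an optional `*N`, an optional trailing `+` for streaming, stages joined by `->`). As in the comments, `n_layer` counts [ln_out+head] as an extra layer.

```python
from typing import List, Tuple


def layer_placement(strategy: str, n_layer: int) -> List[Tuple[str, str, bool]]:
    """Return one (device, dtype, streamed) tuple per layer.

    Hypothetical illustration of the strategy notation; the real
    package may handle edge cases differently.
    """
    stages = []
    for chunk in strategy.split("->"):
        parts = chunk.split()
        device, dtype = parts[0], parts[1]
        count, stream = None, False
        if len(parts) > 2:  # e.g. "*6" or "*6+"
            spec = parts[2].lstrip("*")
            stream = spec.endswith("+")
            count = int(spec.rstrip("+"))
        stages.append((device, dtype, count, stream))

    # Layers not claimed by an explicit "*N" go to the streaming stage,
    # or to the stage written without a count.
    fixed = sum(c for _, _, c, _ in stages if c is not None)
    leftover = n_layer - fixed
    if leftover < 0:
        raise ValueError("strategy names more layers than the model has")

    plan: List[Tuple[str, str, bool]] = []
    for device, dtype, count, stream in stages:
        if count is None:
            plan += [(device, dtype, False)] * leftover
            leftover = 0
        else:
            plan += [(device, dtype, False)] * count
            if stream:
                plan += [(device, dtype, True)] * leftover
                leftover = 0
    if leftover:
        raise ValueError("strategy does not cover every layer")
    return plan


if __name__ == "__main__":
    # L12-D768 model: 12 blocks plus [ln_out+head] = 13 layers
    for layer, placement in enumerate(layer_placement("cuda fp16 *6 -> cpu fp32", 13)):
        print(layer, placement)
```

Under these assumptions, 'cuda fp16 *0+ -> cpu fp32 *1' on a 13-layer model yields 12 streamed cuda layers followed by one cpu layer, which matches the "Extreme STREAM" description.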

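The alpha_frequency, alpha_presence and token_ban arguments passed to PIPELINE_ARGS above adjust the logits before sampling. A rough sketch of the idea follows, using the formula from the "Frequency and presence penalties" documentation linked in the script. The function name `penalize` is hypothetical, and whether the rwkv pipeline applies the penalties in exactly this way is an assumption.

```python
import math
from collections import Counter
from typing import Iterable, List, Sequence


def penalize(logits: Sequence[float],
             generated: Iterable[int],
             alpha_frequency: float = 0.25,
             alpha_presence: float = 0.25,
             token_ban: Iterable[int] = ()) -> List[float]:
    """Return a copy of `logits` with bans and repetition penalties applied.

    Illustrative only: each token seen `count` times so far loses
    alpha_presence + count * alpha_frequency from its logit.
    """
    out = list(logits)
    for tok in token_ban:
        out[tok] = -math.inf  # a banned token can never be sampled
    for tok, count in Counter(generated).items():
        out[tok] -= alpha_presence + count * alpha_frequency
    return out


if __name__ == "__main__":
    # token 2 was generated twice, token 3 once, token 0 is banned
    print(penalize([1.0, 2.0, 3.0, 4.0], generated=[2, 2, 3], token_ban=[0]))
```

The presence term is a flat cost for any token that has appeared at all, while the frequency term grows with each repetition, so raising alpha_frequency discourages loops more strongly than raising alpha_presence.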

Release history

Version	Release date
0.8.26	Apr 26, 2024
0.8.25	Feb 10, 2024
0.8.24	Feb 1, 2024
0.8.23	Feb 1, 2024
0.8.22	Nov 17, 2023
0.8.21	Nov 15, 2023
0.8.20	Nov 3, 2023
0.8.19	Oct 31, 2023
0.8.18	Oct 31, 2023
0.8.17	Oct 30, 2023
0.8.16	Oct 8, 2023
0.8.15	Oct 7, 2023
0.8.14	Oct 5, 2023
0.8.13	Sep 27, 2023
0.8.12	Sep 4, 2023
0.8.11	Sep 2, 2023
0.8.10	Sep 2, 2023
0.8.9	Aug 5, 2023
0.8.8	Aug 4, 2023
0.8.7	Jul 29, 2023
0.8.6	Jul 29, 2023
0.8.5	Jul 29, 2023
0.8.0	Jun 26, 2023
0.7.5	Jun 10, 2023
0.7.4	May 19, 2023
0.7.3	Apr 4, 2023
0.7.2	Mar 30, 2023
0.7.1	Mar 22, 2023
0.7.0	Mar 19, 2023
0.6.2	Mar 18, 2023
0.6.1	Mar 18, 2023
0.6.0	Mar 17, 2023
0.5.0	Mar 15, 2023
0.4.2	Mar 13, 2023
0.4.1	Mar 13, 2023
0.4.0	Mar 13, 2023
0.3.1	Mar 12, 2023
0.3.0	Mar 12, 2023
0.2.1	Mar 11, 2023
0.2.0	Mar 8, 2023
0.1.0	Mar 7, 2023
0.0.9	Mar 7, 2023
0.0.8	Mar 6, 2023
0.0.7	Mar 5, 2023
0.0.6	Mar 1, 2023
0.0.5	Mar 1, 2023	(this version)
0.0.4	Mar 1, 2023
0.0.3	Mar 1, 2023
0.0.2	Mar 1, 2023
0.0.1	Mar 1, 2023

Download files

Source Distribution

rwkv-0.0.5.tar.gz (14.0 kB)

Uploaded Mar 1, 2023 Source

Built Distribution

rwkv-0.0.5-py3-none-any.whl (13.6 kB)

Uploaded Mar 1, 2023 Python 3

Hashes for rwkv-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`51f77b0ac2143512b77b8352fa536054be6135319d81dcaaae6bee582daf1445`
MD5	`bb5915ed19b69c8f27dfce00e2867d0f`
BLAKE2b-256	`dfad4a64a5ff146e503d87cfb6b5a7fba6a8d58800de7482888b7ffe72c9dd55`

Hashes for rwkv-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`49e9f7e5f28e6389851eede72f640e42f01e1ad1e03ff1a3e179c7d53137dead`
MD5	`82d7e5a36decfca2a17c91319be34bf6`
BLAKE2b-256	`fcff13fea92ebf7aa89e85032bd4bc801f6c6c34f63ba5158ea11a4e476a7fc7`
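
The digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of_file` and `verify` are illustrative, and the file paths are assumed to point at wherever the files were saved locally.

```python
import hashlib

# SHA256 digests copied from the tables above
EXPECTED_SHA256 = {
    "rwkv-0.0.5.tar.gz":
        "51f77b0ac2143512b77b8352fa536054be6135319d81dcaaae6bee582daf1445",
    "rwkv-0.0.5-py3-none-any.whl":
        "49e9f7e5f28e6389851eede72f640e42f01e1ad1e03ff1a3e179c7d53137dead",
}


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for block in iter(lambda: fh.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def verify(path: str, expected_hex: str) -> bool:
    """Return True when the file's SHA256 matches the published digest."""
    return sha256_of_file(path) == expected_hex.lower()


if __name__ == "__main__":
    name = "rwkv-0.0.5.tar.gz"  # assumed to be in the current directory
    print(name, "OK" if verify(name, EXPECTED_SHA256[name]) else "MISMATCH")
```

pip can perform the same check itself when a requirements file pins hashes and is installed with `--require-hashes`.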