Project description

OSC-LLM

📌 简介

📌 安装

安装最新版本pytorch
安装osc-llm: pip install osc-llm

📌 快速开始

# 下面以llama3为例演示如何转换为osc-llm格式,并进行聊天。
# 假设你已经下载好huggingface的llama3模型在checkpoints/meta-llama目录下
# 1. 转换模型
llm convert --checkpoint_dir checkpoints/meta-llama/Meta-Llama-3-8B-Instruct
# 2. 量化模型
llm quantize int8 --checkpoint_dir checkpoints/meta-llama/Meta-Llama-3-8B-Instruct --save_dir checkpoints/meta-llama/Meta-Llama-3-8B-Instruct-int8
# 3. 聊天(使用编译功能加速推理速度,需要等待几分钟编译时间)
llm chat --checkpoint_dir checkpoints/meta-llama/Meta-Llama-3-8B-Instruct-int8 --compile true
# 4. 部署简易版本openai服务
llm serve --checkpoint_dir checkpoints/meta-llama/Meta-Llama-3-8B-Instruct-int8

📌 模型支持

以下huggingface中的模型结构(查看config.json)已经支持转换为osc-llm格式:

LlamaForCausalLM: llama2, llama3, chinese-alpaca2等。
Qwen2ForCausalLM: qwen1.5系列。
Qwen2MoeForCausalLM: qwen2-moe系列(目前无法完成编译,推理速度很慢)。

致敬

本项目参考了大量的开源项目，特别是以下项目：

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.5

May 16, 2024

This version

0.1.4

May 14, 2024

0.1.3

May 9, 2024

0.1.2

May 7, 2024

0.1.1

Apr 30, 2024

0.1.0

Sep 17, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

osc_llm-0.1.4.tar.gz (33.8 kB view hashes)

Uploaded May 14, 2024 Source

Built Distribution

osc_llm-0.1.4-py3-none-any.whl (47.5 kB view hashes)

Uploaded May 14, 2024 Python 3

Hashes for osc_llm-0.1.4.tar.gz

Hashes for osc_llm-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`e3285e2eba8ef19c9b3f31379518725eb3db2241355d2d6787327c0be1b798fc`
MD5	`3e4d5e899beae735f2f1f7e803cbf499`
BLAKE2b-256	`cbd88d80b746be3727b69197efcdb80b5053ee8a7806399e3c1ce056eeabd8bf`

Hashes for osc_llm-0.1.4-py3-none-any.whl

Hashes for osc_llm-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f423096858f0cb9b2e2b3df235d30b8e08604016600f43236546bacb6ba6a442`
MD5	`c0319dc638dd5235483bd1aa8bf228e6`
BLAKE2b-256	`966ad3b793b6c80a4ad9f4100c9380387e0540f3480c4cdb212f858ffee41bf3`