Skip to main content

LLM tokenizers tools

Project description

llm_tokenizers

介绍

收集llm的各种 tokenizer

软件架构

软件架构说明

项目安装教程

  1. 克隆项目到本地:
git clone https://gitee.com/sky_flash/llm_tokenizers.git
  1. 进入项目目录:
cd llm_tokenizers
  1. 使用 pip 安装依赖:
pip install -r requirements.txt

软件包安装程

使用说明

  1. xxxx
  2. xxxx
  3. xxxx

项目打包

  1. 确保已安装构建工具:
pip install build
  1. 在项目根目录下执行打包命令:
python -m build

打包完成后,生成的 .whl.tar.gz 文件会保存在 dist/ 目录下。

  1. 安装打包好的 .whl 文件(以生成的文件名为例):
pip install dist/llm_tokenizers-0.1.0-py3-none-any.whl

参与贡献

  1. Fork 本仓库
  2. 新建 Feat_xxx 分支
  3. 提交代码
  4. 新建 Pull Request

特技

  1. 使用 Readme_XXX.md 来支持不同的语言,例如 Readme_en.md, Readme_zh.md
  2. Gitee 官方博客 blog.gitee.com
  3. 你可以 https://gitee.com/explore 这个地址来了解 Gitee 上的优秀开源项目
  4. GVP 全称是 Gitee 最有价值开源项目,是综合评定出的优秀开源项目
  5. Gitee 官方提供的使用手册 https://gitee.com/help
  6. Gitee 封面人物是一档用来展示 Gitee 会员风采的栏目 https://gitee.com/gitee-stars/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_tokenizers-0.1.0.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llm_tokenizers-0.1.0-py3-none-any.whl (2.0 MB view details)

Uploaded Python 3

File details

Details for the file llm_tokenizers-0.1.0.tar.gz.

File metadata

  • Download URL: llm_tokenizers-0.1.0.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for llm_tokenizers-0.1.0.tar.gz
Algorithm Hash digest
SHA256 902defa2b62dd1f704de3c3235d9ec8ba97919e88d1f4e587bd8d4bfe9ea427b
MD5 ed7e857aeb0627d4c6afeb14a0d5e289
BLAKE2b-256 4a7eecb705f993dcf9480ce58e54aab488a6335bddc2f920feaa56d45e60288b

See more details on using hashes here.

File details

Details for the file llm_tokenizers-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: llm_tokenizers-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 2.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for llm_tokenizers-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bf66b522ac891410e8f7b247ccb08d32a82ddfd79a3aaf1660e15f95e76a887d
MD5 f2b00372165e06eedab3ee52ba9cca34
BLAKE2b-256 86fd7bd6877903d0cfc487229c1144ff79078bb3aae1dbc69bcbb92f7ada9755

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page