baselibs
Project description
# 通用基础库
版本: v0.1.15
- 修正LLM_TASK类的BUG
- 优化所有代码
版本: v0.1.14
- 优化 RedisCache 类
- 优化 TimeCount 类
版本: v0.1.12
- 完善 RedisCache 类,增加密码和数据库参数;
版本: v0.1.11
- 修正LLM_TASK类的BUG
- LLM_TASK类支持qwen3, qwq模型;
- 优化计时器类
版本: v0.1.10
- 增加了LLM_TASK类,用于大模型通用任务
- 修正 RedisCache类中 data_table 无效的bug
- 优化原有代码;
版本: v0.1.9
- 扩展了对json格式文件的读写支持:readjson, readjsonp, savejson, savejsonp
- 增加了MutliTask 多进程任务类
- 优化了计时器类:TimeCount
版本: v0.1.5
- 修改了splitset方法,可用于拆分数据集
- 增加 split_dataframe方法,可对DataFrame进行拆分数据集;
- 增加 分层抽取方法: data_split, save_data_split
版本: v0.1.4
- 修改了TimeCount类
版本: v0.1.1
可对目录下的文件进行以下批量处理:
- 清除空格 空行 按句子分行;
- 删除空文件,找到后改名(改为"原文件名.del") 或者直接删除
- 删除重复的文件: 根据文件的MD5判断文件是否相同,找到后改名(原文件.same)或者直接删除
- 批量重命名: 可按序号进行重命名,默认从1开始,文件名会自动在前面补0,例如"0001.txt"
- 可统计文本文件的行数 [2019/1/18 添加]
- 对数据进行检查;
- 对数据重复数据检查并删除;
- 对数据进行随机抽样;
- 处理参数可以自定义顺序,
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
baselibs-0.1.15.tar.gz
(45.0 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
baselibs-0.1.15-py3-none-any.whl
(45.3 kB
view details)
File details
Details for the file baselibs-0.1.15.tar.gz.
File metadata
- Download URL: baselibs-0.1.15.tar.gz
- Upload date:
- Size: 45.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2b60399d9c223b03436e01d247140a7df1182ccee01cef2cf054d5e0cd392352
|
|
| MD5 |
2271f6bc7ffbb8464607e8d486117bcf
|
|
| BLAKE2b-256 |
a1144bde28c34dd010b0404f2173f44d3ae2c03706be3dd1001e12d460b219e4
|
File details
Details for the file baselibs-0.1.15-py3-none-any.whl.
File metadata
- Download URL: baselibs-0.1.15-py3-none-any.whl
- Upload date:
- Size: 45.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7f48dc8e64efa469a33df4630fd471fbe4ac1ad6ff49411f19c5688fe600d1c9
|
|
| MD5 |
abd7ab8ef781dd8885a93443c4649505
|
|
| BLAKE2b-256 |
131cce6c84d1f0cb22e8f91385502148b8006fa5ce033b26b985fd8a04a71024
|