Terry toolkit tkitDatasetEx 构建数据集过程中的方便,
Project description
BulidDataset
数据集预处理
包含lm,孪生,seq2seq,文本分类,示例参考dataDome目录下
默认使用Bert 21128分词方案,如果想要修改自己的分词可以修改config下的词典方案。
Getting Started
Download links:
SSH clone URL: ssh://git@git.jetbrains.space/terrychanorg/yuxunlianlm-bert/BulidDataset.git
HTTPS clone URL: https://git.jetbrains.space/terrychanorg/yuxunlianlm-bert/BulidDataset.git
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
Prerequisites
What things you need to install the software and how to install them.
Examples
Deployment
Add additional notes about how to deploy this on a production system.
Resources
Add links to external resources for this project, such as CI server, bug tracker, etc.
关于tkitDatasetEx
tkitDatasetEx各种函数
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file tkitDatasetEx-0.0.0.116399031.tar.gz
.
File metadata
- Download URL: tkitDatasetEx-0.0.0.116399031.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6caf560569e833bed17fdeb4f587b97912c9b637e80533c6338ab2a39239c164 |
|
MD5 | e9f26fcde0ac8822c770b71d5403a164 |
|
BLAKE2b-256 | 605efe2fb3570076e4f294e603854056d550257eb96b8dae65846b39f7b18099 |
File details
Details for the file tkitDatasetEx-0.0.0.116399031-py3-none-any.whl
.
File metadata
- Download URL: tkitDatasetEx-0.0.0.116399031-py3-none-any.whl
- Upload date:
- Size: 4.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 583263853a9a6f43d82d2e1c22d5fd93825e2ced3ef73e5e2cc3f528ed3c1c5b |
|
MD5 | 0395cdad5f41165db76d5900177553b5 |
|
BLAKE2b-256 | c1b0a2392fadeac98e30fa3c7be91ab179bf19904cc83422b71705055596eae0 |