A library for loading, manipulating, and saving corpora in IOB2 format.
English description is under construction.
Description (originally in Japanese)
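import iob2
# Load an IOB2 corpus from a file [iob2]
corpus = iob2.load("./test_corpus.iob2")
# Split the corpus into sentences
div_ls = [".", "?", "!"]  # list of sentence-delimiter tokens
# (the call that builds sent_corpus from corpus and div_ls is not shown here)
# Write the IOB2 corpus to a file [iob2]
iob2.dump(sent_corpus, "./sent_corpus.iob2")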
A detailed description is in preparation.
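Until that description is available, the sentence-splitting step can be illustrated with a standalone sketch that does not use the iob2 package at all. It assumes the conventional IOB2 text layout (one token and one tag per line, separated by whitespace) and represents a corpus as a list of (token, tag) pairs. The helper names `parse_iob2` and `split_sentences` are hypothetical, and the library's own in-memory representation and splitting function may differ.

```python
from typing import List, Sequence, Tuple

Token = Tuple[str, str]  # (surface form, IOB2 tag); an assumed representation


def parse_iob2(text: str) -> List[Token]:
    """Parse 'token<whitespace>tag' lines into pairs, skipping blank lines."""
    pairs: List[Token] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the last run of whitespace so the tag is the final field.
        token, tag = line.rsplit(None, 1)
        pairs.append((token, tag))
    return pairs


def split_sentences(pairs: Sequence[Token], delimiters: Sequence[str]) -> List[List[Token]]:
    """Close a sentence after each delimiter token that is tagged 'O'.

    Delimiters inside an entity (B-/I- tags) are not treated as boundaries,
    so no entity span is cut in half.
    """
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for token, tag in pairs:
        current.append((token, tag))
        if token in delimiters and tag == "O":
            sentences.append(current)
            current = []
    if current:  # keep trailing tokens that lack a final delimiter
        sentences.append(current)
    return sentences


sample = """\
Alice B-PER
lives O
in O
New B-LOC
York I-LOC
. O
Really O
? O
"""

div_ls = [".", "?", "!"]  # same delimiter list as in the usage snippet
sentences = split_sentences(parse_iob2(sample), div_ls)
```

Splitting only on delimiters tagged `O` is a design choice of this sketch: it keeps entities such as an abbreviation containing a period from being divided across two sentences.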
License notes
This software is released under the Creative Commons CC0 license, but it internally uses seqeval, an MIT-licensed tool distributed on PyPI.
The license notice for seqeval follows.
@misc{seqeval,
title={{seqeval}: A Python framework for sequence labeling evaluation},
url={https://github.com/chakki-works/seqeval},
note={Software available from https://github.com/chakki-works/seqeval},
author={Hiroki Nakayama},
year={2018},
}