A library for manipulating, loading, and saving corpus in iob2 format.
Reason this release was yanked:
Due to a lack of notes on licensing.
Project description
English description is under construction.
日本語での説明
import iob2
# iob2コーパス読み込み(ファイルから) [iob2]
corpus = iob2.load("./test_corpus.iob2")
# コーパスを文区切りにする
div_ls = [".", "?", "!"] # 文区切り文字一覧
# iob2コーパス書き出し(ファイルへ) [iob2]
iob2.dump(sent_corpus, "./sent_corpus.iob2")
詳細な説明は執筆中です。
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
iob2-1.1.0.tar.gz
(4.2 kB
view hashes)
Built Distribution
iob2-1.1.0-py3-none-any.whl
(4.8 kB
view hashes)