Thai Word Segmentation using TCC + Bidirectional RNNs
Project description
NokCut
Thai Word Segmentation using TCC + Bidirectional RNNs
Credit code from A Beginner's Guide to Deep NLP with PyTorch - Dr. Prachya Boonkwan
Colab Notebook : https://colab.research.google.com/drive/1WS08VsjlZGAmCGsoI7AlRm-Do3zo-b-g
Train by BEST I Corpus Training set. (90% training , 10% test)
ep 6
loss: 0.017879242024514966
f1 : 98.47012481095481
F1 From BEST I Corpus Test set
F-measure: 96.94929
Recall: 122271.00000/125850.00000 = 97.15614
Precision: 122271.00000/126387.00000 = 96.74333
Number of incorrect : 3579.00000 words
Mr. Wannaphong Phatthiyaphaibun wannaphong@kkumail.com
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
nokcut-0.4-py3-none-any.whl
(5.9 MB
view details)
File details
Details for the file nokcut-0.4-py3-none-any.whl
.
File metadata
- Download URL: nokcut-0.4-py3-none-any.whl
- Upload date:
- Size: 5.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/39.1.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6687f5292a4031e13688f220cc4f79770b12d86773842c5fb4d23aee1d797419 |
|
MD5 | 1137ce8459f8390d03121c848f363790 |
|
BLAKE2b-256 | fd2c7b46a8a468e6e9dab19c97c738c6c1e52a3a1995cc0997aa2b36f867461e |