Data loaders and abstractions for text and NLP
Project description
LanguageFlow
Data loaders and abstractions for text and NLP
Requirements
Install dependencies
$ pip install future, tox, joblib
$ pip install numpy scipy pandas scikit-learn==0.19.1
$ pip install python-crfsuite==0.9.5
$ pip install Cython
$ pip install -U fasttext --no-cache-dir --no-deps --force-reinstall
$ pip install xgboost
Installation
$ pip install languageflow
Components
Transformers: NumberRemover, CountVectorizer, TfidfVectorizer
Models: SGDClassifier, XGBoostClassifier, KimCNNClassifier, FastTextClassifier, CRF
Data
Download a dataset using download command
$ languageflow download DATASET
List all dataset
$ languageflow list
Datasets
The datasets module currently contains:
PlaintextCorpus: VNESES, VNTQ_SMALL, VNTQ_BIG
CategorizedCorpus: VNTC
History
1.1.7 (2018-04-12)
Automatic deploy with travis and pypi
Fix dependencies hell
1.1.6 (2017-12-26)
Add data module to handle data downloading and data preprocessing
Add many new models: SGDClassifier, XGBoostClassier, FastTextClassifier, CRF
Add new feature: LanguageBoard
Automatic continuous integration with travis-ci
Build docs with readthedocs.org
1.1.5 (2017-12-11)
Refactor project to integrate with underthesea experiment
0.1.0 (2017-09-18)
First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for languageflow-1.1.12a2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fd3395d60060293675aaf9077a24255e059b3666879e7ab734be18e0db23c2e6 |
|
MD5 | 55e168304188ba3a8aabf6410ee72828 |
|
BLAKE2b-256 | f2ce90f3629beaea8ea248775945aa3bb73d1da54e5be10de6ad2f9b8a3e64b3 |