A python package that allows the user to annotate corpuses for FastText
Project description
Annotates corpuses for use in FastText over a Command Line Interface.
Raison D'être :thought_balloon:
This allows a large corpus to be annotated easily using a command line interface for purposes of being used in a FastText supervised classifier.
Architecture :triangular_ruler:
fasttext-annotator
has a very simple architecture that basically reads each line of the file fed to it, and asks the user for an annotation given the options provided to it via the command line.
Dependencies :globe_with_meridians:
Installation :inbox_tray:
You can install this via the PyPi package manager, or manually.
Manually
- Run
python setup.py --install
Via Pypi
- Run
pip install fasttextannotator
Usage example :eyes:
In order to annotate a corpus of text, execute the following:
fasttextannotator [corpus-file] [output-file] [labels...]
You will then be presented with a line of text from the corpus, and a range of options based on the labels you provided.
License :memo:
The project is available under the MIT license.
Acknowledgements
- Icon in README banner is text by Chameleon Design from the Noun Project.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file fasttextannotator-1.0.0-py2.7.egg
.
File metadata
- Download URL: fasttextannotator-1.0.0-py2.7.egg
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 461ecfbdd918fd2164250a32e929c3fbb32c4f3eaca5dee6d093070f3c9d5404 |
|
MD5 | c3bf56b3931617ca78004876a755ae17 |
|
BLAKE2b-256 | 8ed586cb0face32cfa1096deab1612b866ef0360ae57f9a3f5dd1c03c30967f9 |