
A Python package for annotating corpora for FastText


fasttext-annotator

Annotates corpora for use in FastText through a command-line interface.


Raison d'être :thought_balloon:

fasttext-annotator lets you annotate a large corpus from the command line so that the result can be used to train a FastText supervised classifier.

Architecture :triangular_ruler:

fasttext-annotator has a very simple architecture: it reads the input file line by line and, for each line, asks the user to choose an annotation from the labels supplied on the command line.
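The read-and-ask loop described above can be sketched roughly as follows. This is an illustration and not the package's actual source: the function name `annotate_lines` and the injectable `ask` callback are hypothetical, the sketch targets Python 3, and it assumes the output uses fastText's default `__label__` prefix, which this README does not confirm.

```python
from typing import Callable, Iterable, Iterator, Sequence

# fastText's default prefix for labels in supervised training files.
FASTTEXT_PREFIX = "__label__"


def annotate_lines(
    lines: Iterable[str],
    labels: Sequence[str],
    ask: Callable[[str, Sequence[str]], str],
) -> Iterator[str]:
    """Yield one fastText-style line per non-empty input line.

    `ask` is a hypothetical callback that shows the text to the user
    and returns the chosen label; in a real CLI it would wrap input().
    """
    for raw in lines:
        text = raw.strip()
        if not text:
            continue  # skip blank lines instead of asking about them
        label = ask(text, labels)
        if label not in labels:
            raise ValueError(f"unknown label: {label!r}")
        yield f"{FASTTEXT_PREFIX}{label} {text}"
```

Passing the prompt in as a callback keeps the loop independent of the terminal, so the same function could be driven by `input()` interactively or by a stub in tests.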

Dependencies :globe_with_meridians:

Installation :inbox_tray:

You can install this package from PyPI using pip, or manually from source.

Manually

  1. Run python setup.py install

Via PyPI

  1. Run pip install fasttextannotator

Usage example :eyes:

In order to annotate a corpus of text, execute the following:

fasttextannotator [corpus-file] [output-file] [labels...]

You will then be presented with a line of text from the corpus, and a range of options based on the labels you provided.
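As a rough sketch of how the labels given on the command line might be turned into a set of options, the snippet below numbers each label and maps the user's reply back to a label. The helper names `format_prompt` and `parse_choice` and the numbered-menu layout are assumptions made for illustration (written for Python 3); the tool's real prompt may look different.

```python
from typing import Optional, Sequence


def format_prompt(text: str, labels: Sequence[str]) -> str:
    """Build a prompt showing the corpus line and one numbered option per label."""
    options = "\n".join(
        f"  [{i}] {label}" for i, label in enumerate(labels, start=1)
    )
    return f"{text}\n{options}\nChoice: "


def parse_choice(raw: str, labels: Sequence[str]) -> Optional[str]:
    """Map a reply such as "2" to the matching label, or None if it is invalid."""
    try:
        index = int(raw.strip())
    except ValueError:
        return None  # not a number at all
    if 1 <= index <= len(labels):
        return labels[index - 1]
    return None  # number outside the offered range
```

Returning `None` for an invalid reply lets the caller simply ask again rather than abort the annotation session.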

License :memo:

The project is available under the MIT license.

Acknowledgements

  • The icon in the README banner is "text" by Chameleon Design from the Noun Project.


Files for fasttextannotator, version 1.0.0
Filename, size: fasttextannotator-1.0.0-py2.7.egg (4.3 kB)
File type: Egg
Python version: 2.7
