Skip to main content

A python package that allows the user to annotate corpuses for FastText

Project description

fasttext-annotator

Annotates corpuses for use in FastText over a Command Line Interface.

CircleCI PyPi version

Raison D'être :thought_balloon:

This allows a large corpus to be annotated easily using a command line interface for purposes of being used in a FastText supervised classifier.

Architecture :triangular_ruler:

fasttext-annotator has a very simple architecture that basically reads each line of the file fed to it, and asks the user for an annotation given the options provided to it via the command line.

Dependencies :globe_with_meridians:

Installation :inbox_tray:

You can install this via the PyPi package manager, or manually.

Manually

  1. Run python setup.py --install

Via Pypi

  1. Run pip install fasttextannotator

Usage example :eyes:

In order to annotate a corpus of text, execute the following:

fasttextannotator [corpus-file] [output-file] [labels...]

You will then be presented with a line of text from the corpus, and a range of options based on the labels you provided.

License :memo:

The project is available under the MIT license.

Acknowledgements

  • Icon in README banner is text by Chameleon Design from the Noun Project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

fasttextannotator-1.0.0-py2.7.egg (4.3 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page