
A Python package for annotating corpora for FastText


fasttext-annotator

Annotates corpora for use with FastText through a command-line interface.


Raison D'être :thought_balloon:

This tool makes it easy to annotate a large corpus from the command line, so that the result can be used to train a FastText supervised classifier.

Architecture :triangular_ruler:

fasttext-annotator has a very simple architecture: it reads the input file line by line and, for each line, asks the user to choose an annotation from the labels provided on the command line.
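The read-and-ask loop described above can be sketched roughly as follows. This is an illustration only, not the package's actual source: the function name `annotate`, the injectable `ask` callback, the numbered menu, and the skipping of blank lines are all assumptions. The `__label__` prefix is the convention FastText expects in supervised training data; whether fasttext-annotator writes exactly this layout is likewise assumed here.

```python
from typing import Callable, Iterable, Iterator, Sequence


def annotate(
    lines: Iterable[str],
    labels: Sequence[str],
    ask: Callable[[str], str] = input,
) -> Iterator[str]:
    """Yield one FastText-style training line per non-empty corpus line.

    Hypothetical helper: `ask` shows a prompt and returns the user's reply.
    It defaults to the built-in input() but can be swapped out for testing.
    """
    # Build a numbered menu such as "1=pos, 2=neg" from the given labels.
    menu = ", ".join(f"{i}={label}" for i, label in enumerate(labels, start=1))
    for raw in lines:
        text = raw.strip()
        if not text:
            continue  # assumption: blank lines are skipped
        # Keep asking until the reply is a valid option number.
        while True:
            answer = ask(f"{text}\n[{menu}] > ").strip()
            if answer.isdecimal() and 1 <= int(answer) <= len(labels):
                break
        yield f"__label__{labels[int(answer) - 1]} {text}"
```

Passing the prompt function in as a parameter keeps the loop easy to exercise without a terminal; an interactive run would simply rely on the default `input`.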

Dependencies :globe_with_meridians:

Installation :inbox_tray:

You can install this package from PyPI with pip, or manually from source.

Manually

  1. Run python setup.py install

Via PyPI

  1. Run pip install fasttextannotator

Usage example :eyes:

In order to annotate a corpus of text, execute the following:

fasttextannotator [corpus-file] [output-file] [labels...]

You will then be presented with a line of text from the corpus, and a range of options based on the labels you provided.
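The argument layout shown above (a corpus file, an output file, then one or more labels) could be parsed along these lines. This is a sketch under assumptions: the package's real argument handling is not shown in this README, and the names `build_parser`, `parse_args`, `corpus_file`, `output_file`, and `labels` are hypothetical.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring: fasttextannotator [corpus-file] [output-file] [labels...]"""
    parser = argparse.ArgumentParser(prog="fasttextannotator")
    parser.add_argument("corpus_file", help="text file with one example per line")
    parser.add_argument("output_file", help="file the annotated lines are written to")
    # nargs="+" requires at least one label and collects the rest into a list.
    parser.add_argument("labels", nargs="+", help="labels the annotator can choose from")
    return parser


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse argv, or sys.argv[1:] when argv is None."""
    return build_parser().parse_args(argv)
```

For example, with hypothetical file names, `parse_args(["reviews.txt", "reviews.train", "positive", "negative"])` yields a namespace whose `labels` attribute is the list of the two labels.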

License :memo:

The project is available under the MIT license.

Acknowledgements

  • The icon in the README banner is "text" by Chameleon Design from the Noun Project.

