Skip to main content

Sequence Tagging powered by the Averaged Perceptron.

Project description

Part of Speech Tagging

A Part of Speech tagger using the Average Perceptron.

Based on the tagger from here

This uses the following features:

  • The Suffix (last 3 characters) of the current word (unnormalized).
  • The Prefix (first character) of the current word (unnormalized).
  • The current word.
  • The previous Part of Speech tag and the current word.
  • The Previous Part of Speech tag.
  • The Part of Speech tag from the word before last.
  • Both of the previous Part of Speech tags.
  • The previous word.
  • The previous word suffix.
  • The word from 2 steps back.
  • The next word.
  • The next word suffix.
  • The word after next.
  • A Bias

Includes the following Pretrained models.

  • POS Tagger, Trained on the CoNLL 2000 Chunking data
  • Chunker, Trained on the CoNLL 2000 Chunking data
  • Slot filler, Trained on ATIS data

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for sequence-tagging, version 0.1.6
Filename, size File type Python version Upload date Hashes
Filename, size sequence_tagging-0.1.6.tar.gz (2.9 MB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page