Skip to main content

Sequence Tagging powered by the Averaged Perceptron.

Project description

Part of Speech Tagging

A Part of Speech tagger using the Average Perceptron.

Based on the tagger from here

This uses the following features:

  • The Suffix (last 3 characters) of the current word (unnormalized).
  • The Prefix (first character) of the current word (unnormalized).
  • The current word.
  • The previous Part of Speech tag and the current word.
  • The Previous Part of Speech tag.
  • The Part of Speech tag from the word before last.
  • Both of the previous Part of Speech tags.
  • The previous word.
  • The previous word suffix.
  • The word from 2 steps back.
  • The next word.
  • The next word suffix.
  • The word after next.
  • A Bias

Includes the following Pretrained models.

  • POS Tagger, Trained on the CoNLL 2000 Chunking data
  • Chunker, Trained on the CoNLL 2000 Chunking data
  • Slot filler, Trained on ATIS data

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sequence_tagging-0.1.6.tar.gz (2.9 MB view hashes)

Uploaded source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page