Sequence Tagging powered by the Averaged Perceptron.

Part of Speech Tagging

A Part of Speech tagger using the Averaged Perceptron.

Based on the tagger from here
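To make the "averaged" part concrete, here is a minimal sketch of an averaged perceptron in plain Python. It is not this package's code: the class name `AveragedPerceptron` and its methods are hypothetical, chosen for illustration. The idea is that weights are updated only on mistakes, and the final weights are the average of the values each weight held across all training steps, which tends to be more stable than the last set of weights.

```python
from collections import defaultdict


class AveragedPerceptron:
    """Hypothetical multiclass perceptron with lazily averaged weights."""

    def __init__(self, classes):
        self.classes = sorted(classes)
        # feature -> label -> weight
        self.weights = defaultdict(lambda: defaultdict(float))
        # (feature, label) -> weight accumulated over the steps seen so far
        self._totals = defaultdict(float)
        # (feature, label) -> step at which the weight last changed
        self._timestamps = defaultdict(int)
        self.steps = 0

    def predict(self, features):
        scores = defaultdict(float)
        for feat in features:
            for label, weight in self.weights.get(feat, {}).items():
                scores[label] += weight
        # Ties are broken by label name so the result is deterministic.
        return max(self.classes, key=lambda label: (scores[label], label))

    def update(self, truth, guess, features):
        """Count one training step; change weights only if the guess was wrong."""
        self.steps += 1
        if truth == guess:
            return
        for feat in features:
            self._adjust(feat, truth, 1.0)
            self._adjust(feat, guess, -1.0)

    def _adjust(self, feat, label, delta):
        key = (feat, label)
        current = self.weights[feat][label]
        # Credit the old value for every step it stayed unchanged.
        self._totals[key] += (self.steps - self._timestamps[key]) * current
        self._timestamps[key] = self.steps
        self.weights[feat][label] = current + delta

    def average(self):
        """Replace each weight with its average over all training steps."""
        for feat, by_label in self.weights.items():
            for label, weight in by_label.items():
                key = (feat, label)
                total = self._totals[key]
                total += (self.steps - self._timestamps[key]) * weight
                by_label[label] = total / self.steps if self.steps else 0.0


# Toy usage with made-up feature strings.
model = AveragedPerceptron({"DT", "NN"})
data = [(["i word the", "bias"], "DT"), (["i word dog", "bias"], "NN")]
for _ in range(5):
    for feats, tag in data:
        model.update(tag, model.predict(feats), feats)
model.average()
```

The timestamps avoid touching every weight on every step: a weight's contribution to the running total is only settled when it changes, or once at the end in `average()`.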

The tagger uses the following features:

  • The suffix (last 3 characters) of the current word (unnormalized).
  • The prefix (first character) of the current word (unnormalized).
  • The current word.
  • The previous Part of Speech tag and the current word.
  • The previous Part of Speech tag.
  • The Part of Speech tag of the word before last.
  • Both of the previous Part of Speech tags.
  • The previous word.
  • The suffix of the previous word.
  • The word from 2 steps back.
  • The next word.
  • The suffix of the next word.
  • The word after next.
  • A bias term.
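The feature list above can be sketched as a single extraction function. This is an illustration under stated assumptions, not the package's actual code: the function name `extract_features`, the feature name strings, the padding symbols in `START` and `END`, and the use of lowercasing as the word normalization are all hypothetical. The neighbouring-word suffixes are assumed to be 3 characters long, matching the current-word suffix.

```python
# Hypothetical padding tokens so that i-2 and i+2 always exist.
START = ["-START-", "-START2-"]
END = ["-END-", "-END2-"]


def extract_features(i, word, context, prev, prev2):
    """Return the feature set for one token.

    i       -- index of the token in the padded `context`
    word    -- the raw (unnormalized) current word
    context -- START + normalized words + END
    prev    -- tag predicted for the previous word
    prev2   -- tag predicted for the word before that
    """
    features = set()

    def add(name, *args):
        features.add(" ".join((name,) + args))

    add("bias")
    add("i suffix", word[-3:])           # suffix of the raw word
    add("i pref1", word[:1])             # first character of the raw word
    add("i word", context[i])
    add("i-1 tag+i word", prev, context[i])
    add("i-1 tag", prev)
    add("i-2 tag", prev2)
    add("i-1 tag+i-2 tag", prev, prev2)
    add("i-1 word", context[i - 1])
    add("i-1 suffix", context[i - 1][-3:])
    add("i-2 word", context[i - 2])
    add("i+1 word", context[i + 1])
    add("i+1 suffix", context[i + 1][-3:])
    add("i+2 word", context[i + 2])
    return features


# Usage: features for the first word of a three-word sentence.
words = ["The", "dog", "barks"]
context = START + [w.lower() for w in words] + END  # lowercasing is an assumption
feats = extract_features(len(START), words[0], context, "-START-", "-START2-")
```

Because the tag features depend on earlier predictions, a tagger built this way would label a sentence left to right, feeding each predicted tag into the features of the following words.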

The package includes the following pretrained models:

  • POS tagger, trained on the CoNLL 2000 Chunking data
  • Chunker, trained on the CoNLL 2000 Chunking data
  • Slot filler, trained on ATIS data

