
Sequence Tagging powered by the Averaged Perceptron.

Project description

Part of Speech Tagging

A Part of Speech tagger using the Averaged Perceptron.

Based on the tagger from here
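For readers unfamiliar with the averaged perceptron, the following is a minimal sketch of the general technique, not this package's actual code. The class name, attribute names, and method names are all assumptions made for illustration. The model keeps one weight per (feature, tag) pair, nudges the weights whenever a prediction is wrong, and at the end of training replaces each weight with its average over all training steps, which tends to generalize better than the final weights alone.

```python
from collections import defaultdict


class AveragedPerceptron:
    """Illustrative averaged perceptron (hypothetical names throughout)."""

    def __init__(self):
        # feature -> tag -> current weight
        self.weights = defaultdict(lambda: defaultdict(float))
        self.classes = set()
        # (feature, tag) -> weight accumulated over all steps so far
        self._totals = defaultdict(float)
        # (feature, tag) -> step at which the weight was last changed
        self._stamps = defaultdict(int)
        self.steps = 0

    def predict(self, features):
        """Return the highest-scoring tag for a set of active features."""
        scores = defaultdict(float)
        for feat in features:
            for tag, weight in self.weights.get(feat, {}).items():
                scores[tag] += weight
        # Break ties on the tag name so the result is deterministic.
        return max(self.classes, key=lambda tag: (scores[tag], tag))

    def update(self, truth, guess, features):
        """Reward the true tag and penalize the wrong guess."""
        self.steps += 1
        if truth == guess:
            return
        for feat in features:
            for tag, delta in ((truth, 1.0), (guess, -1.0)):
                key = (feat, tag)
                # Lazily account for the steps during which the weight
                # stayed unchanged, then apply the new delta.
                elapsed = self.steps - self._stamps[key]
                self._totals[key] += elapsed * self.weights[feat][tag]
                self._stamps[key] = self.steps
                self.weights[feat][tag] += delta

    def average(self):
        """Replace every weight with its average over all steps."""
        if self.steps == 0:
            return
        for feat, tag_weights in self.weights.items():
            for tag, weight in tag_weights.items():
                key = (feat, tag)
                elapsed = self.steps - self._stamps[key]
                total = self._totals[key] + elapsed * weight
                tag_weights[tag] = total / self.steps
```

In a training loop one would call `predict`, then `update` with the gold tag, for every token, and call `average` once after the final epoch.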

The tagger uses the following features:

  • The suffix (last 3 characters) of the current word (unnormalized).
  • The prefix (first character) of the current word (unnormalized).
  • The current word.
  • The previous Part of Speech tag together with the current word.
  • The previous Part of Speech tag.
  • The Part of Speech tag of the word before last.
  • Both of the previous Part of Speech tags together.
  • The previous word.
  • The suffix of the previous word.
  • The word from 2 steps back.
  • The next word.
  • The suffix of the next word.
  • The word after next.
  • A bias term.
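The feature list above can be sketched as a single function. This is an illustration only and not the package's API: the function name, the feature string formats, and the padding tokens are hypothetical, and the 3-character suffix length for the neighboring words is an assumption (the list only states it for the current word).

```python
def extract_features(words, i, prev_tag, prev2_tag):
    """Return the set of active feature strings for the word at index i.

    words      -- list of tokens in the sentence
    prev_tag   -- tag predicted for the previous word
    prev2_tag  -- tag predicted for the word before that
    """
    # Hypothetical padding tokens so that neighbors always exist.
    start = ["-START-", "-START2-"]
    end = ["-END-", "-END2-"]
    context = start + list(words) + end
    i += len(start)

    word = context[i]
    prev_word = context[i - 1]
    next_word = context[i + 1]

    return {
        "bias",                                    # always active
        f"i suffix {word[-3:]}",                   # last 3 characters
        f"i pref1 {word[:1]}",                     # first character
        f"i word {word}",
        f"i-1 tag+i word {prev_tag} {word}",
        f"i-1 tag {prev_tag}",
        f"i-2 tag {prev2_tag}",
        f"i-1 tag+i-2 tag {prev_tag} {prev2_tag}",
        f"i-1 word {prev_word}",
        f"i-1 suffix {prev_word[-3:]}",            # assumed 3 characters
        f"i-2 word {context[i - 2]}",
        f"i+1 word {next_word}",
        f"i+1 suffix {next_word[-3:]}",            # assumed 3 characters
        f"i+2 word {context[i + 2]}",
    }


features = extract_features(["The", "cat", "sat"], 1, "DT", "-START-")
```

Encoding each feature as a string keeps the model a simple mapping from feature string to per-tag weights; the tag features come from the tagger's own earlier predictions, so tagging proceeds left to right.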

The package includes the following pretrained models:

  • POS Tagger, Trained on the CoNLL 2000 Chunking data
  • Chunker, Trained on the CoNLL 2000 Chunking data
  • Slot filler, Trained on ATIS data


Source Distribution

sequence_tagging-0.1.6.tar.gz (2.9 MB)


Hashes for sequence_tagging-0.1.6.tar.gz
Algorithm Hash digest
SHA256 daf080cca294ae075d6f02e214019e1d242a7e9275be31bd9634dcde04675adf
MD5 4b41ac7afa3d460a3470ef4a6d3ca5a5
BLAKE2b-256 8f281eb9cd5d5ea6dcb8dbfcc58df3dc7b3333a3db16da2ee18717aaf876db6d
