Skip to main content

Rungsted. An efficient HMM-based structured prediction model for sequential labeling tasks, with extras.

Project description

[![Build Status](https://travis-ci.org/coastalcph/rungsted.svg?branch=master)](https://travis-ci.org/coastalcph/rungsted)

## Rungsted structured perceptron sequential tagger

### Install

The software is installable via PyPI, e.g. do

` pip install rungsted `

### Demo

The repository contains a subset of the part-of-speech tagged Brown corpus. To run the structured perceptron labeler on this dataset, execute:

python src/labeler.py --train data/brown.train --test data/brown.test.vw

Rungsted’s input format is closely modeled on the powerful and flexible format of [Vowpal Wabbit](https://github.com/JohnLangford/vowpal_wabbit/wiki/Input-format), with the exception that Rungsted is perfectly fine with labels that are not integers.

### Datasets

Provided you have a working installation of NLTK, you can recreate the Brown dataset with this command.

python rungsted/datasets/cr_brown_pos_data.py data/brown.train.vw data/brown.test.vw

There is also a script rungsted/datasets/conll_to_vw.py to convert from CONLL-formatted input to Rungsted

### Building and uploading to PyPI

First, run python setup.py sdist to generate a source distribution. Then upload the distribution files to PyPI with twine: twine upload dist/*.

To develop locally, use python setup.py develop.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rungsted-1.2.4.tar.gz (582.7 kB view details)

Uploaded Source

File details

Details for the file rungsted-1.2.4.tar.gz.

File metadata

  • Download URL: rungsted-1.2.4.tar.gz
  • Upload date:
  • Size: 582.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for rungsted-1.2.4.tar.gz
Algorithm Hash digest
SHA256 1dd75704417ab3242f8c21cacc2a306857cdc5299d9fa22c705650b95c32e944
MD5 cd19a9e4c80b83ae9c804248c42fdb28
BLAKE2b-256 a9899fbb8e9946b5ba8dc6ba7315af9238a2dbc8570b137c6bf89512bf592b76

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page