Lightweight implementation of the Typecraft XML format in Python.

Typecraft Python

This repository contains an IGT (interlinear glossed text) model based on the Typecraft IGT format. It also provides a simple CLI for NLP tasks such as tokenizing and tagging, interfacing with NLTK and with external tools such as TreeTagger.

Installation

pip install typecraft_python
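
After installing, it can be useful to confirm that both the package and the `tpy` command are visible to the current environment. The following is a minimal sketch for Python 3, not part of the package itself: `check_install` is a hypothetical helper, and it assumes the importable module name matches the distribution name `typecraft_python`.

```python
import importlib.util
import shutil


def check_install(module_name="typecraft_python", cli_name="tpy"):
    """Report whether the module can be found and the CLI is on PATH.

    Assumption: the import name equals the pip distribution name.
    """
    return {
        # find_spec returns None when a top-level module is not installed
        "module_importable": importlib.util.find_spec(module_name) is not None,
        # shutil.which returns None when no executable of that name is on PATH
        "cli_on_path": shutil.which(cli_name) is not None,
    }
```

Calling `check_install()` returns a dictionary with two booleans; if `cli_on_path` is false, the environment's scripts directory may be missing from `PATH`.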

Features

  • Parsing of the Typecraft XML format.
  • Manipulation of the Typecraft IGT model format.
    • Integration with NLTK
    • Integration with TreeTagger
  • A CLI that can be used to load, convert and manipulate raw text and Typecraft XML files.

Usage

Usage: tpy [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  convert
  ntexts   This command lists the number of texts in a...
  raw
  xml

Examples

Load a raw text file, tokenize and tag it, and write XML to stdout:

tpy raw your_file.txt

To save the output to a file:

tpy raw your_file.txt -o output.xml
# or
tpy raw your_file.txt > output.xml

To tag using a specific tagger:

tpy raw your_file.txt --tagger=tree  # Tags using TreeTagger

To load a Typecraft XML file and tag it:

tpy xml your_file.xml --tag --tagger=nltk -o tagged_output.xml
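
The commands above can also be driven from a Python script, for example when processing many files. The sketch below only assembles the same argument lists shown in the examples and hands them to `subprocess`; `build_tpy_command` and `run_tpy` are hypothetical helper names, not part of typecraft_python, and only the flags that appear in the examples (`--tag`, `--tagger`, `-o`) are used.

```python
import shutil
import subprocess


def build_tpy_command(mode, input_path, tagger=None, output=None, tag=False):
    """Build the argument list for a `tpy raw` or `tpy xml` call."""
    if mode not in ("raw", "xml"):
        raise ValueError("mode must be 'raw' or 'xml'")
    cmd = ["tpy", mode, input_path]
    if tag:
        # The examples only show --tag together with `tpy xml`.
        cmd.append("--tag")
    if tagger is not None:
        # Values seen in the examples: "tree" and "nltk".
        cmd.append("--tagger=" + tagger)
    if output is not None:
        cmd.extend(["-o", output])
    return cmd


def run_tpy(cmd):
    """Run a command built above and return whatever it wrote to stdout."""
    if shutil.which(cmd[0]) is None:
        raise RuntimeError("tpy not found on PATH; is typecraft_python installed?")
    result = subprocess.run(
        cmd, check=True, stdout=subprocess.PIPE, universal_newlines=True
    )
    return result.stdout
```

For instance, `run_tpy(build_tpy_command("raw", "your_file.txt", tagger="tree"))` would correspond to the TreeTagger example, returning the XML that the CLI prints when no `-o` is given.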

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.1 (2016-08-15)

  • Fixed some small bugs.

0.1.0 (2016-08-14)

  • First release. Added main bulk of initial code:
    • The parser works in its most basic form and parses TC-XML documents into an object tree.
