
A utility library to assist in parsing natural language text.

Project description

Zensols Natural Language Parsing

PyPI Python 3.7 Python 3.8 Python 3.9 Build Status

This framework wraps the spaCy framework and creates features from the parsed text. The motivation is to generate those features in an object-oriented fashion that is fast and easy to pickle. Other features include:

  • Token normalization as a stream of strings by lemmatization, stop word and/or punctuation filters, up/down casing, Porter stemming and others.
  • Detached features that are safe and easy to pickle to disk.
  • Configuration-driven parsing and token normalization using configuration factories.
  • Pretty print functionality for easy natural language feature selection.
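
The first two ideas in the list above, normalizing tokens into a stream of strings and keeping features detached so they pickle cleanly, can be sketched in plain Python. This is only an illustration under stated assumptions, not the zensols.nlp API: the token dictionaries, the STOP_WORDS set and the normalize function are hypothetical names invented for this example, and the lemmas are written by hand rather than produced by spaCy.

```python
import pickle
from typing import Dict, Iterable, Iterator, Union

# Hypothetical stop word list, for illustration only.
STOP_WORDS = frozenset({"the", "a", "an", "be", "of"})

# A "detached" token here is plain data with no reference to a parser object.
Token = Dict[str, Union[str, bool]]


def normalize(tokens: Iterable[Token]) -> Iterator[str]:
    """Yield lower-cased lemmas, skipping punctuation and stop words."""
    for tok in tokens:
        if tok["is_punct"]:
            continue
        norm = str(tok["lemma"]).lower()
        if norm in STOP_WORDS:
            continue
        yield norm


# Hand-written features standing in for what a parser would produce.
tokens = [
    {"text": "The", "lemma": "the", "is_punct": False},
    {"text": "dogs", "lemma": "dog", "is_punct": False},
    {"text": "are", "lemma": "be", "is_punct": False},
    {"text": "barking", "lemma": "bark", "is_punct": False},
    {"text": ".", "lemma": ".", "is_punct": True},
]

# Plain data survives a pickle round trip unchanged.
restored = pickle.loads(pickle.dumps(tokens))
normalized = list(normalize(restored))
print(normalized)  # ['dog', 'bark']
```

Because the features are copied into plain data before pickling, nothing in the serialized form depends on the parser or its loaded model.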

Documentation

Obtaining / Installing

  1. The easiest way to install the command line program is via the pip installer: pip3 install zensols.nlp
  2. Install at least one spaCy model: python -m spacy download en_core_web_sm
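
After the two steps above, a quick check that the packages are visible to the interpreter can save some debugging. The following is a minimal sketch using only the standard library. The helper name is_installed is hypothetical, and the import name zensols.nlp is assumed to match the distribution name. spaCy models such as en_core_web_sm install as importable packages under their own name.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if the module can be located on the import path.

    For a dotted name, find_spec imports the parent package first, so a
    missing parent raises ImportError rather than returning None.
    """
    try:
        return importlib.util.find_spec(module_name) is not None
    except ImportError:
        return False


# Names assumed from the install steps; adjust to the model you downloaded.
for name in ("zensols.nlp", "spacy", "en_core_web_sm"):
    status = "found" if is_installed(name) else "missing"
    print(f"{name}: {status}")
```

Any name reported as missing points at the install step to repeat.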

Binaries are also available on PyPI.

Attribution

This project, or example code, uses:

Changelog

An extensive changelog is available here.

License

MIT License

Copyright (c) 2020 - 2021 Paul Landes


