Skip to main content

Deep learning utility library for natural language processing that aids in feature engineering and embedding layers.

Project description

DeepZensols Natural Language Processing

PyPI Python 3.9 Python 3.10

Deep learning utility library for natural language processing that aids in feature engineering and embedding layers.

Features:

  • Configurable layers with little to no need to write code.
  • Natural language specific layers:
  • NLP specific vectorizers that generate zensols deeplearn encoded and decoded batched tensors for spaCy parsed features, dependency tree features, overlapping text features and others.
  • Easily swapable during runtime embedded layers as batched tensors and other linguistic vectorized features.
  • Support for token, document and embedding level vectorized features.
  • Transformer word piece to linguistic token mapping.
  • Two full documented reference models provided as both command line and Jupyter notebooks.
  • Command line support for training, testing, debugging, and creating predictions.

Documentation

Obtaining

The easiest way to install the command line program is via the pip installer:

pip3 install zensols.deepnlp

Binaries are also available on pypi.

Usage and Reference Models

If you're in a rush, you can dive right in to the Clickbate Text Classification reference model, which is a working project that uses this library. However, you'll either end up reading up on the zensols deeplearn library before or during the tutorial.

The usage of this library is explained in terms of the reference models:

The unit test cases are also a good resource for the more detailed programming integration with various parts of the library.

Attribution

This project, or reference model code, uses:

Corpora used include:

Citation

If you use this project in your research please use the following BibTeX entry:

@article{Landes_DiEugenio_Caragea_2021,
  title={DeepZensols: Deep Natural Language Processing Framework},
  url={http://arxiv.org/abs/2109.03383},
  note={arXiv: 2109.03383},
  journal={arXiv:2109.03383 [cs]},
  author={Landes, Paul and Di Eugenio, Barbara and Caragea, Cornelia},
  year={2021},
  month={Sep}
}

Community

Please star the project and let me know how and where you use this API. Contributions as pull requests, feedback and any input is welcome.

Changelog

An extensive changelog is available here.

License

MIT License

Copyright (c) 2020 - 2021 Paul Landes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zensols.deepnlp-1.3.0-py3.10.egg (216.1 kB view details)

Uploaded Egg

zensols.deepnlp-1.3.0-py3-none-any.whl (97.5 kB view details)

Uploaded Python 3

File details

Details for the file zensols.deepnlp-1.3.0-py3.10.egg.

File metadata

  • Download URL: zensols.deepnlp-1.3.0-py3.10.egg
  • Upload date:
  • Size: 216.1 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.2

File hashes

Hashes for zensols.deepnlp-1.3.0-py3.10.egg
Algorithm Hash digest
SHA256 7aa9012b6f94098d4fa9b1bb0e56f56e4e2bc7c847bd9e6e338fdf4dde33f63b
MD5 0c693a3cbe829e8bff5d9ea0c826b577
BLAKE2b-256 166dc602638f5880e3b79dbce15bfb641c1c22a5346bd4f1e79e64dd10273857

See more details on using hashes here.

File details

Details for the file zensols.deepnlp-1.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for zensols.deepnlp-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 28ffe9dd2cb68c89259c97fdd5f2dc83ff43062c1ee3890b6bd91aedeed02c47
MD5 2947303eb339a48fe7f0fb5d6642a485
BLAKE2b-256 9f38c33022f7b9b0b93d6f6cbbcdc368580b356f9ccb6e2df61cc7e7697cb3f9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page