Text Mining and Topic Modeling Toolkit

Project description

tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing, such as lemmatization, stemming, and POS tagging, especially for English and German texts. Preprocessing runs in parallel, using all available processors on your machine. The topic modeling features include evaluation metrics that let you compute models with different parameters in parallel and compare them (e.g. to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different corpora and/or parameter sets using the LDA implementations from lda, scikit-learn, or gensim.


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmtoolkit-0.1.2.tar.gz (35.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tmtoolkit-0.1.2-py2.py3-none-any.whl (36.0 kB view details)

Uploaded Python 2, Python 3

File details

Details for the file tmtoolkit-0.1.2.tar.gz.

File metadata

  • Download URL: tmtoolkit-0.1.2.tar.gz
  • Upload date:
  • Size: 35.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tmtoolkit-0.1.2.tar.gz
Algorithm Hash digest
SHA256 f5575b30192b456b9b5d10ceaf6b0fae60964ba3ef8cbe573b4715f9d9221503
MD5 6ecc40b69817ca6f0cc6160dffe8580c
BLAKE2b-256 dd7ff2651e29f7bd206f730989f2116949ee69a1e5a5cf9b2a322916a124d87f

See more details on using hashes here.
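
The digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard library; the helper names `sha256_of_file` and `matches_expected` and the local file path are illustrative assumptions, not part of tmtoolkit or PyPI tooling.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digest copied from the hash table for tmtoolkit-0.1.2.tar.gz.
EXPECTED_SHA256 = "f5575b30192b456b9b5d10ceaf6b0fae60964ba3ef8cbe573b4715f9d9221503"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare the file's digest with a published hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())


if __name__ == "__main__":
    # Assumed location of the downloaded archive (hypothetical path).
    archive = Path("tmtoolkit-0.1.2.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive, EXPECTED_SHA256) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The file is read in chunks so that large archives do not have to fit in memory. The same function works for the wheel file if the expected value is replaced with the wheel's SHA256 digest.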

File details

Details for the file tmtoolkit-0.1.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tmtoolkit-0.1.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 b7837c140f6911ac906c013c979d5a4c92164a232ac0bb4168cbcb1ffcbbb522
MD5 53fcb3aaf1c4568a665629dc440f4608
BLAKE2b-256 9e57cee5d85dd50a44662ee2d98744ae737ee36e3ca6c6d1a2ea76c718caece7

