Skip to main content

Text Mining and Topic Modeling Toolkit

Project description

tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing like lemmatization, stemming or POS tagging especially for English and German texts. Preprocessing is done in parallel by using all available processors on your machine. The topic modeling features include topic model evaluation metrics, allowing to calculate models with different parameters in parallel and comparing them (e.g. in order to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different copora and/or parameter sets using the LDA implementations either from lda, scikit-learn or gensim.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmtoolkit-0.5.0.tar.gz (15.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tmtoolkit-0.5.0-py2.py3-none-any.whl (15.3 MB view details)

Uploaded Python 2Python 3

File details

Details for the file tmtoolkit-0.5.0.tar.gz.

File metadata

  • Download URL: tmtoolkit-0.5.0.tar.gz
  • Upload date:
  • Size: 15.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tmtoolkit-0.5.0.tar.gz
Algorithm Hash digest
SHA256 0dcac0ba6a8e4a39888ec405b68e573c01aa21bf83cbc0e091dc957626b33252
MD5 a817c2e8c21500ff5aa7de7a42de6b84
BLAKE2b-256 90bbc52da23e21f0239a94ac9a53baac223ac9cefe2f7a8df701c88845035bdc

See more details on using hashes here.

File details

Details for the file tmtoolkit-0.5.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tmtoolkit-0.5.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 551c9ee7fdfd3386f84b9630be822b234497de2fc8310782587547e7985035ae
MD5 8b463791a421d3d5736ce7392c1dd795
BLAKE2b-256 d2561812b2884a2bdf1681f500dbf22557bb0726fd5f0e3549e9dba20d5e8761

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page