Skip to main content

Text Mining and Topic Modeling Toolkit

Project description

tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing like lemmatization, stemming or POS tagging especially for English and German texts. Preprocessing is done in parallel by using all available processors on your machine. The topic modeling features include topic model evaluation metrics, allowing to calculate models with different parameters in parallel and comparing them (e.g. in order to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different copora and/or parameter sets using the LDA implementations either from lda, scikit-learn or gensim.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmtoolkit-0.3.1.tar.gz (15.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tmtoolkit-0.3.1-py2.py3-none-any.whl (15.3 MB view details)

Uploaded Python 2Python 3

File details

Details for the file tmtoolkit-0.3.1.tar.gz.

File metadata

  • Download URL: tmtoolkit-0.3.1.tar.gz
  • Upload date:
  • Size: 15.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tmtoolkit-0.3.1.tar.gz
Algorithm Hash digest
SHA256 20086cabd0beb43b9a283af2c5dff8e6d78648086023a6992c0c894743f87caa
MD5 86237a561b119521e94a9ddbe1031ae6
BLAKE2b-256 76fca4725a4a57c88330676c1c96aab2762ab72ad68783a4c2a9271d82ffda1f

See more details on using hashes here.

File details

Details for the file tmtoolkit-0.3.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tmtoolkit-0.3.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 a40bed59e6581287ff3a174aedbc864558143de4ccc6a1de9b0569054619b357
MD5 a353068cca723c23de8b3be7bd3f36b9
BLAKE2b-256 f6de65239d6a030e7071c54065b47417d01313bd3438bf81d032d6a5afbacb25

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page