Skip to main content

Text Mining and Topic Modeling Toolkit

Project description

tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing like lemmatization, stemming or POS tagging especially for English and German texts. Preprocessing is done in parallel by using all available processors on your machine. The topic modeling features include topic model evaluation metrics, allowing to calculate models with different parameters in parallel and comparing them (e.g. in order to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different copora and/or parameter sets using the LDA implementations either from lda, scikit-learn or gensim.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmtoolkit-0.4.1.tar.gz (15.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tmtoolkit-0.4.1-py2.py3-none-any.whl (15.3 MB view details)

Uploaded Python 2Python 3

File details

Details for the file tmtoolkit-0.4.1.tar.gz.

File metadata

  • Download URL: tmtoolkit-0.4.1.tar.gz
  • Upload date:
  • Size: 15.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tmtoolkit-0.4.1.tar.gz
Algorithm Hash digest
SHA256 616aec3efbcc931922843b74368b7a91e5b61e75ce1dbd1873dc1e53d00bac9c
MD5 855525aa0cc7a038a888c79180e31fcd
BLAKE2b-256 9e90b57236dc619bb92fec15566cc8b8f418e923f57741af13b743ec2d72f401

See more details on using hashes here.

File details

Details for the file tmtoolkit-0.4.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tmtoolkit-0.4.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 d7a201faccfe613f3844ff06228aeee8b0949d9d9047578ef239d2259a6dade5
MD5 6b5603f39f54c1661613603e96909ea1
BLAKE2b-256 05b545e443a1de223cdde2db9c4e50bc8e306f3a5c461ed62203f2a408122229

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page