Skip to main content

Text Mining and Topic Modeling Toolkit

Project description

tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing like lemmatization, stemming or POS tagging especially for English and German texts. Preprocessing is done in parallel by using all available processors on your machine. The topic modeling features include topic model evaluation metrics, allowing to calculate models with different parameters in parallel and comparing them (e.g. in order to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different copora and/or parameter sets using the LDA implementations either from lda, scikit-learn or gensim.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmtoolkit-0.3.0.tar.gz (15.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tmtoolkit-0.3.0-py2.py3-none-any.whl (15.3 MB view details)

Uploaded Python 2Python 3

File details

Details for the file tmtoolkit-0.3.0.tar.gz.

File metadata

  • Download URL: tmtoolkit-0.3.0.tar.gz
  • Upload date:
  • Size: 15.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tmtoolkit-0.3.0.tar.gz
Algorithm Hash digest
SHA256 c5b6810c0f9ac87d0f16c6ff2a91857d02d7a7af553c4664974610a8cec53d11
MD5 c1a1a46f49dcd572c68c9457513ee19c
BLAKE2b-256 f5ceb11accefc710c8b41920412166860268161ba86e7972010fac3578775af2

See more details on using hashes here.

File details

Details for the file tmtoolkit-0.3.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tmtoolkit-0.3.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 9b675362a79d82b475fda8f73681148ec027ab28c7c629d7c3e7529795e4cba5
MD5 451324c5939b03b2c2d046fe93cca49d
BLAKE2b-256 7575333b255dd1d9dfd60aa7a3c16175b0a080dd5bad6ae56c72a5b909808565

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page