Text Mining and Topic Modeling Toolkit
Project description
tmtoolkit is a set of tools for text mining and topic modeling with Python. It contains functions for text preprocessing like lemmatization, stemming or POS tagging especially for English and German texts. Preprocessing is done in parallel by using all available processors on your machine. The topic modeling features include topic model evaluation metrics, allowing to calculate models with different parameters in parallel and comparing them (e.g. in order to find the best number of topics for a given set of documents). Topic models can be generated in parallel for different copora and/or parameter sets using the LDA implementations either from lda, scikit-learn or gensim.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tmtoolkit-0.3.0.tar.gz.
File metadata
- Download URL: tmtoolkit-0.3.0.tar.gz
- Upload date:
- Size: 15.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c5b6810c0f9ac87d0f16c6ff2a91857d02d7a7af553c4664974610a8cec53d11
|
|
| MD5 |
c1a1a46f49dcd572c68c9457513ee19c
|
|
| BLAKE2b-256 |
f5ceb11accefc710c8b41920412166860268161ba86e7972010fac3578775af2
|
File details
Details for the file tmtoolkit-0.3.0-py2.py3-none-any.whl.
File metadata
- Download URL: tmtoolkit-0.3.0-py2.py3-none-any.whl
- Upload date:
- Size: 15.3 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9b675362a79d82b475fda8f73681148ec027ab28c7c629d7c3e7529795e4cba5
|
|
| MD5 |
451324c5939b03b2c2d046fe93cca49d
|
|
| BLAKE2b-256 |
7575333b255dd1d9dfd60aa7a3c16175b0a080dd5bad6ae56c72a5b909808565
|