Cohesion measurement to evaluate topic modeling score. call cohesion_df(df)
Project description
Cohesion POC - Topic Detection Measurement
Cohesion
The Topic-Detection field deals mainly with providing names to given divisions of documents and lacks a quality measurement that provides a rating for the division, that represent a human-subjective score.
Cohesion is here to overcome it, includes NLP techniques and considers intra and inter scores in the cohesion formula.
The goal of the POC, it to prove that Cohesion has better correlation with NMI heuristic than Coherence which consider as SOA of Topic-Detecion-Measurement domain.
Part of Final project at SISE: BGU university.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cohesion_pipeline-0.1.0.tar.gz
.
File metadata
- Download URL: cohesion_pipeline-0.1.0.tar.gz
- Upload date:
- Size: 8.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7cee37cf247f663484ed97cc034c1fa9b76664c0326f46453e909dde40d1db5e |
|
MD5 | 7f8d17464e7646c00b2ae1d0e5504c8a |
|
BLAKE2b-256 | 60bf61cd0cd4ffa45aa135e85f0d345891aa97af50abc63ff0580adb046928e0 |
Provenance
File details
Details for the file cohesion_pipeline-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: cohesion_pipeline-0.1.0-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58a70a4500226c569845b2e62d47ae69041046474a90cd0815d95a77d9301737 |
|
MD5 | 444481525553616a280002057d99945b |
|
BLAKE2b-256 | 265bc5c90aeb961cab87168ca4bfe6710a5ac9c58f6f0f710b7c70b872154f3d |