Calculate jargon distance metric between texts
Project description
Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Description: # Jargon Distance
Jason Portenoy 2018
Implement the jargon distance measure from <a href=”https://doi.org/10.15195/v1.a15” target=”_blank”>Vilhena et al. “Finding Cultural Holes: How Structure and Culture Diverge in Networks of Scholarly Communication.” (2014)</a>
### Installation
Install from PyPI:
pip install jargon-distance
### Usage
Initialize a JargonDistance instance with a term_counts dict and (optionally) a group_map dict. term_counts is a mapping of document -> term counter (Counter object).
` from jargon_distance import JargonDistance j = JargonDistance(term_counts) `
Then, calculate the jargon distances:
` j.calculate_jargon_distance() j.write_to_file('jargon_distance.csv') `
The jargon_distance.analysis module can be used to visualize the jargon distance calculations. For example, below is a dendrogram showing the jargon distances between the text (unigrams) of different works of literature (from the [demo.ipynb](demo.ipynb) notebook).
![Dendrogram showing the jargon distance between different works of literature.](literature_dendrogram.png)
See [demo.ipynb](demo.ipynb) for a more detailed example.
Platform: UNKNOWN Description-Content-Type: text/markdown
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file jargon-distance-0.1.5.tar.gz
.
File metadata
- Download URL: jargon-distance-0.1.5.tar.gz
- Upload date:
- Size: 9.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 76ca491b3d71d9b216890be0104c7d4c4233d44dec18afe9992c7c6916063243 |
|
MD5 | 2f30b7dd60c99357f7e8105d893f5b40 |
|
BLAKE2b-256 | d94b4df2418dff7885f5c6c2108315557206b441430086ce0963ca9802edbabc |
File details
Details for the file jargon_distance-0.1.5-py3-none-any.whl
.
File metadata
- Download URL: jargon_distance-0.1.5-py3-none-any.whl
- Upload date:
- Size: 10.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 11bee98b1cdd38f20a0aaa9035eb6f0b03322ce9c37dd1fe7199351c5cf8361e |
|
MD5 | e8f5797c145a5e31dd5ae7e3e357d161 |
|
BLAKE2b-256 | fe47bd2cde4307f5a885b3a831056c1d99e9500a06a6d5a4ce4b4fc232d1818d |