
Calculate jargon distance metric between texts

Project description

Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

# Jargon Distance

Jason Portenoy 2018

Implements the jargon distance measure from [Vilhena et al. "Finding Cultural Holes: How Structure and Culture Diverge in Networks of Scholarly Communication." (2014)](https://doi.org/10.15195/v1.a15).
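For orientation, here is a minimal sketch of the measure as this README's editor understands it from the paper: the distance from a reader group j to a writer group i is one minus the ratio of the entropy of i's term distribution to the cross-entropy of i's terms under j's smoothed codebook. The function names, the smoothing weight `alpha=0.01`, and the use of base-2 logarithms are assumptions made for illustration. This is not the package's actual code, which may differ in detail.

```python
import math
from collections import Counter


def to_probs(counts):
    """Normalize a term Counter into a probability distribution."""
    total = sum(counts.values())
    return {term: n / total for term, n in counts.items()}


def jargon_distance_sketch(counts_i, counts_j, global_counts, alpha=0.01):
    """Illustrative (hypothetical) jargon distance from group j to group i.

    Assumes every term in counts_i also appears in global_counts, so the
    smoothed codebook never assigns zero probability to a term.
    """
    p_i = to_probs(counts_i)
    p_j = to_probs(counts_j)
    p_g = to_probs(global_counts)

    # Entropy of the writer's own term distribution.
    entropy = -sum(p * math.log2(p) for p in p_i.values())

    # Cross-entropy of the writer's terms under the reader's codebook,
    # smoothed toward the global distribution with weight alpha.
    cross = 0.0
    for term, p in p_i.items():
        q = (1 - alpha) * p_j.get(term, 0.0) + alpha * p_g.get(term, 0.0)
        cross -= p * math.log2(q)

    if cross == 0:
        return 0.0  # degenerate case: a single shared term
    return 1 - entropy / cross


a = Counter({"x": 2, "y": 2})
b = Counter({"x": 1, "z": 3})
corpus = a + b  # global term counts across all documents

d_aa = jargon_distance_sketch(a, a, corpus)
d_ab = jargon_distance_sketch(a, b, corpus)
```

In this sketch the measure is asymmetric: `jargon_distance_sketch(a, b, corpus)` and `jargon_distance_sketch(b, a, corpus)` generally differ, because each direction uses a different writer distribution and reader codebook.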

### Installation

Install from PyPI:

pip install jargon-distance

### Usage

Initialize a `JargonDistance` instance with a `term_counts` dict and, optionally, a `group_map` dict. `term_counts` maps each document to a `collections.Counter` of its terms.

```python
from jargon_distance import JargonDistance
j = JargonDistance(term_counts)
```
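One way to build `term_counts` from raw text is sketched below. The toy corpus, the document ids, and the regex tokenizer are all assumptions for illustration. Any tokenization that yields one `Counter` per document should fit the description above.

```python
import re
from collections import Counter

# Hypothetical toy corpus: document id -> raw text.
texts = {
    "doc_a": "The cell membrane regulates transport into the cell.",
    "doc_b": "The court ruled that the contract was void.",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic unigrams (an assumed, simple tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


# term_counts maps each document id to a Counter of its terms.
term_counts = {doc_id: Counter(tokenize(text)) for doc_id, text in texts.items()}
```

The resulting dict can then be passed to `JargonDistance(term_counts)`.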

Then, calculate the jargon distances:

```python
j.calculate_jargon_distance()
j.write_to_file('jargon_distance.csv')
```

The jargon_distance.analysis module can be used to visualize the jargon distance calculations. For example, below is a dendrogram showing the jargon distances between the text (unigrams) of different works of literature (from the [demo.ipynb](demo.ipynb) notebook).

![Dendrogram showing the jargon distance between different works of literature.](literature_dendrogram.png)

See [demo.ipynb](demo.ipynb) for a more detailed example.


