
Calculate jargon distance metric between texts


Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

# Jargon Distance

Jason Portenoy 2018

Implements the jargon distance measure from [Vilhena et al., “Finding Cultural Holes: How Structure and Culture Diverge in Networks of Scholarly Communication” (2014)](https://doi.org/10.15195/v1.a15).

### Installation

Install from PyPI:

`pip install jargon-distance`

### Usage

Initialize a `JargonDistance` instance with a `term_counts` dict and, optionally, a `group_map` dict. `term_counts` maps each document to a counter of its terms (a `collections.Counter` object).
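As a minimal sketch, a `term_counts` dict of this shape could be built from raw text as follows. The sample documents, the `tokenize` helper, and the regex-based unigram tokenization are illustrative assumptions, not part of the package:

```python
import re
from collections import Counter

# Hypothetical sample documents, keyed by a document identifier.
documents = {
    "doc_a": "The cat sat on the mat.",
    "doc_b": "Quantum flux bends the tensor field.",
}


def tokenize(text):
    """Lowercase the text and split it into simple unigram tokens (an assumed tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


# Mapping of document -> Counter of terms, the shape described above.
term_counts = {name: Counter(tokenize(text)) for name, text in documents.items()}
```

Any tokenization scheme can be used here, as long as each document ends up mapped to a `Counter` of its terms.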

```python
from jargon_distance import JargonDistance

j = JargonDistance(term_counts)
```

Then, calculate the jargon distances:

```python
j.calculate_jargon_distance()
j.write_to_file('jargon_distance.csv')
```
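For intuition about what is being calculated, here is a rough, standalone sketch of the measure as it is commonly summarized from Vilhena et al.: one minus the ratio of the writer's entropy to the cross-entropy between the writer's term distribution and a smoothed version of the reader's. The function names, the smoothing weight `alpha=0.01`, and the use of a pooled corpus-wide distribution for smoothing are assumptions made for illustration; this is not the package's implementation, so consult the paper and the package source for the exact definition.

```python
import math
from collections import Counter


def to_probs(counts):
    """Normalize raw term counts into a probability distribution."""
    total = sum(counts.values())
    return {term: count / total for term, count in counts.items()}


def jargon_distance_sketch(counts_i, counts_j, global_counts, alpha=0.01):
    """Illustrative distance from writer i to reader j (not the package's code).

    global_counts is assumed to contain every term that appears in counts_i.
    """
    p_i = to_probs(counts_i)
    p_j = to_probs(counts_j)
    p_global = to_probs(global_counts)

    entropy = 0.0        # Shannon entropy of the writer's distribution
    cross_entropy = 0.0  # cost of reading i's terms with j's smoothed distribution
    for term, p in p_i.items():
        # Mix in the corpus-wide distribution so terms unseen by j keep nonzero probability.
        q = (1 - alpha) * p_j.get(term, 0.0) + alpha * p_global[term]
        entropy -= p * math.log2(p)
        cross_entropy -= p * math.log2(q)

    if cross_entropy == 0:
        return 0.0  # degenerate case: a single term shared by the whole corpus
    return 1 - entropy / cross_entropy


doc_a = Counter("the cat sat on the mat".split())
doc_b = Counter("quantum flux bends tensor field".split())
corpus = doc_a + doc_b

same = jargon_distance_sketch(doc_a, doc_a, corpus)
different = jargon_distance_sketch(doc_a, doc_b, corpus)
```

In this sketch, a document compared with itself scores close to zero (not exactly zero, because of the smoothing), while documents with disjoint vocabularies score much higher. The measure is asymmetric: swapping writer and reader generally gives a different value.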

The jargon_distance.analysis module can be used to visualize the jargon distance calculations. For example, below is a dendrogram showing the jargon distances between the text (unigrams) of different works of literature (from the [demo.ipynb](demo.ipynb) notebook).

![Dendrogram showing the jargon distance between different works of literature.](literature_dendrogram.png)

See [demo.ipynb](demo.ipynb) for a more detailed example.



