Skip to main content

Analyzing the evolution of ideas using citation analysis

Project description

For the full story, see this paper, or these notes.

This Python package, knowknow, is an attempt to make powerful, modern tools for analyzing the structure of knowledge open to anyone. Although I hope we can continue to improve the methods and documentation written here, and I intend that this grow larger than myself, this package acts as a stabilizing force for the field, giving us all access to the common methods and data for analyzing these structures.

I have included every inch of code here, leaving no stone unturned. With every pip install knowknow-amcgail, you download the following:

  • creating variables, a collection of pre-processing algorithms for cleaning and summarizing Web of Science search results, or JSTOR Data for Research data dumps.
  • analyses, a set of descriptive notebooks which illustrate these datasets
  • A connector to pre-computed cooccurrence sets, hosted on OSF

Projects built on knowknow

  • amcgail/citation-death applies the concept of 'death' to attributes of citations, and analyzes the lifecourse of cited works, cited authors, and the authors writing the citations, using the sociology-wos dataset.

Datasets built with knowknow

Installation

  1. Install Python 3.7
  2. Install Build Tools for Visual Studio
  3. Run pip install science2-amcgail

Quick start

The following command starts jupyterlab in the base directory of this repository. This is a good place to start.

python -m science2 start

Developing

If you want to contribute edits of your own, fork this repository into your own GitHub account, make the changes, and submit a request for me to incorporate the code (a "pull request"). This process is really easy with GitHub Desktop (tutorial here).

There is a lot to do! If you find this useful to your work, and would like to contribute (even to the following list of possible next steps) but can't figure out how, please don't hesitate to reach out. My website is here, Twitter here.

Aimed completion by 5/22/2020 (ben rosche)

  • analyses complete, with explanations, annotations, and graphs

Aimed completion by 5/29/2020 (committee)

  • literature review is tight, written, boom. everything down. finish it.

Aimed completion by 6/5/2020 (presentation)

  • Externalizing data from the Git repository, so it can be dynamically downloaded / uploaded via AWS
  • trimming the paper and preparing it for publication

Possible projects

  • The documentation for this project can always be improved. This is typically through people reaching out to me when they have issues. Please feel free.
  • An object-oriented model for handling context would prevent the need for so much variable-passing between functions, reduce total code volume, and improve readability.
  • Different datasets and sources could be incorporated, if you have the need, in addition to JSTOR and WoS.
  • If you produce precomputed binaries and have an idea of how we could incorporate the sharing of these binaries within this library, please DM me or something. That would be great.
  • All analyses can be generalized to any counted variable of the citations. This wouldn't be tough, and would have a huge payout.
  • It would be amazing if we could make a graphical interface for this.
    • user simply imports data, chooses the analyses they want to run, fill in configuration parameters and press "go"
    • the output is a PDF with the code, visualizations, and explanations for a given analysis
    • behind the scenes, all this GUI does is run nbconvert
    • also could allow users to regenerate any/all analyses for each dataset with the click of a button
    • could provide immediate access to online archives, either to download or upload similar count datasets

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

knowknow-amcgail-0.1.6.tar.gz (17.5 MB view details)

Uploaded Source

Built Distribution

knowknow_amcgail-0.1.6-py3-none-any.whl (13.1 MB view details)

Uploaded Python 3

File details

Details for the file knowknow-amcgail-0.1.6.tar.gz.

File metadata

  • Download URL: knowknow-amcgail-0.1.6.tar.gz
  • Upload date:
  • Size: 17.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.5

File hashes

Hashes for knowknow-amcgail-0.1.6.tar.gz
Algorithm Hash digest
SHA256 3dd8065db777a1289998c3590c3ae54fdab8bab4c40ba8bb7d105b24d69b7230
MD5 080f7006e55480e9d0d87f00eb3da554
BLAKE2b-256 58a0ea86c883888b4243ea90e03ec6fd8b047f194a31a49cbcfe01b25bf8e6b9

See more details on using hashes here.

File details

Details for the file knowknow_amcgail-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: knowknow_amcgail-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 13.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.5

File hashes

Hashes for knowknow_amcgail-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 342f9779acbc985ed33ba8dd57e294036822eddf26988167071b1df137d1bc3f
MD5 13a0e37bcd73d405336fff2d0c12a64d
BLAKE2b-256 f20257c8c2d311f810b5e2923e943a663332b908e310ec63520efd68c371adbc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page