The Literature mining and processing utility

# Combined Data Mining Utility

This script combines several data mining utilities for PubMed articles: searching PubMed, retrieving abstracts, downloading full texts, processing PubMed IDs, crawling URLs, removing duplicates, and converting PDFs to text files, plus some additional utilities.

## Getting Started

### Prerequisites

Ensure you have Python installed on your system. The script is described as compatible with both Python 2 and 3, although the built distribution published on PyPI is tagged for Python 3 only.

### Installation

  1. Install the required package:

    `pip install scholarsync`

  2. If you encounter the following warning message:

    `WARNING: The script scholarsync is installed in '/home/username/.local/bin' which is not on PATH.`

    Open a terminal and type the following command, then press Enter:

    `echo 'export PATH="$PATH:/home/username/.local/bin"' >> ~/.bashrc && source ~/.bashrc`

    Replace `username` with your actual username.
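
To see whether the PATH fix above is needed before editing `~/.bashrc`, a check along the following lines may help. This is a minimal sketch, not part of scholarsync: the helper names `user_scripts_dir` and `is_on_path` are hypothetical, and it assumes a Linux-style layout in which per-user scripts live in the `bin` directory under Python's user base (typically `~/.local/bin`).

```python
import os
import site


def user_scripts_dir() -> str:
    """Return the per-user scripts directory (assumes a POSIX layout: <user base>/bin)."""
    return os.path.join(site.getuserbase(), "bin")


def is_on_path(directory: str, path_value: str) -> bool:
    """Return True if `directory` is one of the entries in a PATH-style string."""
    target = os.path.normpath(directory)
    # Skip empty entries and normalise trailing slashes before comparing.
    entries = [os.path.normpath(p) for p in path_value.split(os.pathsep) if p]
    return target in entries


if __name__ == "__main__":
    scripts = user_scripts_dir()
    if is_on_path(scripts, os.environ.get("PATH", "")):
        print(f"{scripts} is already on PATH")
    else:
        print(f"{scripts} is not on PATH; add it as described in step 2")
```

If the check reports that the directory is already on PATH, the `echo ... >> ~/.bashrc` step can be skipped.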

## Usage

Upon running the script, you will be prompted with a menu to select the desired functionality. The available options include:

  • PubMed search/query

  • Get abstracts from PubMed IDs

  • Attempt full text download from PubMed

  • Process PubMed IDs to get DOI

  • URL transformation of PubMed IDs

  • Crawling and downloading from URLs

  • Removing duplicates

  • Converting PDFs to text files

  • Additional utilities

Follow the on-screen instructions to navigate through the menu and execute the desired tasks.
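
To give a rough idea of what a few of these menu options involve, here is a small sketch of three of them: removing duplicates, URL transformation of PubMed IDs, and building a PubMed search query. It is not scholarsync's own code and does not reflect its internal API; the function names are hypothetical, and the search helper assumes the public NCBI E-utilities `esearch` endpoint. Nothing here touches the network; it only builds the URLs.

```python
from urllib.parse import urlencode

# Assumed public endpoints; scholarsync may use different ones internally.
PUBMED_ARTICLE_BASE = "https://pubmed.ncbi.nlm.nih.gov/"
ESEARCH_ENDPOINT = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def dedupe_pmids(pmids):
    """Drop blank and repeated PubMed IDs, keeping first-seen order."""
    seen = set()
    unique = []
    for raw in pmids:
        pmid = str(raw).strip()
        if pmid and pmid not in seen:
            seen.add(pmid)
            unique.append(pmid)
    return unique


def pmid_to_url(pmid):
    """Turn a PubMed ID into the URL of its PubMed article page."""
    return f"{PUBMED_ARTICLE_BASE}{pmid}/"


def build_esearch_url(term, retmax=20):
    """Build an E-utilities search URL that would return matching PubMed IDs as JSON."""
    params = {"db": "pubmed", "term": term, "retmax": retmax, "retmode": "json"}
    return f"{ESEARCH_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    ids = dedupe_pmids(["31452104", " 31452104 ", "", "25741868"])
    for pmid in ids:
        print(pmid_to_url(pmid))
    print(build_esearch_url("crispr cancer", retmax=5))
```

The PubMed IDs in the usage section are arbitrary placeholders chosen for illustration.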

## License

By using this script, you agree to the terms of the LICENSE included in the repository.

## Contributing

Contributions are welcome! Feel free to submit pull requests or open issues for any improvements or bug fixes.

## Acknowledgments

  • This script was developed to simplify various data mining tasks related to PubMed articles.

  • Special thanks to the developers and contributors of the libraries used in this script.
