The Literature mining and processing utility
Project description
# Combined Data Mining Utility
This script provides a combined utility for data mining tasks related to PubMed articles. It offers various functionalities to facilitate tasks such as searching PubMed, retrieving abstracts, downloading full texts, processing PubMed IDs, crawling URLs, removing duplicates, converting PDFs to text files, and additional utilities.
## Getting Started
### Prerequisites
Ensure you have Python installed on your system. The script is compatible with both Python 2 and 3.
### Installation
Install the required package:
`bash pip install scholarsync `
If you encounter the following warning message:
` WARNING: The script scholarsync is installed in '/home/username/.local/bin' which is not on PATH. `
Open a terminal and type the following command, then press Enter:
`bash echo 'export PATH="$PATH:/home/username/.local/bin"' >> ~/.bashrc && source ~/.bashrc `
Replace username with your actual username.
## Usage
Upon running the script, you will be prompted with a menu to select the desired functionality. The available options include:
PubMed search/query
Get abstracts from PubMed IDs
Attempt full text download from PubMed
Process PubMed IDs to get DOI
URL transformation of PubMed IDs
Crawling and downloading from URLs
Removing duplicates
Converting PDFs to text files
Additional utilities
Follow the on-screen instructions to navigate through the menu and execute the desired tasks.
## License
By using this script, you agree to the terms of the LICENSE included in the repository.
## Contributing
Contributions are welcome! Feel free to submit pull requests or open issues for any improvements or bug fixes.
## Acknowledgments
This script was developed to simplify various data mining tasks related to PubMed articles.
Special thanks to the developers and contributors of the libraries used in this script.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for scholarsync-0.0.3.88-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c2e3ec727d1efff630949fb21e241afd5654d0a4699f9a777e18309fdf0e83c7 |
|
MD5 | 65c51cb4498a9936007d3571e71f7eef |
|
BLAKE2b-256 | 9a9776b526e6eb931a73d4a53a9c4984eebef470be06ed6c341d2f263b72e466 |