Get Papers List
A Python program to fetch research papers from PubMed and filter those with at least one author affiliated with a pharmaceutical or biotech company.
Features
- Fetches research papers from PubMed using a search query.
- Filters papers based on author affiliation with pharmaceutical or biotech companies.
- Extracts relevant details, including emails of authors when available.
- Outputs results in a CSV format for easy viewing and analysis.
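As a rough illustration of the first feature, the sketch below builds a search request for the NCBI E-utilities ESearch endpoint and reads the paper IDs out of its XML response. The names `build_search_url`, `parse_paper_ids`, and `SAMPLE_RESPONSE` are hypothetical, and the sample IDs are made up. The project's own code may be structured differently. The standard library stands in for `requests` and `lxml` here so the example runs without extra dependencies and without network access.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import urlencode

# Public NCBI E-utilities search endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_search_url(query: str, max_results: int = 20) -> str:
    """Build an ESearch URL for a PubMed query (hypothetical helper)."""
    params = {
        "db": "pubmed",
        "term": query,
        "retmax": max_results,
        "retmode": "xml",
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"


def parse_paper_ids(xml_text: str) -> List[str]:
    """Return the PubMed IDs listed in an ESearch XML response."""
    root = ET.fromstring(xml_text.strip())
    return [node.text for node in root.findall("./IdList/Id") if node.text]


# Made-up response in the shape ESearch returns, used instead of a live call.
SAMPLE_RESPONSE = """
<eSearchResult>
  <Count>2</Count>
  <IdList>
    <Id>111</Id>
    <Id>222</Id>
  </IdList>
</eSearchResult>
"""

if __name__ == "__main__":
    print(build_search_url("cancer therapy"))
    print(parse_paper_ids(SAMPLE_RESPONSE))
```

In a real run, the URL would be fetched over HTTP and the response body passed to the parsing step.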
How the Code is Organized
The program is organized into the following components:
1. get_papers.py
- The main script that fetches papers from PubMed, filters them, and outputs the results.
- Contains functions for:
  - Fetching Paper IDs: fetch_paper_ids()
  - Fetching Paper Details: fetch_paper_details()
  - Checking Pharmaceutical/Biotech Affiliations: is_pharma_biotech()
  - Extracting Emails: extract_email()
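The filtering and email steps listed above might look roughly like the sketch below. The function names `is_pharma_biotech` and `extract_email` come from the list, but their signatures and bodies here are assumptions. The keyword lists, the `parse_paper_details` helper, the output keys, and the sample XML are all hypothetical. The sample follows the general shape of a PubMed EFetch response, and `xml.etree.ElementTree` is used in place of `lxml` to keep the example dependency-free.

```python
import re
import xml.etree.ElementTree as ET
from typing import Dict, List, Optional

# Hypothetical keyword lists: the project's real matching rules are not shown.
COMPANY_HINTS = ("pharma", "biotech", "therapeutics", "inc.", "ltd", "gmbh")
ACADEMIC_HINTS = ("university", "college", "institute", "hospital", "school of")
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def is_pharma_biotech(affiliation: str) -> bool:
    """Return True if the affiliation looks like a company and not academia."""
    text = affiliation.lower()
    if any(hint in text for hint in ACADEMIC_HINTS):
        return False
    return any(hint in text for hint in COMPANY_HINTS)


def extract_email(affiliation: str) -> Optional[str]:
    """Return the first email address found in the affiliation text, if any."""
    match = EMAIL_RE.search(affiliation)
    return match.group(0) if match else None


def parse_paper_details(xml_text: str) -> List[Dict[str, str]]:
    """Keep only papers with at least one company-affiliated author."""
    rows = []
    root = ET.fromstring(xml_text.strip())
    for article in root.findall("PubmedArticle"):
        authors, emails = [], []
        for author in article.findall(".//Author"):
            parts = (author.findtext("ForeName"), author.findtext("LastName"))
            name = " ".join(part for part in parts if part)
            for aff in author.findall("AffiliationInfo/Affiliation"):
                text = "".join(aff.itertext())
                if is_pharma_biotech(text):
                    authors.append(name)
                    email = extract_email(text)
                    if email:
                        emails.append(email)
                    break  # one matching affiliation is enough for this author
        if authors:
            title = article.find(".//ArticleTitle")
            rows.append({
                "PubmedID": article.findtext("MedlineCitation/PMID", default=""),
                "Title": "".join(title.itertext()) if title is not None else "",
                "CompanyAuthors": "; ".join(authors),
                "Email": emails[0] if emails else "",
            })
    return rows


# Made-up sample: paper 111 has one company author, paper 222 has none.
SAMPLE_XML = """
<PubmedArticleSet>
  <PubmedArticle><MedlineCitation><PMID>111</PMID><Article>
    <ArticleTitle>A made-up industry study</ArticleTitle>
    <AuthorList>
      <Author><LastName>Doe</LastName><ForeName>Jane</ForeName>
        <AffiliationInfo><Affiliation>Example Pharma Inc., Boston, MA, USA. jane.doe@examplepharma.com.</Affiliation></AffiliationInfo>
      </Author>
      <Author><LastName>Roe</LastName><ForeName>Sam</ForeName>
        <AffiliationInfo><Affiliation>Department of Pharmacology, University of Somewhere.</Affiliation></AffiliationInfo>
      </Author>
    </AuthorList>
  </Article></MedlineCitation></PubmedArticle>
  <PubmedArticle><MedlineCitation><PMID>222</PMID><Article>
    <ArticleTitle>A made-up academic study</ArticleTitle>
    <AuthorList>
      <Author><LastName>Poe</LastName><ForeName>Alex</ForeName>
        <AffiliationInfo><Affiliation>Somewhere College Hospital.</Affiliation></AffiliationInfo>
      </Author>
    </AuthorList>
  </Article></MedlineCitation></PubmedArticle>
</PubmedArticleSet>
"""

if __name__ == "__main__":
    for row in parse_paper_details(SAMPLE_XML):
        print(row)
```

The academic keyword check runs first so that an affiliation such as a university pharmacology department is not counted as a company just because it contains "pharma". Keyword matching of this kind is only a heuristic and will misclassify some affiliations.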
2. Command-Line Interface
- The program is executed via the command line using poetry run get-papers-list.
- Supports the following options:
  - query: The PubMed query to search for papers.
  - -d or --debug: Enables debug mode.
  - -f or --file: Saves the results to a CSV file.
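One way to wire up these options is with the standard library's `argparse`, as sketched below. Whether the project actually uses `argparse` is an assumption, as are the helper names `build_parser` and `parse_cli` and the guess that `--file` takes a file path as its value.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        prog="get-papers-list",
        description="Fetch PubMed papers with pharma/biotech-affiliated authors.",
    )
    parser.add_argument("query", help="PubMed query to search for papers")
    parser.add_argument(
        "-d", "--debug", action="store_true", help="enable debug mode"
    )
    # Assumption: the option takes the output path as its argument.
    parser.add_argument(
        "-f", "--file", metavar="PATH", help="save the results to this CSV file"
    )
    return parser


def parse_cli(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse argv, or sys.argv[1:] when argv is None."""
    return build_parser().parse_args(argv)


if __name__ == "__main__":
    args = parse_cli(["cancer therapy", "--debug", "--file", "results.csv"])
    print(args.query, args.debug, args.file)
```

Under these assumptions, an invocation would look like `poetry run get-papers-list "cancer therapy" -d -f results.csv`.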
3. Dependencies
- The program uses the following libraries:
  - requests: For making HTTP requests to the PubMed API.
  - lxml: For parsing XML responses from PubMed.
  - pandas: For creating and saving the results as a CSV file.
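The CSV output step can be sketched as follows. This version uses the standard library's `csv` module in place of pandas so it runs without third-party packages. The column names, the `write_results` helper, and the sample row are hypothetical.

```python
import csv
import io
from typing import Dict, Iterable, TextIO

# Hypothetical column names; the project's actual CSV header is not shown.
COLUMNS = ["PubmedID", "Title", "CompanyAuthors", "Email"]


def write_results(rows: Iterable[Dict[str, str]], out: TextIO) -> None:
    """Write one CSV row per paper; missing keys become empty cells."""
    writer = csv.DictWriter(out, fieldnames=COLUMNS, extrasaction="ignore")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)


if __name__ == "__main__":
    sample_rows = [
        {
            "PubmedID": "111",
            "Title": "A made-up industry study",
            "CompanyAuthors": "Jane Doe",
            "Email": "jane.doe@examplepharma.com",
        }
    ]
    buffer = io.StringIO()
    write_results(sample_rows, buffer)
    print(buffer.getvalue())
```

With pandas, the equivalent would likely be `pandas.DataFrame(rows).to_csv(path, index=False)`. When writing to a real file with the `csv` module, open it with `newline=""` as the module's documentation recommends.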
Installation and Execution
Prerequisites
- Python 3.8 or higher.
- Poetry (for dependency management).
Installation Steps
- Clone the repository:
git clone https://github.com/sanjudev1/get-papers-list.git