A project to check if articles are free or paid

Project description

JournalPDFScraper

This project was started as a way to find and download free PDFs from medical journals. The goal is to speed up the early stage of medical research, where the researcher must search for free PDFs and compile a list of paid ones to purchase. With this tool, the researcher simply runs the script to identify the free PDFs.

As noted in the Status section below, the project will no longer download PDFs, because downloading proved too inconsistent from journal to journal. Instead, the project will focus on determining whether an article is free or paid.

Status

  • The project is undergoing a few fixes to make it workable, but as it stands I will not be adding any new journals.
  • You may notice 'Base.py' and 'BaseSoup.py' files, as well as a 'BMJSoupScraper.py' alongside the other scrapers. I intend to convert all of the scrapers to use Beautiful Soup when I have time, as I don't expect to support downloading files.
    • Note: Downloading files is too hit-and-miss from journal to journal. What I really want is functionality that says whether an article is free or not and provides the link.
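
The "free or not, plus the link" idea from the note above can be sketched roughly as follows. This is not the project's code: `ArticleStatus`, `check_article`, and the rule "a direct PDF link on the page means the article is free" are all assumptions made for illustration, and real journals will each need their own checks. To stay dependency-free, the sketch uses the standard library's `html.parser` as a stand-in for Beautiful Soup, which is what the project intends to use.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Optional
from urllib.parse import urljoin


@dataclass
class ArticleStatus:
    """Hypothetical result type: the article URL, a free/paid flag, and a link."""
    url: str
    is_free: bool
    pdf_link: Optional[str]


class _PdfLinkFinder(HTMLParser):
    """Records the href of the first <a> tag whose target ends in .pdf."""

    def __init__(self) -> None:
        super().__init__()
        self.pdf_href: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "a" or self.pdf_href is not None:
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the file extension.
        if href and href.lower().split("?")[0].endswith(".pdf"):
            self.pdf_href = href


def check_article(url: str, html: str) -> ArticleStatus:
    """Classify an article page that has already been fetched.

    Assumption (not from the project): a direct PDF link on the page is
    treated as a sign that the article is free. Nothing is downloaded.
    """
    finder = _PdfLinkFinder()
    finder.feed(html)
    # Resolve relative hrefs against the article URL.
    pdf_link = urljoin(url, finder.pdf_href) if finder.pdf_href else None
    return ArticleStatus(url=url, is_free=pdf_link is not None, pdf_link=pdf_link)
```

The function takes HTML that has already been fetched, so the same check works with whatever retrieval method a given scraper uses, and the page-parsing half could be swapped for Beautiful Soup without changing the returned `ArticleStatus`.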

Download files

Source Distribution

journalpdfscraper-0.2.1.tar.gz (6.8 kB)

Uploaded Source

Built Distribution

journalpdfscraper-0.2.1-py3-none-any.whl (9.7 kB)

Uploaded Python 3

File details

Details for the file journalpdfscraper-0.2.1.tar.gz.

File metadata

  • Download URL: journalpdfscraper-0.2.1.tar.gz
  • Upload date:
  • Size: 6.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.8.5

File hashes

Hashes for journalpdfscraper-0.2.1.tar.gz
Algorithm Hash digest
SHA256 6e54ab4ee803e2da27997a04b44e8f1d4da1497589c47a0b018672d649786549
MD5 2328eb3d1c574b27feb084695c063d2d
BLAKE2b-256 51042fac64edaf6c4fff3df5ee250397f4057286f9390cf95a263ae5efacebbd

File details

Details for the file journalpdfscraper-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: journalpdfscraper-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.8.5

File hashes

Hashes for journalpdfscraper-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 124c2975afba12e21456640849d56bf8e5628557c3a52a219b209599fd2be3d6
MD5 4d9a2af5c64b9f8db13aad9f1c2e7146
BLAKE2b-256 4fa193ab062e123c1a77b7fcffa29270dffe2299e8ef7a5b13dcfa529b8ea7ae
