Skip to main content

Collects Grad School data from https://www.usnews.com and gives output in a .xls file.

Project description

U.S.News Scrapper is a Python library that collect data from the website of usnews and output those data in a file for offline usage. Till now, it is only capable of collecting graduate schools data and output it in .xls format. After generating the .xls file, it will be opened by default excel file opener. Visit github_ page for detailed informations.

Setup

Visit github_ page for detailed informations.

$ pip install usnews_scrapper

Usage

usage: python usnews_scrapper.py [-h] -u URL [-o OUTPUTFILENAME] [-p PAUSETIME] [–from STARTPAGE] [–to ENDPAGE]

Collects data from usnews and generates excel file

optional arguments: -h, –help Show this help message and exit -u URL, –url URL The usnews address to collect data from. Put the URL within qoutes i.e. “ or ‘ . -o OUTPUTFILENAME The output file name without extension. -p PAUSETIME, –pause PAUSETIME The pause time between loading pages from usnews. –from STARTPAGE The page number from which the scrapper starts working. –to ENDPAGE The page number to which the scrapper works.

Examples

Copy the address of the page from usnews website and in the Command Prompt and enter this command -

$ cd USNews-Scrapper
$ python usnews_scrapper.py –url=”https://www.usnews.com/best-graduate-schools/top-science-schools/computer-science-rankings” -o file_name -p 2 –from=2 –to=5

The output file will be saved in current directory under the name of file_name_*.xls

Authors

Project details


Release history Release notifications | RSS feed

This version

v0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

usnews_scrapper-v0.1.tar.gz (5.9 kB view details)

Uploaded Source

File details

Details for the file usnews_scrapper-v0.1.tar.gz.

File metadata

  • Download URL: usnews_scrapper-v0.1.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.2

File hashes

Hashes for usnews_scrapper-v0.1.tar.gz
Algorithm Hash digest
SHA256 61d9a1ec00ca65a1302ecda5ae5e567fedd0a46f2de60e6ab1640ce8c519fb10
MD5 2fc88dee57f4604604267e5d77fc52b7
BLAKE2b-256 c388cc60a244d2d047bcc1532275f8e244f59e59dca237737f01bb6d0396ce01

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page