Skip to main content

Python library for scraping Google Scholar.

Project description

Version 0.2 works with Python3.

Installation

$ pip install google-scholar-scraper

Command-line usage

$ gsscraper "neeman grothendieck duality"

will return the first result from Google Scholar matching this query, in XML format.

$ gsscraper "neeman grothendieck duality" -n 5

will return the five results from Google Scholar matching this query. (Max is 10 results.)

Library usage

import gsscraper

query = "neeman grothendieck duality"
gsscraper.get_result(query) # (a)
gsscraper.get_results(query, 5) # (b)
gsscraper.get_result_as_xml(query) # (c)

Here,

  1. will return a Python dict with keys “title”, “author”, etc.;

  2. will return a list of such Python dicts;

  3. will return a list of strings in XML format.

License

GPL

Authors

Adeel Khan (@adeel)

Parts of the code are derived from the package gscholar by Bastian Venthur (@venthur).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

google-scholar-scraper-0.2.tar.gz (3.0 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page