Python library for scraping Google Scholar.

Project description

Version 0.2 works with Python 3.

Installation

$ pip install google-scholar-scraper

Command-line usage

$ gsscraper "neeman grothendieck duality"

will return the first result from Google Scholar matching this query, in XML format.

$ gsscraper "neeman grothendieck duality" -n 5

will return the first five results from Google Scholar matching this query. (The maximum is 10 results.)

Library usage

import gsscraper

query = "neeman grothendieck duality"
gsscraper.get_result(query) # (a)
gsscraper.get_results(query, 5) # (b)
gsscraper.get_result_as_xml(query) # (c)

Here,

  (a) will return a Python dict with keys "title", "author", etc.;

  (b) will return a list of such Python dicts;

  (c) will return a list of strings in XML format.
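
As a minimal sketch of working with the dicts returned by (a) and (b), the snippet below turns each result into a one-line summary. The helpers format_result and print_top_results are hypothetical names introduced here for illustration and are not part of gsscraper. The sketch assumes that the "title" and "author" values are strings, and it falls back to placeholders when a key is missing.

```python
def format_result(result):
    """Return a one-line "author: title" summary for one result dict.

    Assumption: "title" and "author" hold strings; other keys are ignored.
    """
    title = result.get("title", "(untitled)")
    author = result.get("author", "(unknown author)")
    return f"{author}: {title}"


def print_top_results(query, count=5):
    """Print a summary line for each of the first `count` results."""
    # Imported here so that format_result can be used without the package.
    import gsscraper

    for result in gsscraper.get_results(query, count):
        print(format_result(result))
```

Calling print_top_results("neeman grothendieck duality") would query Google Scholar, so it needs network access and the installed package; format_result works on any dict of the shape described above.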

License

GPL

Authors

Adeel Khan (@adeel)

Parts of the code are derived from the package gscholar by Bastian Venthur (@venthur).
