Bibliographic capture system for non-scraping data sources

Google Scholar Report

Description

Google Scholar Report is a tool for collecting data from Google Scholar profiles and storing it with metadata for each scientific paper. It has three modes of use (generic, authenticated, and admin), which differ in the amount and quality of the metadata collected. The default output format is xlsx.

Usage from the Python interpreter

Package installation

$ pip install GoogleScholarReport

For the first option of use (generic), use:

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', ouput='json')

Example

>>> from GoogleScholarReport import collector
>>> collector.gsr('https://scholar.google.com/citations?user=1sKULCoAAAAJ&hl=en', ouput='json')

For the second option of use (authenticated user):

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', email='user_email_google_scholar', password='pass_user_gs', ouput='json')

Finally, for admin mode, use:

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', email='user_email_google_scholar', password='pass_user_gs', ouput='csv', admin=True)  # ouput may also be 'json'
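
The three calls above differ only in which keyword arguments are supplied. As a rough illustration, the hypothetical helper `gsr_kwargs` below (not part of the package) assembles those arguments for one mode at a time. It assumes that admin mode needs credentials, as the admin example suggests, and it keeps the `ouput` spelling used in the examples above.

```python
def gsr_kwargs(output="json", email=None, password=None, admin=False):
    """Build keyword arguments for collector.gsr for one of the three modes."""
    # Credentials only make sense as a pair.
    if bool(email) != bool(password):
        raise ValueError("email and password must be given together")
    # Assumption: admin mode is only meaningful for an authenticated user.
    if admin and not email:
        raise ValueError("admin mode needs email and password")
    kwargs = {"ouput": output}  # keyword spelled as in the examples above
    if email:
        kwargs.update(email=email, password=password)
    if admin:
        kwargs["admin"] = True
    return kwargs


print(gsr_kwargs())  # generic mode
print(gsr_kwargs("csv", email="user@example.com", password="secret", admin=True))

# Usage (needs the package and network access):
# from GoogleScholarReport import collector
# collector.gsr('url_to_google_scholar_profile', **gsr_kwargs())
```

Building the arguments in one place makes it harder to pass an email without a password, or to request admin mode without logging in.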

Usage from command-line

From the command line, the tool offers the same three modes (generic, authenticated, and admin), which differ in the amount and quality of the metadata collected.

For the first option of use (generic), use:

collector "url_for_the_google_scholar_profile"

Example:

collector "https://scholar.google.com/citations?user=1sKULCoAAAAJ&hl=en"

The above command writes one xlsx report to the current working directory with the following metadata:

'title', 'author', 'journal', 'volume', 'number', 'pages', 'year', 'cite_id', 'cites', 'TitleU'.

If you want the output in csv or json format, add the --output flag followed by the desired format, for instance:

collector "url_for_the_google_scholar_profile" --output csv

For the second option of use (authenticated user):

collector "url_for_the_google_scholar_profile" --email <email> --password <password>

This writes one xlsx report to the current working directory with the following metadata:

'cite_id', 'cites', 'publisher', 'year', 'pages', 'number', 'volume', 'journal', 'author', 'title', 'ENTRYTYPE', 'ID', 'school', 'booktitle', 'organization', 'note', 'month', 'institution'.

Finally, for admin mode, issue:

collector "url_for_the_google_scholar_profile" --email <email> --password <password> --admin

By default this returns an xlsx file with the same metadata as the second option plus one additional field, 'bibtex'.
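
To see at a glance what each mode adds, the field lists given in this section can be compared as sets. This is only a sketch that restates the lists above; the names `GENERIC`, `AUTHENTICATED`, and `ADMIN` are illustrative and do not come from the package.

```python
# Field names copied from the lists in this README.
GENERIC = {
    "title", "author", "journal", "volume", "number",
    "pages", "year", "cite_id", "cites", "TitleU",
}
AUTHENTICATED = {
    "cite_id", "cites", "publisher", "year", "pages", "number",
    "volume", "journal", "author", "title", "ENTRYTYPE", "ID",
    "school", "booktitle", "organization", "note", "month", "institution",
}
# Admin mode is described as the authenticated fields plus 'bibtex'.
ADMIN = AUTHENTICATED | {"bibtex"}

print("added by authenticated mode:", sorted(AUTHENTICATED - GENERIC))
print("only in generic mode:", sorted(GENERIC - AUTHENTICATED))
print("added by admin mode:", sorted(ADMIN - AUTHENTICATED))
```

According to these lists, authenticated mode adds nine fields, drops 'TitleU', and admin mode adds 'bibtex' on top.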

In general, the command-line tool has the following form:

collector "url_for_the_google_scholar_profile" --email <user_email> --password <password> --output <format> --admin
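
The general form above can be mirrored with `argparse` to make explicit which arguments are optional. The sketch below is not the package's actual implementation. It assumes that only the URL is required, that `--output` accepts xlsx, csv, or json and defaults to xlsx, and that `--admin` is a plain on/off flag.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented command-line form."""
    parser = argparse.ArgumentParser(prog="collector")
    parser.add_argument("url", help="URL of the Google Scholar profile")
    parser.add_argument("--email", help="Google Scholar account email")
    parser.add_argument("--password", help="Google Scholar account password")
    # Assumption: xlsx is the default, as stated in the description.
    parser.add_argument("--output", choices=["xlsx", "csv", "json"], default="xlsx")
    parser.add_argument("--admin", action="store_true", help="enable admin mode")
    return parser


args = build_parser().parse_args(
    ["https://scholar.google.com/citations?user=1sKULCoAAAAJ&hl=en", "--output", "csv"]
)
print(args.output, args.admin)  # prints: csv False
```

Quoting the URL on a real command line, as in the examples above, keeps the shell from interpreting the `&` in the query string.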
