Bibliographic capture system for non-scraping data sources

Google Scholar Report

Description

Google Scholar Report is a tool for collecting data from Google Scholar profiles and storing it with metadata for each scientific paper. The tool has three modes of use: generic, authenticated, and admin, which differ in the amount and quality of the metadata returned. The default output format is xlsx.

Usage from the Python CLI

Package installation

$ pip install GoogleScholarReport

For the first mode (generic), issue:

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', ouput='json')

For the second mode (authenticated user), issue:

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', email='user_email_google_scholar', password='pass_user_gs', ouput='json')

Finally, for admin mode, issue:

>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', email='user_email_google_scholar', password='pass_user_gs', ouput='csv', admin=True)  # ouput may be 'csv' or 'json'

Usage from the command line

From the command line, the tool has the same three modes of use: generic, authenticated, and admin, which differ in the amount and quality of the metadata returned.

For the first mode (generic), issue:

python3 collector.py "url_for_the_google_scholar_profile"

The above command writes one xlsx report to the current working directory with the following metadata fields:

'title', 'author', 'journal', 'volume', 'number', 'pages', 'year', 'cite_id', 'cites', 'TitleU'.

If you want the output in csv or json format, add the --output flag followed by the desired format, for instance:

python3 collector.py "url_for_the_google_scholar_profile" --output csv
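Once a csv report exists, its header can be checked against the generic field list above. The following is a minimal sketch using only the standard library. The helper name missing_generic_fields and the sample row are hypothetical, and it assumes the csv header uses the field names exactly as listed; the name of the report file is not documented here, so a placeholder is used.

```python
import csv
import io

# Field names for the generic mode, copied from the list above.
GENERIC_FIELDS = ['title', 'author', 'journal', 'volume', 'number',
                  'pages', 'year', 'cite_id', 'cites', 'TitleU']


def missing_generic_fields(csv_file):
    """Return the generic-mode fields absent from the csv header.

    Hypothetical helper, not part of GoogleScholarReport.
    """
    reader = csv.DictReader(csv_file)
    header = reader.fieldnames or []
    return [name for name in GENERIC_FIELDS if name not in header]


# Placeholder data standing in for a real report. For a real file, use
# open('<report>.csv', newline='') with the actual report path instead.
sample = io.StringIO(
    'title,author,journal,volume,number,pages,year,cite_id,cites,TitleU\n'
    't,a,j,1,1,1-2,2020,id,0,T\n'
)
print(missing_generic_fields(sample))
```

An empty list means every expected field is present in the header.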

For the second mode (authenticated user), issue:

python3 collector.py "url_for_the_google_scholar_profile" --email <email> --password <password>

This writes one xlsx report to the current working directory with the following metadata fields:

'cite_id', 'cites', 'publisher', 'year', 'pages', 'number', 'volume', 'journal', 'author', 'title', 'ENTRYTYPE', 'ID', 'school', 'booktitle', 'organization', 'note', 'month', 'institution'
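To see what the authenticated mode adds over the generic mode, the two field lists can be compared as sets. This is only an illustration built from the lists in this description; the variable names are arbitrary and nothing here calls the package itself.

```python
# Field lists copied from the generic and authenticated sections above.
GENERIC_FIELDS = {
    'title', 'author', 'journal', 'volume', 'number',
    'pages', 'year', 'cite_id', 'cites', 'TitleU',
}
AUTHENTICATED_FIELDS = {
    'cite_id', 'cites', 'publisher', 'year', 'pages', 'number',
    'volume', 'journal', 'author', 'title', 'ENTRYTYPE', 'ID',
    'school', 'booktitle', 'organization', 'note', 'month', 'institution',
}

# Fields that appear only in the authenticated report.
only_authenticated = sorted(AUTHENTICATED_FIELDS - GENERIC_FIELDS)
# Fields that appear only in the generic report.
only_generic = sorted(GENERIC_FIELDS - AUTHENTICATED_FIELDS)

print(only_authenticated)
# ['ENTRYTYPE', 'ID', 'booktitle', 'institution', 'month', 'note',
#  'organization', 'publisher', 'school']
print(only_generic)
# ['TitleU']
```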

Finally, for admin mode, issue:

python3 collector.py "url_for_the_google_scholar_profile" --email <email> --password <password> --admin

By default this returns an xlsx file with the same metadata as the second mode plus one additional field, 'bibtex'.

In general, the command-line tool has the following form:

python3 collector.py "url_for_the_google_scholar_profile" --email <user_email> --password <password> --output <format> --admin
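When the tool is driven from another script, the general form above can be assembled as an argument list. The sketch below is a hypothetical helper (build_command is not part of the package); it uses only the flags shown in this description and assumes collector.py is in the current working directory.

```python
def build_command(url, email=None, password=None, output=None, admin=False):
    """Assemble the collector.py invocation as an argument list.

    Hypothetical helper; only flags documented above are emitted,
    and optional flags are omitted when not supplied.
    """
    cmd = ['python3', 'collector.py', url]
    if email is not None:
        cmd += ['--email', email]
    if password is not None:
        cmd += ['--password', password]
    if output is not None:
        cmd += ['--output', output]
    if admin:
        cmd.append('--admin')
    return cmd


# Generic mode with csv output.
print(build_command('url_for_the_google_scholar_profile', output='csv'))
```

The resulting list can be passed to subprocess.run(cmd, check=True) from the standard library, which avoids shell quoting problems with the profile URL.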


Download files

Source Distribution

GoogleScholarReport-0.1.0.tar.gz (9.4 kB)

Uploaded Source

Built Distribution

GoogleScholarReport-0.1.0-py3-none-any.whl (9.2 kB)

Uploaded Python 3
