Bibliographic capture system for non-scrapping data sources
Project description
Google Scholar Report
Description
Google scholar report is a tool for collect data from google scholar profiles and store it with metadata for each scientific paper. This tool have three main forms of use: generic, authenticated and admin; which difer in amount and cuality of metadata results. Default output is xlsx.
Usage from python-cli
Installation Package
$pip install GoogleScholarReport
For the firts option of use (generic), issue:
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', ouput='json')
For the second option of use (user authenticate):
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile',email='user_email_google_scholar',password='pass_user_gs',' ouput='json')
Finally, for admin mode, issue:
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile',email='user_email_google_scholar',password='pass_user_gs', ouput='some_ouput(csv,json)',admin=True)
Usage from comman-line
From comman line, this tool have three main forms of use: generic, authenticated and admin; which difer in amount and cuality of metadata results.
For the firts option of use (generic), issue:
python3 collector.py "url_for_the_google_scholar_profile"
The above option return one xlsx file report in the current working directory with the following metadata:
'title', 'author', 'journal', 'volume', 'number','pages', 'year', 'cite_id', 'cites', 'TitleU'.
If you want the output in csv or json format agregate the bellow flag and the desire ouput format, for instance:
python3 collector.py "url_for_the_google_scholar_profile" --output csv
For the second option of use (user authenticate):
python3 collector.py "url_for_the_google_scholar_profile" --email <email> --password <password>
This return one xlsx file report in the current working directory with the following metadata:
'cite_id', 'cites', 'publisher', 'year', 'pages', 'number', 'volume', 'journal', 'author', 'title','ENTRYTYPE', 'ID', 'school', 'booktitle', 'organization', 'note','month', 'institution'
Finally, for admin mode, issue:
python3 collector.py "url_for_the_google_scholar_profile" --email <email> --password <password> --admin
This return by default an xlsx file with the same metadata that option two plus one fiedl 'bibtex'.
In general this commanline tool have the following form:
python3 collector.py "url_for_the_google_scholar_profile" --email <user_email> --password <password> --output <format> --admin
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for GoogleScholarReport-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4e541ac23e3381db0c42148ed59db958c6ecb8f38b6b3b66c1e5fe7a75087c77 |
|
MD5 | 6f2e944ab78dc1edcf3ddef4ccae41a4 |
|
BLAKE2b-256 | 97fc88c6b19cb4ca2ceef211f164cfbfa9d1c08d5ea75c589eda6e27cb5d3f24 |
Hashes for GoogleScholarReport-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2fe3c86b77b437c598719597dc44f032ec78a581014127a5555e97e3f35bc5ab |
|
MD5 | 8a7f8661dac4d349a28251c448498dcd |
|
BLAKE2b-256 | d896d6a0644848b2f10f7f7ed1536e5775ac6d1e51b3ddf96f239a5980027b84 |