Bibliographic capture system for non-scrapping data sources
Project description
Google Scholar Report
Description
Google Scholar Report is a tool for collecting data from Google Scholar profiles and storing it with metadata for each scientific paper. This tool has three main forms of use: generic, authenticated and admin; which differ in the amount and quality of the collected metadata. The default output is xlsx.
Usage from python-cli
Installation Package
$pip install GoogleScholarReport
For the first option of use (generic), use:
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile', ouput='json')
Example
>>> from GoogleScholarReport import collector
>>> collector.gsr('https://scholar.google.com/citations?user=1sKULCoAAAAJ&hl=en', ouput='json')
For the second option of use (user authenticate):
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile',email='user_email_google_scholar',password='pass_user_gs',' ouput='json')
Finally, for admin mode, use:
>>> from GoogleScholarReport import collector
>>> collector.gsr('url_to_google_scholar_profile',email='user_email_google_scholar',password='pass_user_gs', ouput='some_ouput(csv,json)',admin=True)
Usage from command-line
From command line, this tool has three main forms of use: generic, authenticated and admin; which differ in amount and quality of the collected metadata results.
For the first option of use (generic), use:
collector "url_for_the_google_scholar_profile"
Example:
collector "https://scholar.google.com/citations?user=1sKULCoAAAAJ&hl=en"
The above option return one xlsx file report in the current working directory with the following metadata:
'title', 'author', 'journal', 'volume', 'number','pages', 'year', 'cite_id', 'cites', 'TitleU'.
If you want the output in csv or json format aggregate the bellow flag and the desire output format, for instance:
collector "url_for_the_google_scholar_profile" --output csv
For the second option of use (user authenticate):
collector "url_for_the_google_scholar_profile" --email <email> --password <password>
This return one xlsx file report in the current working directory with the following metadata:
'cite_id', 'cites', 'publisher', 'year', 'pages', 'number', 'volume', 'journal', 'author', 'title','ENTRYTYPE', 'ID', 'school', 'booktitle', 'organization', 'note','month', 'institution' Finally, for admin mode, issue:
collector "url_for_the_google_scholar_profile" --email <email> --password <password> --admin
This returns by default an xlsx file with the same metadata that option two plus one fiedl 'bibtex'.
In general this command line tool have the following form:
collector "url_for_the_google_scholar_profile" --email <user_email> --password <password> --output <format> --admin
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file GoogleScholarReport-0.1.7.tar.gz
.
File metadata
- Download URL: GoogleScholarReport-0.1.7.tar.gz
- Upload date:
- Size: 9.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ded078fa496c64920c1ec94d5fbe0a252b7f08ef1b6d6bbeb29df2590ccf4d04 |
|
MD5 | 80df74281fd8ce059ffc818e7d6830c0 |
|
BLAKE2b-256 | 6788cdd8f81bca6c5034a9eb1f3ed23a486f664cfe73aeb4413efabbb95b973f |
File details
Details for the file GoogleScholarReport-0.1.7-py3-none-any.whl
.
File metadata
- Download URL: GoogleScholarReport-0.1.7-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 028d6bfff37507e7e8b3e1ac69056d90874db3d6340bd719bb15996e21bf2771 |
|
MD5 | e0c50bf1bed949cb5f8aa83dae185c9f |
|
BLAKE2b-256 | 0d653064fe330d7dc985826ce61a79bf1c45dcf55771bfd2f485c92e486c3b49 |