Skip to main content

A Python library for scraping Google Scholar profiles using Playwright

Project description

ScholarScraper

A Python library for scraping Google Scholar profiles using Playwright.

Installation

pip install scholarscraper

Usage

from scholarscraper import fetch_scholar_data
import asyncio

async def main():
    # Fetch data for a Google Scholar profile
    data = await fetch_scholar_data("SCHOLAR_ID")
    print(f"Name: {data['name']}")
    print(f"Total citations: {data['citations']['all']}")
    print(f"h-index: {data['h_index']['all']}")

if __name__ == "__main__":
    asyncio.run(main())

Features

  • Fetch author information
  • Get citation statistics (all-time and recent)
  • Get publication list with citations
  • Async/await support
  • Clean, typed interface

Requirements

  • Python 3.8+
  • Playwright

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scholarlyscraper-0.1.0-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file scholarlyscraper-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for scholarlyscraper-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 11aefe80a8a0c4a2bde870a51c9abd76cc8c30447ed92e7ba99cf58280e88a54
MD5 e7e84dd01bf2f4f12dc02ab8d87bfa5f
BLAKE2b-256 94b84bb2540b3e76d32bcbd665a94e49982cc814e4522b0ea72e80664e5733fb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page