Scrape user data from Goodreads

Goodreads User Scraper

Scrape Goodreads User Data: Profile, Book Shelves, Books, Authors

Usage

Using pip:

pip install goodreads-user-scraper
goodreads-user-scraper --user_id <your id> --output_dir goodreads-data

Using pipx:

pipx run goodreads-user-scraper --user_id <your id> --output_dir goodreads-data
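
For scripted runs, the same command can be launched from Python. The following is a minimal sketch, not part of the package: build_command and run_scraper are hypothetical helper names, and it assumes the goodreads-user-scraper executable is already installed and on PATH. Only the --user_id and --output_dir options shown above are used.

```python
import subprocess


def build_command(user_id, output_dir="goodreads-data"):
    """Build the argument list for the CLI (hypothetical helper, not part of the package)."""
    return [
        "goodreads-user-scraper",
        "--user_id", str(user_id),
        "--output_dir", str(output_dir),
    ]


def run_scraper(user_id, output_dir="goodreads-data"):
    """Run the CLI; raises CalledProcessError if it exits with a non-zero status."""
    subprocess.run(build_command(user_id, output_dir), check=True)
```

Passing the arguments as a list avoids shell quoting issues if the output directory contains spaces.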

Arguments

--user_id

  • Description: The ID of the Goodreads user whose data should be scraped. It is usually the number in the profile URL (goodreads.com/user/show/<id>-<name>).
  • Required: Yes
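
If you have a profile URL rather than the bare id, the id can be pulled out with a short helper. This is a sketch under an assumption: it expects profile URLs of the form https://www.goodreads.com/user/show/<digits>-<name>, which is not stated by this project. extract_user_id is a hypothetical name, not something the package provides.

```python
import re

# Assumed URL shape: https://www.goodreads.com/user/show/<digits>-<name>
_PROFILE_RE = re.compile(r"goodreads\.com/user/show/(\d+)")


def extract_user_id(profile_url):
    """Return the numeric id as a string, or None if the URL does not match."""
    match = _PROFILE_RE.search(profile_url)
    return match.group(1) if match else None
```

The returned string can be passed directly as the value of --user_id.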

--output_dir

  • Description: The directory where all scraped data will be output.
  • Required: No
  • Default: goodreads-data

--skip_user_info

  • Description: Whether the script should skip scraping user information.
  • Required: No
  • Default: False

--skip_shelves

  • Description: Whether the script should skip scraping shelves.
  • Required: No
  • Default: False

--skip_authors

  • Description: Whether the script should skip scraping authors.
  • Required: No
  • Default: False

Troubleshooting

If the scraper cannot read your data, make sure that your profile is viewable by anyone:

  1. Navigate to the Goodreads Account Settings page
  2. Click on the Settings tab
  3. In the Privacy section, under "Who Can View My Profile", select "anyone"

Development

  1. Clone the GitHub repository

    git clone https://github.com/YashTotale/goodreads-user-scraper.git
    
  2. Run the install script

    sh scripts/install.sh
    
  3. Make changes

  4. Run the test script

    sh scripts/test.sh
    

Publishing

  1. Create a .env file containing your Twine (PyPI upload) credentials

    TWINE_USERNAME=<foo>
    TWINE_PASSWORD=<bar>
    
  2. Run the publish script

    sh scripts/publish.sh <patch|minor|major>
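
Before running the publish script, it can help to confirm that both credentials are present in .env. The check below is a minimal sketch: how scripts/publish.sh actually reads .env is not shown here, so this only inspects the file's KEY=VALUE lines. parse_env and missing_keys are hypothetical helper names.

```python
REQUIRED_KEYS = ("TWINE_USERNAME", "TWINE_PASSWORD")


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(text):
    """Return the required keys that are absent or empty in the given .env text."""
    values = parse_env(text)
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

For example, calling missing_keys on the contents of .env (read with pathlib.Path(".env").read_text()) returns an empty list when both variables are set.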
    

Download files

Source distribution: goodreads-user-scraper-1.2.4.tar.gz (7.1 kB)

Built distribution: goodreads_user_scraper-1.2.4-py3-none-any.whl (8.9 kB, Python 3)
