Skip to main content

Some utils for scraping the PCS web

Project description

Overview

Selenium scraper for fetching information from the ProCyclingStats page.

Setup

Setting up virtualenv

python3 -m pip install virtualenv
virtualenv .
source bin/activate

Installing requirements

pip install -r requirements.txt
sudo apt-get install xvfb
sudo apt-get install -y chromium-chromedriver

Usage

import pcs_scraper

pcs_scraper.scrap_teams_by_year(2021)
pcs_scraper.scrap_stage('tour-de-france', 2020, 2)
pcs_scraper.scrap_stage_gc('tour-de-france', 2020, 2)
pcs_scraper.scrap_final_gc('tour-de-france', 2021)
pcs_scraper.scrap_final_points('tour-de-france', 2021)
pcs_scraper.scrap_final_kom('tour-de-france', 2021)
pcs_scraper.scrap_final_youth('tour-de-france', 2021)
pcs_scraper.scrap_final_teams('tour-de-france', 2021)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pcs-utils-0.0.1.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pcs_utils-0.0.1-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file pcs-utils-0.0.1.tar.gz.

File metadata

  • Download URL: pcs-utils-0.0.1.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.5

File hashes

Hashes for pcs-utils-0.0.1.tar.gz
Algorithm Hash digest
SHA256 8768d22f7c8f886769c9ec9339c832fb72c5484c5612d92140581bf80cbdaf0f
MD5 7ef4316e09224b205d3a72ef62f0320b
BLAKE2b-256 d29bac902251cb6ea3c3cea6591983aba3dff4a8764fcf15f6db5a65bcc1a1d5

See more details on using hashes here.

File details

Details for the file pcs_utils-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: pcs_utils-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.5

File hashes

Hashes for pcs_utils-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 311d7b522be9abaf82a64df825e232ac8c5b427a995e7ad349319620bec0bd44
MD5 8a10dc0e345762d8a05e1fde3d455a09
BLAKE2b-256 89c47a2452a530e9c1b6bfa9eda82f04be167d4cd2df12331da7b19f3486922a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page