Skip to main content

Python Hockey Data Scraper

Project description

EdgingHockeyScraper

im https://pypi.python.org/pypi/edginghockeyscraper

Install

pip install edginghockeyscraper

Python Hockey Data Scraper

Features

  • Python Hockey Data Scraper with following features:

    • Caching Requests to quickly fetch data
    • Parallel Processing to speed up data fetch from NHL API
  • Get League schedule for year - Usageschedule = edginghockeyscraper.get_league_schedule(2024)

  • Get Game boxscore

    • boxscore = edginghockeyscraper.get_boxscore(2024020345)
  • Get Game playByPlay

    • playByPlay = edginghockeyscraper.get_play_by_play(2024020345)
  • Get Season boxscores

    • boxscoreSeason = edginghockeyscraper.get_boxscore_season(2024)
  • Get Season playByPlay

    • playByPlaySeason = edginghockeyscraper.get_play_by_play_season(2024)

Filter Season data by PreSeason, Regular, PostSeason gametypes

  • e.g.
    • games = edginghockeyscraper.get_league_schedule(2024, {GameType.REG})
    • boxscoreSeason = edginghockeyscraper.get_boxscore_season(2024, {GameType.REG})

Utilize requests-cache for fast repeated request calls

  • First Call:

    %%time
    edginghockeyscraper.get_boxscore_season(2024, cache= True)
    

    CPU times: user 583 ms, sys: 318 ms, total: 901 ms Wall time: 1min 18s

  • Second Call:

    • CPU times: user 374 ms, sys: 141 ms, total: 515 ms Wall time: 1.31 s

    A 60x speedup!

Utilize multiprocessing to improve request speed

Benchmark using 2024 Macbook Air Apple M3 16GB

  1. No Parallel getBoxscoreSeason: CPU times: user 15.6 s, sys: 3.55 s, total: 19.1 s Wall time: 8min 49s

  2. Parallel getBoxscoreSeason: CPU times: user 583 ms, sys: 318 ms, total: 901 ms Wall time: 1min 18s

    A ~7x Speedup! (this is an 8-core CPU - you can expect roughly a <# cpu-cores> speedup)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edginghockeyscraper-0.1.4.tar.gz (4.9 kB view details)

Uploaded Source

Built Distribution

edginghockeyscraper-0.1.4-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file edginghockeyscraper-0.1.4.tar.gz.

File metadata

  • Download URL: edginghockeyscraper-0.1.4.tar.gz
  • Upload date:
  • Size: 4.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for edginghockeyscraper-0.1.4.tar.gz
Algorithm Hash digest
SHA256 f385eb98bc418f187039764dfee100f140ada8a778a912c5c0857c4cda57cf34
MD5 15a9338e87d6a28a9c275252d2c7c41e
BLAKE2b-256 52bb80e86a0e5ce732a6e40f1ff19932e454036ef40b18e13cff33e8f43d6eec

See more details on using hashes here.

Provenance

The following attestation bundles were made for edginghockeyscraper-0.1.4.tar.gz:

Publisher: python-publish.yml on austinegri/edginghockeyscraper

Attestations:

File details

Details for the file edginghockeyscraper-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for edginghockeyscraper-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 5095a6f6072fe1121719fe87758a2166820cd73a0e60f98547f6308e4e090f7f
MD5 744e9340f915afcfb0eec0e6a1c3e53f
BLAKE2b-256 d7cc9ec6d099afab72f62e5b1832d02e7f762ad0a11f31933d496b90543dbe03

See more details on using hashes here.

Provenance

The following attestation bundles were made for edginghockeyscraper-0.1.4-py3-none-any.whl:

Publisher: python-publish.yml on austinegri/edginghockeyscraper

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page