Skip to main content

Scraper for ice hockey data

Project description

Hockey-API

Purpose

The purpose of this package is to scrape data from ice hockey leagues, currently the NHL, from available APIs and websites. The information that is scraped is both play by play and shift data, with support for all parts of the seasons (preseason, regular season, all star and playoffs) for seasons starting from 2010-2011.

Getting started

A short introduction to the prerequisites and installation.

Prerequisites

This package is built on Python, particularly Python3. To use this package, the Python version should be at least 3.7.0.

Installation

To install, simply open a terminal and run either of the following commands:

pip install hockey_api
python -m pip install hockey_api

Usage

To scrape data, three different functions are provided. Each function provides the same output structure, with a tuple containing play by play data for all the game(s) as well as the shifts.

  • To scrape a list of games, where the input should be a list of game ids:
from hockey_api import scrape_list_of_games

# Scrape the first three games of the 2021-2022 NHL season.
pbp, shifts = scrape_list_of_games(game_id_list=[2021020001, 2021020002, 2021020003])
  • To scrape a list of games between two dates:
from hockey_api import scrape_date_range

# Scrape the all the games during the month of October in 2021.
pbp, shifts = scrape_date_range(start_date="2021-10-01", end_date="2021-10-31")

# Scrape games from a given date.
pbp, shifts = scrape_date_range(start_date="2021-10-31")
  • To scrape a specific part of a season:
from hockey_api import scrape_season

# Scrape all games from the 2021-2022 NHL regular season.
pbp, shifts = scrape_season(season=2021, season_type="R")

# Scrape all games from the 2021-2022 NHL playoffs.
pbp_playoffs, shifts_playoffs = scrape_season(season=2021, season_type="P")

Saving data

To save data to a local file the following command should do:

pbp.to_csv("desired/path/pbp.csv", index=False)

Miscellaneous

An additional scraper is also available, called get_player_data(), which downloads metadata about all players in the NHL database. To use, simply run:

from hockey_api import get_player_data

# Scrape player data
players = get_player_data()

Contact

To get in contact with me send me an email at rasmus.safvenberg@gmail.com.

Copyright

Copyright (C) 2022-2023 Rasmus Säfvenberg

This file is part of hockey_api

hockey_api is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hockey_api-0.1.1.tar.gz (39.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hockey_api-0.1.1-py3-none-any.whl (43.0 kB view details)

Uploaded Python 3

File details

Details for the file hockey_api-0.1.1.tar.gz.

File metadata

  • Download URL: hockey_api-0.1.1.tar.gz
  • Upload date:
  • Size: 39.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.9.1 Windows/10

File hashes

Hashes for hockey_api-0.1.1.tar.gz
Algorithm Hash digest
SHA256 21875726214df07af64a8857c8f0e8d6e8203da5d3b39ff9a0ee8751c261c805
MD5 e1bf2fc3177b853b2941d441c8d24c44
BLAKE2b-256 e0b5cff4109a4920df701e93e7c425d3181abcb153f744e9409c0e2507808642

See more details on using hashes here.

File details

Details for the file hockey_api-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: hockey_api-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 43.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.9.1 Windows/10

File hashes

Hashes for hockey_api-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 89fe9536f8b1c3dcb7391bdde20d37ee2fdfeb22744276e50f4559bf569948fb
MD5 682922cc8487c5ecc119c452c56837f0
BLAKE2b-256 b90aba0b20ba0564c18e352a71bd9b1dc92196635312bfc598b951970c2b22b9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page