The TopDownHockey Scraper
TopDownHockey EliteProspects Scraper
By Patrick Bacon, made possible by the work of Marcus Sjölin.
This is a package built for scraping Elite Prospects, an extremely valuable website which makes hockey data for thousands of leagues available to the public.
This package is built strictly for end users who wish to scrape data for personal use. If you are interested in using this data for professional purposes, I recommend you look into the Elite Prospects API.
While using the scraper, please be mindful of the Elite Prospects servers.
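One way to stay mindful of the servers is to space out your own calls to the scraper. The sketch below is not part of the package: throttled is a hypothetical helper that wraps any function so that consecutive calls are at least min_interval seconds apart. Whether the package already pauses between requests internally is not stated here, so treat this as an extra precaution under that assumption.

```python
import time


def throttled(func, min_interval=1.0, clock=time.monotonic, sleep=time.sleep):
    """Wrap func so consecutive calls are at least min_interval seconds apart.

    Hypothetical helper, not part of TopDownHockey_Scraper. The clock and
    sleep parameters exist only so the behaviour can be checked without
    actually waiting.
    """
    last_call = [None]  # time at which the previous call finished

    def wrapper(*args, **kwargs):
        if last_call[0] is not None:
            wait = min_interval - (clock() - last_call[0])
            if wait > 0:
                sleep(wait)  # pause until the interval has elapsed
        result = func(*args, **kwargs)
        last_call[0] = clock()
        return result

    return wrapper
```

With the package imported as tdhepscrape, you could then write polite_get_skaters = throttled(tdhepscrape.get_skaters, min_interval=5.0) and call polite_get_skaters once per league in a loop. The five-second interval is an arbitrary illustration, not a documented limit.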
Installation
You can install the package by entering the following command in a terminal:
pip install TopDownHockey_Scraper
Then import the module using this statement:
import TopDownHockey_Scraper.TopDownHockey_EliteProspects_Scraper as tdhepscrape
User-End Functions
get_skaters(leagues, seasons)
Returns a dataframe containing statistics for all skaters in a target set of league(s) and season(s).
- leagues: One or more leagues. For a single league, pass a string, e.g. "nhl". For multiple leagues, pass a tuple or list, e.g. ("nhl", "ahl").
- seasons: One or more seasons. For a single season, pass a string, e.g. "2018-2019". For multiple seasons, pass a tuple or list, e.g. ("2018-2019", "2019-2020").
Example:
tdhepscrape.get_skaters(("nhl", "ahl"), ("2018-2019", "2019-2020"))
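To make the argument convention concrete, here is a small sketch of how a single string and a tuple or list can be normalized to the same form and expanded into league-season pairs. It assumes that every league is combined with every season, which the description above suggests but does not state outright. The helpers as_list and league_season_pairs are hypothetical names for illustration and are not functions of the package.

```python
from itertools import product


def as_list(value):
    """Return [value] for a single string, or a list copy of a tuple/list."""
    if isinstance(value, str):
        return [value]
    return list(value)


def league_season_pairs(leagues, seasons):
    """Pair every league with every season (assumed behaviour, see above)."""
    return list(product(as_list(leagues), as_list(seasons)))


pairs = league_season_pairs(("nhl", "ahl"), ("2018-2019", "2019-2020"))
for league, season in pairs:
    print(league, season)
# prints:
# nhl 2018-2019
# nhl 2019-2020
# ahl 2018-2019
# ahl 2019-2020
```

Under that assumption, the example call above would cover four league-season combinations, so request size grows quickly as you add leagues or seasons.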
get_goalies(leagues, seasons)
Returns a dataframe containing statistics for all goalies in a target set of league(s) and season(s).
- leagues: One or more leagues. For a single league, pass a string, e.g. "nhl". For multiple leagues, pass a tuple or list, e.g. ("nhl", "ahl").
- seasons: One or more seasons. For a single season, pass a string, e.g. "2018-2019". For multiple seasons, pass a tuple or list, e.g. ("2018-2019", "2019-2020").
Example:
tdhepscrape.get_goalies("khl", "2015-2016")
get_player_information(dataframe)
Returns a dataframe containing bio information for all skaters or goalies (or both) within a target dataframe.
- dataframe: The dataframe returned by get_skaters or get_goalies.
Example:
Say you obtain skater data for the KHL in 2020-2021 and store it as a dataframe called output. You can run this function to get bio information for every player in that league's scrape.
output = tdhepscrape.get_skaters("khl", "2020-2021")
tdhepscrape.get_player_information(output)
add_player_information(dataframe)
Returns a dataframe containing bio information for all skaters or goalies (or both) within a target dataframe as well as the statistics from the original dataframe.
- dataframe: The dataframe returned by get_skaters or get_goalies.
Example:
Say you obtain skater data for the KHL in 2020-2021 and store it as a dataframe called output. You can run this function to get the original statistics back with bio information added for every player in that league's scrape.
output = tdhepscrape.get_skaters("khl", "2020-2021")
tdhepscrape.add_player_information(output)
Comments, Questions, and Concerns.
My goal was to make this package as error-proof as possible. I believe I've accounted for every issue that could potentially throw off a scrape, but it's possible I've missed something.
If any issues arise, or you have any questions about the package, please do not hesitate to contact me on Twitter at @TopDownHockey or email me directly at patrick.s.bacon@gmail.com.
File details
Details for the file TopDownHockey_Scraper-1.0.6.tar.gz.
File metadata
- Download URL: TopDownHockey_Scraper-1.0.6.tar.gz
- Upload date:
- Size: 10.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | 1c6debf07a6ebb60d81a4ee858e11647caf5a662f1500716b0b7ab6cc4ec6531
MD5 | 67e702a299825cdcce4ae9008a8667af
BLAKE2b-256 | 79fec8215e4a45b162a242356301f2958cf333a5d1b59975fea73c13e003accf
File details
Details for the file TopDownHockey_Scraper-1.0.6-py3-none-any.whl.
File metadata
- Download URL: TopDownHockey_Scraper-1.0.6-py3-none-any.whl
- Upload date:
- Size: 9.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | ea8c690d29267f97b899b4d62eadc3f01a659d0347ddde4d853442f2e1938281
MD5 | 2fd6f53e52477c7f3c265c493db51b3a
BLAKE2b-256 | ee3b3a777c0303c03679a1d41f2f4c394bc3521903a07225b4e5ef131c99ef7c
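If you download one of these files by hand, you can compare its SHA256 digest against the value listed above. The following is a minimal sketch using only the Python standard library. The function names sha256_of_file and matches_published_hash are made up for illustration, and the local file path in the usage note is an assumption about where you saved the download.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, matches_published_hash("TopDownHockey_Scraper-1.0.6.tar.gz", "1c6debf07a6ebb60d81a4ee858e11647caf5a662f1500716b0b7ab6cc4ec6531") should return True if the source distribution in your current directory is intact.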