Skip to main content

A python package for scraping kenpom.com NCAA basketball data.

Project description

kenpompy - Basketball for Nerds

Documentation Status Build Status codecov

This python package serves as a convenient web scraper for kenpom.com, which provides tons of great NCAA basketball statistics and metrics. It requires a subscription to Ken Pomeroy's site for use, otherwise only the home page will be accessible. It's a small fee for a year of access, and totally worth it in my opinion.

Objective

Ultimately, this package is to allow both hobbyist and reknown sports analysts alike to get data from kenpom in a format more suitable for visualization, transformation, and additional analysis. It's meant to be simple, easy to use, and to yield information in a way that is immediately usable.

Responsible Use

As with many web scrapers, the responsibility to use this package in a reasonable manner falls upon the user. Don't be a jerk and constantly scrape the site a thousand times a minute or you run the risk of potentially getting barred from it, which you'd likely deserve. I am in no way responsible for how you use (or abuse) this package. Be sensible.

But I Use R

Yeah, yeah, but have you heard of reticulate? It's an R interface to python that also supports passing objects (like dataframes!) between them.


Installation

kenpompy is easily installed via pip:

pip install kenpompy

What It Can (and Can't) Do

This a work in progress - it can currently scrape all of the summary, FanMatch, and miscellaneous tables, pretty much all of those under the Stats and Miscellany headings. Team and Player classes are planned, but they're more complicated and will take some time.

Usage

kenpompy is simple to use. Generally, tables on each page are scraped into pandas dataframes with simple parameters to select different seasons or tables. As many tables have headers that don't parse well, some are manually altered to a small degree to make the resulting dataframe easier to interpret and manipulate.

First, you must login:

from kenpompy.utils import login

# Returns an authenticated browser that can then be used to scrape pages that require authorization.
browser = login(your_email, your_password)

Then you can request specific pages that will be parsed into convenient dataframes:

import kenpompy.summary as kp

# Returns a pandas dataframe containing the efficiency and tempo stats for the current season (https://kenpom.com/summary.php).
eff_stats = kp.get_efficiency(browser)

Contributing

You can contribute by creating issues to highlight bugs and make suggestions for additional features. Pull requests are also very welcome.

License

kenpompy is released on the GNU GPL v3.0 license. You are free to use, modify, or redistribute it in almost any way, provided you state changes to the code, disclose the source, and use the same license. It is released with zero warranty for any purpose and I retain no liability for its use. Read the full license for additional details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kenpompy-0.2.1.tar.gz (12.7 kB view details)

Uploaded Source

Built Distribution

kenpompy-0.2.1-py3-none-any.whl (24.7 kB view details)

Uploaded Python 3

File details

Details for the file kenpompy-0.2.1.tar.gz.

File metadata

  • Download URL: kenpompy-0.2.1.tar.gz
  • Upload date:
  • Size: 12.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.8.1

File hashes

Hashes for kenpompy-0.2.1.tar.gz
Algorithm Hash digest
SHA256 4d30aa32e355f4fe6ac44fc622adc3b9cd588ee4b3da645d44db33678c00a7d4
MD5 f2186cbf3b50c01addeac6a147305b9e
BLAKE2b-256 44b403f02f0e9f2415308fa91a5bdd8d08d88472bd865378bc293656780faef1

See more details on using hashes here.

File details

Details for the file kenpompy-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: kenpompy-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 24.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.8.1

File hashes

Hashes for kenpompy-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9d9ec08f54af3264f6e4127df1e77dc48cc45e5fd4e6c0c85916128287dac47d
MD5 f35330092b67c84fcef6575b5be79d07
BLAKE2b-256 3b2f2f9a63ee2bc82fdfc6658c3101b70cd93570019e97817f5ea8e1600ab7e1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page