Skip to main content

A Python interface for various NFL data sources

Project description

nfl-data-loader

nfl-data-loader is a robust Python package for accessing, merging, and transforming NFL sports data from multiple sources in an object-oriented, pipeline-friendly way. It provides a unified API for teams, players, events, and venues, with rich schema documentation and ready-to-use data loaders, transformers, and formatters.

TheEdgePredictor Services

Upstream

  • espn-api-orm (PyPi Package)
    • Team Data trigger

    • Venue Data trigger

    • Event Data trigger

    • odds-data-pump (TBD)

    • Fantasy Data trigger

  • nfl-madden-data (Manual Yearly Trigger)
  • ELO-Rating Data trigger

Downstream

  • Feature Store Data trigger
  • nfl-model-store (TBD)
  • nfl-madden-data (Manual Yearly Trigger)

๐Ÿš€ Features

  • Unified access to NFL data from multiple sources (nflverse, Madden, Pro Football Reference, ESPN API)
  • Modular loader, transform, and formatter architecture
  • Pydantic schemas for all endpoints and data objects
  • Grouped by: team, player, event, venue
  • Easy integration into ML/data pipelines
  • Auto-deploy to PyPI on merge to main (see CI/CD)

๐Ÿ“ฆ Data Sources

๐Ÿ—๏ธ Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”   โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”   โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Sources   โ”‚โ†’โ†’ โ”‚ Formatters โ”‚โ†’โ†’ โ”‚  Workflows โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
      โ”‚               โ”‚                โ”‚
   [team, player, event, venue]  [Extract, Transform, Load Blocks and Pipelines]

๐Ÿงฉ Loader Groups

  • Team: Team stats, rosters, schedules, advanced metrics
  • Player: Player stats, depth charts, Madden ratings, combine data
  • Event: Game-level stats, play-by-play, Vegas lines, EPA, fantasy
  • Venue: Stadium/venue metadata and mappings

๐Ÿ“‘ Schemas & Data Dictionary

All endpoints return data validated by Pydantic schemas. See /schemas for:

  • Field names, types, and descriptions (see schemas/events/features.py, schemas/players/madden.py, schemas/players/position.py)
  • Example: Player Madden schema, event feature schema, position mappers

๐Ÿ› ๏ธ Usage Example

from nfl_data_loader.api.sources.players.rosters.rosters import collect_roster
from nfl_data_loader.workflows.transforms.players.player import make_player_stats

# Load player roster for a season
roster_df = collect_roster(2024)

# Transform player stats
player_stats = make_player_stats(2024, week=1, position_group='quarterback')

๐Ÿ“ˆ Data Flow Example

  1. Load: Pull raw data from any source (e.g., ESPN, nflverse) or pull cached feature store (Event, Player)
  2. Transform: Apply rolling averages, imputation, feature engineering
  3. Format: Output as DataFrame, JSON, or for ML pipelines

๐Ÿงช Testing & Validation

  • All endpoints are covered by tests (see /tests)
  • Schemas are validated with Pydantic
  • Linting and type checking enabled

๐Ÿค– CI/CD & PyPI Deployment

  • GitHub Actions workflow auto-deploys to PyPI on merge to main
  • See .github/workflows/publish.yml for details

๐Ÿค Contributing

Pull requests welcome! Please add/expand tests and update schema docs for new endpoints.


For a full list of available endpoints, schemas, and data fields, see the /schemas and /api directories. For questions or issues, open a GitHub issue.

Publishing

Auto publish available through GitHub Actions and Pypi

Local publishing (include token in rc)

  1. python -m build

  2. twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nfl_data_loader-0.0.3.tar.gz (67.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nfl_data_loader-0.0.3-py3-none-any.whl (88.7 kB view details)

Uploaded Python 3

File details

Details for the file nfl_data_loader-0.0.3.tar.gz.

File metadata

  • Download URL: nfl_data_loader-0.0.3.tar.gz
  • Upload date:
  • Size: 67.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.5

File hashes

Hashes for nfl_data_loader-0.0.3.tar.gz
Algorithm Hash digest
SHA256 5772dcd99c19e85405a8c695c54c168099895dcf7bd8c2c564daa72c88690f6b
MD5 679598924449218def38c2d6ffe0ed8b
BLAKE2b-256 fa53e91c9efdc32f427ba637ef466628733edd943a38fb28cc0895900378ddc2

See more details on using hashes here.

File details

Details for the file nfl_data_loader-0.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for nfl_data_loader-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 bc519d6bdcdaa477c1f88c7ded6f0932a385da5295852a3524ca5c0601bd55e5
MD5 e8b385a7aa751ee9691e60a769213f09
BLAKE2b-256 0274a51ce25a188d7e7969adef623f8681fa86f45a337c041b777f09b8647f24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page