A package for loading, preprocessing, vizualising and synchronizing soccere event aand tracking data.
Project description
databallpy
A package for loading, preprocessing, vizualising and synchronizing soccere event and tracking data.
This package is developed to create a standardized way to analyse soccer matches using both event- and tracking data. Other packages, like kloppy and floodligth, already standardize the import of data sources. The current package goes a step further in combining different data streams from the same match. In this case, the Match
object combines information from the event and tracking data.
We are currently working on adding more data sources and on creating a Match.synchronise_tracking_and_event_data()
function to efficiently align all events with a timeframe in the tracking data. This would make it possible to get contextual information from the tracking data at the exact moment that the event is taking place.
Installation
$ pip install databallpy
Usage
The package is centered around the Match
object. A Match
has tracking data, event data metadata about the match.
$ from databallpy.match import get_match, get_open_match
$
$ match = get_match(
$ tracking_data_loc="data/tracking_data.dat",
$ tracking_metadata_loc="data/tracking_metadata.xml",
$ tracking_data_provider="tracab"
$ event_data_loc="data/event_data_f24.xml",
$ event_metadata_loc="data/event_metadata_f7.xml",
$ event_data_provider="opta",
$ )
$
$ # or to load an open metrica dataset of tracking and event data
$ match = get_open_match()
$
$ match.home_team_name # the team name of the home playing team
$ match.away_players # pandas dataframe with the names, ids, shirt numbers and positions of the away team
$ match.tracking_data # pandas dataframe with tracking data of the match
$ match.event_data # pandas dataframe with event data of the match
See the documentation of the Match
object for more options. Note that this package is developed to combine event and tracking data, therefore both datastreams are necessary to create a Match
object.
Visualizing
The packages also provides tools to visualise the data. Note that to save a match clip the package relies on the use of ffmpeg. Make sure to have installed it to your machine and added it to your python path, otherwise the save_match_clip()
function will produce an error.
$ from databallpy.match import get_match, get_open_match
$ from databallpy.visualize import save_match_clip
$
$ match = get_match(
$ tracking_data_loc="data/tracking_data.dat",
$ tracking_metadata_loc="data/tracking_metadata.xml",
$ tracking_data_provider="tracab"
$ event_data_loc="data/event_data_f24.xml",
$ event_metadata_loc="data/event_metadata_f7.xml",
$ event_data_provider="opta",
$ )
$
$ # or to load an open metrica dataset of tracking and event data
$ match = get_open_match()
$
$ save_match_clip(match, start_idx=0, end_idx=100, folder_loc="data", title="example")
This function will save a .mp4 file in "data/"
directory of the match.tracking_data
from index 0 untill index 100.
Documentation
The official documentation can be found here.
Providers
For now we limited providers. We are planning on adding more providers later on.
Event data providers:
- Opta
- Metrica
Tracking data providers:
- Tracab
- Metrica
Contributing
Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
License
databallpy
was created by Alexander Oonk & Daan Grob. It is licensed under the terms of the MIT license.
Similar projects
Although we think this package helps when starting to analyse soccer data, other packages may be better suited for your specific needs. Make sure to check out the following packages as well:
And for a more specific toturials on how to get started with soccer data"
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for databallpy-0.2.0a1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf6fec890c3e20c8408a29adb9b1c943ee48598f30e4ed5bd96cda75001ac681 |
|
MD5 | c79da3811b6951b5c46fd2b8391561a0 |
|
BLAKE2b-256 | 4f4d0e33702cb59fd64161738e93ce8f12c8e3cf1a61e35c608eb1f1634b9294 |