Skip to main content

EBES: Easy Benchmarking for Event Sequences.

Project description

EBES Easy Benchmarking for Event Sequences.

arXiv Docs

🎉 Accepted at KDD 2025!

Our paper "EBES: Easy Benchmarking for Event Sequences" has been accepted to the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '25).

@inproceedings{osin2025ebes,
  title={EBES: Easy Benchmarking for Event Sequences},
  author={Osin, Dmitry and Udovichenko, Igor and Shvetsov, Egor and Moskvoretskii, Viktor and Burnaev, Evgeny},
  booktitle={Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2},
  pages={5730--5741},
  year={2025}
}

EBES is an easy-to-use development and application toolkit for Event Sequence(EvS) Assesment, with key features in configurability, compatibility and reproducibility. We hope this project could benefit both researchers and practitioners with the goal of easily customized development and open benchmarking in EvS.

Setup

Installation

To install the latest stable version:

pip install ebes

Datasets

Dataset Source Link Preprocessing Script Link Download Instructions
Physionet2012 Physionet2012 physionet2012.py Straightforward download on site
MIMIC-III MIMIC-III mimic-3.py Only credentialed users who sign the DUA can access the files.
Age Age age.py Download here if you have difficulties navigating site
Retail Retail x5-retail.py Download here if you have difficulties navigating site
MBD MBD mbd.py Straightforward download on site
Taobao Taobao taobao.py Need to login on site to download. After that pass tianchi_mobile_recommend_train_user.csv into script
BPI17 BPI17 bpi_17.py Straightforward download on site
ArabicDigits ArabicDigits SpokenArabicDigits.py Either just run preprocessing script and it will download automatically, or straightforward download on site
ElectricDevices ElectricDevices electric_devices.py Straightforward download on site
Pendulum We created it ourselves pendulum.py Run preprocessing script in order to generate from scratch. Make sure to keep default seed=0 in order to get exactly same dataset.

Usage

python main -d age -m gru -e correlation -s best

Results:

image

Performance of various models as a function of number of sequences. Metrics from Table 1 are reported. Number of sequences is presented in log scale. Standard deviation across 3 runs is depicted as vertical lines.

Performance metric relationships and correlations of different subsets among all methods on PhysioNet2012 are presented. We do not observe a correlation between the test metric and train-val on PhysioNet2012, as seen in the right upper corner. For the Taobao dataset, we do not observe a clear linear trend between hpo-val and the test metric suggesting the presence of distribution shift.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ebes-0.0.9.tar.gz (62.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ebes-0.0.9-py3-none-any.whl (77.0 kB view details)

Uploaded Python 3

File details

Details for the file ebes-0.0.9.tar.gz.

File metadata

  • Download URL: ebes-0.0.9.tar.gz
  • Upload date:
  • Size: 62.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for ebes-0.0.9.tar.gz
Algorithm Hash digest
SHA256 ab67a092ff52c2fddf4ca75211e13f79007c48acad668ebd396438df2088ea91
MD5 850dbd13d019c5283063d08ace6f2e52
BLAKE2b-256 c0b5bb38cfaa112cee64ec9d17709003d23a1660b4295921a783c7f4963d6d7c

See more details on using hashes here.

File details

Details for the file ebes-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: ebes-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 77.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for ebes-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 ce9c5c6769f332a8433fb02d1f1b9b6e141e90e51fcc1d09a06fdfbbc0d0c289
MD5 d11290e08c7b77692ef1a7a347564582
BLAKE2b-256 5ca3e82f437e53c5b740b58df9d7dd0cfae45817ee434119c101a694f890793d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page