Several change point detection methods implemented in python.
Project description
Python Changepoint Detection (changepoynt)
Table of content:
This is the repository hosting the pip-installable python package changepoynt. It implements several change point detection techniques, while focusing mostly on "localized" algorithms, that could be run in an online fashion.
Current algorithms come from the field of:
-
Subspace Estimation (Extracting the characteristic signals)
-
Statistics (Detection of Change in the statistical properties)
-
Time Series Segmentation (Algorithms focused on comparing time series shape)
The package is aimed at execution performance (using JIT compilation and standing on the shoulders of giants like numpy and scipy) while also keeping code readable and maintainable. This includes comments as well as architectural choices. This might not be perfect, but we are trying!
All of our algorithms are implementations of a base changepoint detection interface and therefore are interchangeable. Currently, we are focused on shifting to the very common and existing sklearn interface of fit and transform. This enables our algorithms to be part of the standard sklearn pipeline for preprocessing.
Quick Start
If you want to start using the package right away, we recommend to use one of the singular spectrum transformation algorithms (SST). The first step is to install the package using pip. Then you can use the following example code:
import numpy as np # for signal creation
import matplotlib.pyplot as plt # to show the plot
from changepoynt.algorithms.esst import ESST # import the scoring algorithm
from changepoynt.visualization.score_plotting import plot_data_and_score # import a visualization function
# simulate a signal that goes from exponential decline into a sine wave
# the signals is only for demonstration purposes and can be replaced by your signal
steady_before = np.ones(200)
exp_signal = np.exp(-np.linspace(0, 5, 200))
steady_after = np.exp(-5)*np.ones(150)
sine_after = 0.2*np.sin(np.linspace(0, 3*np.pi*10, 300))
signal = np.concatenate((steady_before, exp_signal, steady_after, sine_after))
signal += 0.01*np.random.randn(signal.shape[0]) # add some minor noise
# This part is all you need to do to score a signal with our package
# create the scorer and compute the change score
detector = ESST(30)
detection = detector.transform(signal)
# make the plot using the built-in function of the package
plot_data_and_score(signal, detection)
plt.show()
The result looks like this:
Examples
You can find example code within the examples folder of this repository. We also wanted to tease the functionality using two different signals in order to show the capabilities of one of our recommended algorithms ESST. If you want to use the algorithms on the contents of a CSV directly, there is a frontend demonstrator currently hosted here (the adress is https://demo.changepoynt.de/, the code for the demonstrator is here).
The first application is a simulated temperature of a component in a power plant during shutdown. We artificially added a disturbance at the end of the shutdown, to show the capability of the algorithm to detect a change even in case of another major change.
The other application is for anomaly detection within periodic signals. The example is time series 34 from the Hexagon ML/UCR Time Series Anomaly Detection dataset, where we set the window size for the ESST to three times the estimated period in samples (estimated using maximum of FFT).
Both plots have been created using changepoynt.algorithms.esst
and the plot function from
changepoynt.visualization.score_plotting
.
Installation
You can install changepoynt
from the common package index PyPi using the
following line with pip:
pip install changepoynt
Please be aware, that we are currently in an alpha development phase, as this is part of a research project at the FAU Erlangen together with SIEMENS Energy developed by me. Nevertheless, we aim to be open-source and try our best to guarantee that all the code we use has very permissive licenses.
Algorithms
We are actively working on the package. Therefore, some algorithms are already available, while others are currently under development. An overview with sources can be seen here:
Algorithm | Source | Status |
---|---|---|
SST | Idé | Stable :heavy_check_mark: |
IKA-SST | Idé | Stable :heavy_check_mark: |
RSST | Weber | Experimental |
RuLSIF | Liu et al. | Stable :heavy_check_mark: |
uLSIF | Liu et al. | Stable :heavy_check_mark: |
KLIEP | Liu et al. | Planned |
ClaSP | Ermshaus et al. | Deactivated :x: |
FLUSS | Gharghabi et al. | Stable :heavy_check_mark: |
FLOSS | Gharghabi et al. | Stable :heavy_check_mark: |
BOCPD | Adams et al. | Planned |
Contributing
We always love to get feedback or new ideas. If you have any of those, feel free to open an issue. We try to get back to you as soon as we can.
If you are an author of a paper in the field or have another algorithmic idea: Feel free to open a pull request. Currently, we are still working on the contribution guides. But if somebody already comes along and has an idea, we do not want to be in the way!
Outlook
We are actively working on the package, and currently have the following steps planned:
- We are actively working on a benchmark tool for change point algorithms
- Implement Bayesion online change point detection
If you have further ideas, do not hesitate to open a ticket or a pull request!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for changepoynt-0.0.8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8dce345a0849678a0c4b21bd737e1355b178afe355e9cb2f31f3d052ccf9d6a9 |
|
MD5 | 4cdfc5dbfd1baf99d67a1dcb7df986c7 |
|
BLAKE2b-256 | 207f4742bf7bad0b40fd4c1f839cb2d907155d3a705def7d8506d73909ecf0f7 |