tsflex

Toolkit for flexible operations on time-series data

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

tsflex stands for: flexible time-series operations

It is a time-series first toolkit for processing & feature extraction, making few assumptions about input data.

example notebooks

Installation
Usage
- Series processing
- Feature extraction
Documentation

Installation

:WIP: - not yet published to pypi

pip install tsflex

Advantages of tsflex

tsflex has multiple selling points, for example

todo: create links to example benchmarking notebooks

it is efficient
- execution time -> multiprocessing / vectorized
- memory -> view based operations
it is flexible:
feature extraction:
- multiple series, signal & stride combinations are possible
- no frequency requirements, just a datetime index
it has logging capabilities to improve feature extraction speed.
it is field & unit tested
it has a comprehensive documentation
it is compatible with sklearn (w.i.p. for gridsearch integration), pandas and numpy

Usage

Series processing

import pandas as pd
import scipy.stats
import numpy as np

from tsflex.processing import SeriesProcessor, SeriesPipeline

Feature extraction

The only data assumptions made by tsflex are:

the data has a pd.DatetimeIndex & this index is monotonically_increasing
the data's series names must be unique

import pandas as pd
import scipy.stats
import numpy as np

from tsflex.features import FeatureDescriptor, FeatureCollection

# 1. Construct the collection in which you add all your features
fc = FeatureCollection(
    feature_descriptors=[
        FeatureDescriptor(
            function=scipy.stats.skew,
            series_name="myseries",
            window="1day",
            stride="6hours"
        )
    ]
)
# -- 1.1 Add another feature to the feature collection
fc.add(FeatureDescriptor(np.min, 'myseries', '2days', '1day'))

# 2. Get your time-indexed data
data = pd.Series(
    data=np.random.random(10_000), 
    index=pd.date_range("2021-07-01", freq="1h", periods=10_000),
).rename('myseries')
# -- 2.1 drop some data, as we don't make frequency assumptions
data = data.drop(np.random.choice(data.index, 200, replace=False))

# 3. Calculate the feature on some data
fc.calculate(data=data, n_jobs=1, return_df=True)
# which outputs: a pd.DataFrame with content:

index	myseries__skew__w=1D_s=12h	myseries__amin__w=2D_s=1D
2021-07-02 00:00:00	-0.0607221	nan
2021-07-02 12:00:00	-0.142407	nan
2021-07-03 00:00:00	-0.283447	0.042413
2021-07-03 12:00:00	-0.353314	nan
2021-07-04 00:00:00	-0.188953	0.0011865
2021-07-04 12:00:00	0.259685	nan
2021-07-05 00:00:00	0.726858	0.0011865
...	...	...

Documentation

:WIP:

Too see the documentation locally, install pdoc and execute the succeeding command from this folder location.

pdoc3 --template-dir docs/pdoc_template/ --http :8181 tsflex

👤 Jonas Van Der Donckt, Jeroen Van Der Donckt, Emiel Deprost

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.4.0

Apr 4, 2024

0.3.0

Oct 11, 2022

0.3rc0 pre-release

Aug 20, 2022

0.2.3.7.7

Jun 1, 2022

0.2.3.7.6

Apr 21, 2022

0.2.3.7.5

Apr 21, 2022

0.2.3.7.4

Apr 20, 2022

0.2.3.7.3

Apr 20, 2022

0.2.3.7.2

Apr 20, 2022

0.2.3.7.1

Apr 20, 2022

0.2.3.7

Apr 17, 2022

0.2.3.6

Feb 8, 2022

0.2.3.5

Feb 8, 2022

0.2.3.4

Jan 24, 2022

0.2.3.3

Dec 24, 2021

0.2.3.2

Dec 13, 2021

0.2.3.1

Dec 2, 2021

0.2.3

Nov 16, 2021

0.2.2

Nov 12, 2021

0.2.1

Nov 12, 2021

0.2.0

Nov 12, 2021

0.1.2.4

Oct 29, 2021

0.1.2.3

Oct 15, 2021

0.1.2.2

Sep 13, 2021

0.1.2.1

Aug 28, 2021

0.1.2.0

Jul 28, 2021

0.1.1.9

Jul 17, 2021

0.1.1.8

Jul 2, 2021

0.1.1.7

Jul 2, 2021

0.1.1.6

Jun 24, 2021

0.1.1.5

Jun 22, 2021

0.1.1.4

Jun 22, 2021

0.1.1.3

Jun 19, 2021

0.1.1.2

Jun 18, 2021

0.1.1.1

Jun 18, 2021

0.1.1

Jun 18, 2021

This version

0.1.0

Jun 18, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tsflex-0.1.0.tar.gz (31.1 kB view hashes)

Uploaded Jun 18, 2021 Source

Built Distribution

tsflex-0.1.0-py3-none-any.whl (40.0 kB view hashes)

Uploaded Jun 18, 2021 Python 3

Hashes for tsflex-0.1.0.tar.gz

Hashes for tsflex-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`ee951ba0b7e628a1dba475076f310a90c5f9ac274dc102399b8b164c9a43fc40`
MD5	`6b4ba002c1a05f9b590b1f8e661edd45`
BLAKE2b-256	`f50feedf0c1df22a5f7325b3e934f94b7fc00bc828172f7bbacd8fcc80ce9a8c`

Hashes for tsflex-0.1.0-py3-none-any.whl

Hashes for tsflex-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`33cbd28cd6bfd55da678020ee694daa966aaec1634e5554b610d9d46ac28dffc`
MD5	`8303d1a56efa1e6cb254d122073682ac`
BLAKE2b-256	`2a75de5152742835553c7f34fdb771e361e384df3ebf28d652b1c60940067e86`