Time Series Description

These details have not been verified by PyPI

Project links

Homepage

Project description

TSDSS 📊 🔮 📈

TSDSS is a comprehensive Python package for time series analysis and surrogate data generation. It provides a wide range of tools for statistical analysis, preprocessing, feature extraction, and surrogate data generation for both univariate and multivariate time series.

Features

Time Series Analysis

Basic statistics (mean, std, skewness, kurtosis)
Stationarity tests (ADF test, Ljung-Box test)
Correlation analysis (Pearson, Spearman, Kendall)
Spectral analysis
Nonlinear analysis (Lyapunov exponent, phase space reconstruction)
Entropy measures

Time Series Preprocessing

Missing value interpolation
Outlier detection
Normalization
Resampling
Feature extraction

Surrogate Data Generation

IAAFT (Iterative Amplitude Adjusted Fourier Transform)
IAAFT+ (Enhanced IAAFT)
IPFT (Iterative Phase-adjusted Fourier Transform)
AIAAFT (Adaptive IAAFT)
IAAWT (Iterative Amplitude Adjusted Wavelet Transform)
Multivariate surrogate methods
Bootstrap methods

Installation

pip install tsdss

Input Data Format

TSDSS accepts the following input formats:

NumPy arrays (1D for univariate, 2D for multivariate)
Pandas Series (for univariate)
Pandas DataFrame (for multivariate)

Example shapes:

Univariate: (n_samples,) or (n_samples, 1)
Multivariate: (n_samples, n_features)

Quick Start Examples

Basic Statistics and Analysis

import numpy as np
import pandas as pd
from tsdss  import ts_statistics, plot_decomposition, calculate_entropy

# Basic time series statistics
ts = np.random.normal(0, 1, 1000)
stats = ts_statistics(ts)
print(stats)

# Plot time series decomposition
plot_decomposition(ts)

# Calculate entropy
entropy = calculate_entropy(ts)
print(f"Entropy: {entropy}")

Time Series Preprocessing

from tsdss import interpolate_missing, detect_outliers, normalize_ts, resample_ts

# Handle missing values
ts = pd.Series([1, np.nan, 3, np.nan, 5])
ts_clean = interpolate_missing(ts, method='linear')  # Options: linear, ffill, bfill, cubic, spline

# Detect outliers
ts = np.random.normal(0, 1, 1000)
outliers = detect_outliers(ts, method='zscore', threshold=3)  # Options: zscore, iqr, mad

# Normalize data
ts_norm = normalize_ts(ts, method='zscore')  # Options: zscore, minmax, robust

# Resample time series (requires datetime index)
dates = pd.date_range('2023-01-01', periods=100, freq='D')
ts = pd.Series(np.random.randn(100), index=dates)
ts_resampled = resample_ts(ts, freq='W', method='mean')

Feature Extraction

from tsdss import extract_time_features, extract_freq_features

# Extract time domain features
ts = np.random.normal(0, 1, 1000)
time_features = extract_time_features(ts)
print("Time domain features:", time_features)

# Extract frequency domain features
freq_features = extract_freq_features(ts)
print("Frequency domain features:", freq_features)

Correlation Analysis

from tsdss import mutual_information, kendall_correlation

# Calculate mutual information
x = np.random.normal(0, 1, 1000)
y = 0.5 * x + np.random.normal(0, 1, 1000)
mi = mutual_information(x, y)
print(f"Mutual Information: {mi}")

# Calculate Kendall correlation
kendall = kendall_correlation(x, y)
print(f"Kendall Correlation: {kendall}")

Surrogate Data Generation

from tsdss  import (
    iaaft, iaaft_plus, ipft, aiaaft, 
    multivariate_iaaft, block_bootstrap, 
    stationary_bootstrap
)

# Generate univariate surrogate data
ts = np.random.normal(0, 1, 1000)

# IAAFT method
surrogate_iaaft = iaaft(ts, n_iterations=1000, num_surrogates=1)[0]

# IAAFT+ method
surrogate_iaaft_plus = iaaft_plus(ts, n_iterations=1000, num_surrogates=1)[0]

# IPFT method
surrogate_ipft = ipft(ts, n_iterations=1000, num_surrogates=1)[0]

# Generate multivariate surrogate data
data = np.random.normal(0, 1, (1000, 3))  # 3-dimensional time series
mv_surrogate = multivariate_iaaft(data, max_iter=100, num_surrogates=1)[0]

# Bootstrap methods
block_samples = block_bootstrap(ts, block_length=50, num_bootstrap=100)
stat_samples = stationary_bootstrap(ts, mean_block_length=50, num_bootstrap=100)

Wavelet Analysis

from tsdss import dwt, idwt, iaawt

# Perform discrete wavelet transform
ts = np.random.normal(0, 1, 1024)  # Length should be power of 2
coeffs = dwt(ts, level=3)

# Perform inverse wavelet transform
reconstructed = idwt(coeffs)

# Generate wavelet-based surrogate
surrogate = iaawt(ts, n_iterations=1000, num_surrogates=1)[0]

Advanced Multivariate Analysis

from tsdss import (
    mvts_surrogate_s_transform, 
    mvts_surrogate_wavelet,
    mvts_surrogate_pca,
    copula_surrogate
)

# Generate multivariate data
data = np.random.normal(0, 1, (1000, 5))

# Different multivariate surrogate methods
surrogate_st = mvts_surrogate_s_transform(data, num_surrogates=1)[0]
surrogate_wavelet = mvts_surrogate_wavelet(data, num_surrogates=1)[0]
surrogate_pca = mvts_surrogate_pca(data, num_surrogates=1)[0]
surrogate_copula = copula_surrogate(data, num_surrogates=1)[0]

Bootstrap Methods

from tsdss import block_bootstrap, stationary_bootstrap

# 1. Block Bootstrap
# Fixed block length, suitable for data with strong local dependencies
ts = np.random.normal(0, 1, 1000)
block_samples = block_bootstrap(
    data=ts, 
    block_length=50,  # Fixed block length
    num_bootstrap=100
)

# 2. Stationary Bootstrap
# Random block length (geometric distribution), preserves stationarity
stat_samples = stationary_bootstrap(
    data=ts, 
    mean_block_length=50,  # Average block length
    num_bootstrap=100
)

# Compare the two methods
print("Block Bootstrap first sample:", block_samples[0][:10])
print("Stationary Bootstrap first sample:", stat_samples[0][:10])

# Using with pandas Series
ts_series = pd.Series(ts)
block_samples_pd = block_bootstrap(ts_series, block_length=50, num_bootstrap=100)
stat_samples_pd = stationary_bootstrap(ts_series, mean_block_length=50, num_bootstrap=100)

# Key differences:
# 1. Block Bootstrap: Uses fixed block length
# 2. Stationary Bootstrap: Uses random block length (geometric distribution)
#    - Better preserves stationarity
#    - More suitable for time series with varying dependence structures

Performance

The package uses optimized C++ implementations for core computations:

Trend decomposition
Skewness and kurtosis calculation
ACF computation
Ljung-Box test

Requirements

Python >= 3.7
NumPy >= 1.19.0
Pandas >= 1.0.0
SciPy >= 1.6.0
Statsmodels >= 0.13.0
Scikit-learn >= 0.24.0
Matplotlib >= 3.0.0

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2.0

Nov 21, 2024

This version

0.1.0

Nov 20, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tsdss-0.1.0.tar.gz (24.4 kB view details)

Uploaded Nov 20, 2024 Source

Built Distribution

tsdss-0.1.0-cp39-cp39-macosx_10_9_universal2.whl (56.5 kB view details)

Uploaded Nov 20, 2024 CPython 3.9 macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file tsdss-0.1.0.tar.gz.

File metadata

Download URL: tsdss-0.1.0.tar.gz
Upload date: Nov 20, 2024
Size: 24.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.6

File hashes

Hashes for tsdss-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`be9d0b5aefc5dca6620c66b80685b0b604e675e0d00a38bc0bad3a4170847913`
MD5	`5b9d0a6fafdf680e3ee82596848d4b02`
BLAKE2b-256	`5d040cc8dc1d19198116006dbc5eee635de0ce04d189f6cbcae0c65d25f03f7c`

See more details on using hashes here.

File details

Details for the file tsdss-0.1.0-cp39-cp39-macosx_10_9_universal2.whl.

File metadata

Download URL: tsdss-0.1.0-cp39-cp39-macosx_10_9_universal2.whl
Upload date: Nov 20, 2024
Size: 56.5 kB
Tags: CPython 3.9, macOS 10.9+ universal2 (ARM64, x86-64)
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.6

File hashes

Hashes for tsdss-0.1.0-cp39-cp39-macosx_10_9_universal2.whl
Algorithm	Hash digest
SHA256	`4f48a8b9c1af1df584ecde4f6b89cc53ae74eb77a9d5d852c8d0c0a95ad2c38f`
MD5	`dd85cfb3827caff81ddf4de22fbcab06`
BLAKE2b-256	`3b709ea7f7f8526f743c62f2da37f4e79dfe0eee29848c69268f01b76cc5a77c`