The practitioner's time series forecasting library

Scalecast

About

Scalecast helps you forecast time series. Here is how to initialize its main object:

from scalecast.Forecaster import Forecaster

f = Forecaster(
    y = array_of_values,
    current_dates = array_of_dates,
    future_dates=fcst_horizon_length,
    test_length = 0, # do you want to test all models? if so, on how many or what percent of observations?
    cis = False, # evaluate conformal confidence intervals for all models?
    metrics = ['rmse','mape','mae','r2'], # what metrics to evaluate over the validation/test sets?
)
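
The `metrics` argument above names four standard error measures. As a rough illustration of what each one computes, here is a minimal plain-Python sketch. It is not scalecast's implementation: the `evaluate` function and the sample numbers are hypothetical, and MAPE is returned as a fraction rather than a percentage.

```python
import math

def evaluate(actuals, preds):
    """Return RMSE, MAPE, MAE, and R2 for two equal-length sequences."""
    n = len(actuals)
    errors = [a - p for a, p in zip(actuals, preds)]
    mean_actual = sum(actuals) / n
    ss_res = sum(e ** 2 for e in errors)                    # residual sum of squares
    ss_tot = sum((a - mean_actual) ** 2 for a in actuals)   # total sum of squares
    return {
        "rmse": math.sqrt(ss_res / n),
        # MAPE is undefined when any actual value is 0
        "mape": sum(abs(e / a) for e, a in zip(errors, actuals)) / n,
        "mae": sum(abs(e) for e in errors) / n,
        "r2": 1 - ss_res / ss_tot,
    }

# hypothetical test-set actuals and predictions
scores = evaluate([100, 110, 120, 130], [98, 112, 119, 135])
print(scores)
```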

Uniform ML modeling (with models from a diverse set of libraries, including scikit-learn, statsmodels, and tensorflow), reporting, and data visualizations are offered through the Forecaster and MVForecaster interfaces. Data storage and processing become easy because all applicable data, predictions, and many derived metrics are contained in a few objects, with much customization available through different modules. Feature requests and issue reports are welcome! Don't forget to leave a star!⭐

Documentation

Popular Features

Easy LSTM Modeling: setting up an LSTM model for time series using tensorflow is hard. Using scalecast, it's easy. Many tutorials and Kaggle notebooks designed for those getting to know the model use scalecast (see the article).

f.set_estimator('lstm')
f.manual_forecast(
    lags=36,
    batch_size=32,
    epochs=15,
    validation_split=.2,
    activation='tanh',
    optimizer='Adam',
    learning_rate=0.001,
    lstm_layer_sizes=(100,)*3,
    dropout=(0,)*3,
)

Auto lag, trend, and seasonality selection:

f.auto_Xvar_select( # iterate through different combinations of covariates
    estimator = 'lasso', # what estimator?
    alpha = .2, # estimator hyperparams?
    monitor = 'ValidationMetricValue', # what metric to monitor to make decisions?
    cross_validate = True, # cross validate
    cvkwargs = {'k':3}, # 3 folds
)
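
Lag selection presupposes that the series has been turned into a table of lagged values. The sketch below shows that idea in plain Python; `make_lagged` is a hypothetical helper written for illustration, not a scalecast function, and scalecast's own regressors may be laid out differently.

```python
def make_lagged(y, n_lags):
    """Turn a series into (X, target) pairs.

    Each row of X holds the n_lags values that precede the matching target.
    """
    X, target = [], []
    for t in range(n_lags, len(y)):
        # most recent lag first: y[t-1], y[t-2], ..., y[t-n_lags]
        X.append([y[t - k] for k in range(1, n_lags + 1)])
        target.append(y[t])
    return X, target

X, target = make_lagged([10, 12, 13, 15, 18], n_lags=2)
for row, value in zip(X, target):
    print(row, "->", value)
```

The first `n_lags` observations produce no rows because they lack a full history, which is one reason adding more lags leaves fewer observations to train on.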

Hyperparameter tuning using grid search and time series cross validation:

from scalecast import GridGenerator

GridGenerator.get_example_grids()
models = ['ridge','lasso','xgboost','lightgbm','knn']
f.tune_test_forecast(
    models,
    limit_grid_size = .2, # randomized grid search on 20% of original grid sizes
    feature_importance = True, # save pfi feature importance for each model?
    cross_validate = True, # cross validate? if False, using a separate validation set that the user can specify
    rolling = True, # rolling time series cross validation?
    k = 3, # how many folds?
)
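
To make the `rolling` and `k` arguments more concrete, here is one common way to build chronological folds, sketched in plain Python. The `time_series_folds` function and its `test_length` parameter are assumptions made for this illustration; scalecast may size and place its folds differently.

```python
def time_series_folds(n_obs, k, test_length, rolling=False):
    """Return k (train_indices, test_indices) pairs in chronological order.

    Every fold tests on the block right after its training data, so no fold
    trains on observations from its own future.
    """
    train_size = n_obs - k * test_length  # training size of the earliest fold
    if train_size <= 0:
        raise ValueError("not enough observations for the requested folds")
    folds = []
    for i in range(k):
        test_start = train_size + i * test_length
        # rolling: the training window slides forward and keeps a fixed length
        # otherwise: the training window expands from the first observation
        train_start = i * test_length if rolling else 0
        folds.append((range(train_start, test_start),
                      range(test_start, test_start + test_length)))
    return folds

rolling_folds = time_series_folds(n_obs=12, k=3, test_length=2, rolling=True)
expanding_folds = time_series_folds(n_obs=12, k=3, test_length=2, rolling=False)
for train, test in rolling_folds:
    print(list(train), list(test))
```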

Plotting results: plot test predictions, forecasts, fitted values, and more.

import matplotlib.pyplot as plt

fig, ax = plt.subplots(2,1, figsize = (12,6))
f.plot_test_set(models=models,order_by='TestSetRMSE',ax=ax[0])
f.plot(models=models,order_by='TestSetRMSE',ax=ax[1])
plt.show()

Pipelines that include transformations, reverting, and backtesting:

from scalecast import GridGenerator
from scalecast.Pipeline import Transformer, Reverter, Pipeline
from scalecast.util import find_optimal_transformation, backtest_metrics

def forecaster(f):
    models = ['ridge','lasso','xgboost','lightgbm','knn']
    f.tune_test_forecast(
        models,
        limit_grid_size = .2, # randomized grid search on 20% of original grid sizes
        feature_importance = True, # save pfi feature importance for each model?
        cross_validate = True, # cross validate? if False, using a separate validation set that the user can specify
        rolling = True, # rolling time series cross validation?
        k = 3, # how many folds?
    )

transformer, reverter = find_optimal_transformation(f) # just one of several ways to select transformations for your series

pipeline = Pipeline(
    steps = [
        ('Transform',transformer),
        ('Forecast',forecaster),
        ('Revert',reverter),
    ]
)

f = pipeline.fit_predict(f)
backtest_results = pipeline.backtest(f)
metrics = backtest_metrics(backtest_results)
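
The transform, forecast, revert sequence can be illustrated with first differencing, one of the simplest transformations. This is a plain-Python sketch under stated assumptions: `difference` and `revert` are hypothetical helpers, and the "model" just repeats the mean difference. It does not reproduce what `find_optimal_transformation` selects.

```python
def difference(y):
    """First-difference a series; also return the last level, needed to revert."""
    return [b - a for a, b in zip(y, y[1:])], y[-1]

def revert(diff_forecast, last_level):
    """Cumulatively add forecasted differences back onto the last observed level."""
    levels, level = [], last_level
    for d in diff_forecast:
        level += d
        levels.append(level)
    return levels

history = [100, 104, 109, 115]
diffs, last_level = difference(history)        # transform; diffs: [4, 5, 6]
diff_forecast = [sum(diffs) / len(diffs)] * 3  # forecast on the transformed scale
forecast = revert(diff_forecast, last_level)   # revert to the original scale
print(forecast)
```

Reverting needs state from the original series (here, the last observed level), which is why a transformer and its reverter are created as a pair.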

Model stacking: There are two ways to stack models with scalecast: with the StackingRegressor from scikit-learn, or with scalecast's own stacking procedure.

from scalecast.auxmodels import auto_arima

f.set_estimator('lstm')
f.manual_forecast(
    lags=36,
    batch_size=32,
    epochs=15,
    validation_split=.2,
    activation='tanh',
    optimizer='Adam',
    learning_rate=0.001,
    lstm_layer_sizes=(100,)*3,
    dropout=(0,)*3,
)

f.set_estimator('prophet')
f.manual_forecast()

auto_arima(f, call_me='arima') # name the model 'arima' so add_signals() below can find it

# stack previously evaluated models
f.add_signals(['lstm','prophet','arima'])
f.set_estimator('catboost')
f.manual_forecast()
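
The idea behind stacking is that a final model learns how to combine the predictions of earlier models. As a deliberately small stand-in for that final estimator, the sketch below fits a single least-squares weight that blends two sets of base predictions. The function names and numbers are hypothetical, and this is not how scalecast or catboost combines signals.

```python
def fit_blend_weight(actuals, preds_a, preds_b):
    """Least-squares weight w for the blend w * a + (1 - w) * b."""
    num = sum((y - b) * (a - b) for y, a, b in zip(actuals, preds_a, preds_b))
    den = sum((a - b) ** 2 for a, b in zip(preds_a, preds_b))
    if den == 0:
        raise ValueError("base models agree everywhere; there is nothing to blend")
    return num / den

def blend(w, preds_a, preds_b):
    """Apply the learned weight to two prediction sequences."""
    return [w * a + (1 - w) * b for a, b in zip(preds_a, preds_b)]

# hypothetical held-out actuals and two base models' predictions
actuals = [10, 20, 30]
preds_a = [12, 22, 32]  # consistently too high
preds_b = [6, 16, 26]   # consistently too low
w = fit_blend_weight(actuals, preds_a, preds_b)
stacked = blend(w, preds_a, preds_b)
print(w, stacked)
```

The weight should be fit on observations the base models did not train on; otherwise the blend rewards whichever model overfits most.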

Multivariate modeling and multivariate pipelines:

from scalecast.MVForecaster import MVForecaster
from scalecast.Pipeline import MVPipeline
from scalecast.util import find_optimal_transformation, backtest_metrics
from scalecast import GridGenerator

GridGenerator.get_mv_grids()

def mvforecaster(mvf):
    models = ['ridge','lasso','xgboost','lightgbm','knn']
    mvf.tune_test_forecast(
        models,
        limit_grid_size = .2, # randomized grid search on 20% of original grid sizes
        cross_validate = True, # cross validate? if False, using a separate validation set that the user can specify
        rolling = True, # rolling time series cross validation?
        k = 3, # how many folds?
    )

mvf = MVForecaster(f1,f2,f3) # can take N Forecaster objects

transformer1, reverter1 = find_optimal_transformation(f1)
transformer2, reverter2 = find_optimal_transformation(f2)
transformer3, reverter3 = find_optimal_transformation(f3)

pipeline = MVPipeline(
    steps = [
        ('Transform',[transformer1,transformer2,transformer3]),
        ('Forecast',mvforecaster),
        ('Revert',[reverter1,reverter2,reverter3])
    ]
)

f1, f2, f3 = pipeline.fit_predict(f1, f2, f3)
backtest_results = pipeline.backtest(f1, f2, f3)
metrics = backtest_metrics(backtest_results)
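
The comments on `limit_grid_size = .2` describe a randomized search over 20% of the original grid. A minimal standard-library sketch of that idea follows; `sample_grid`, its `seed` parameter, and the example grid are assumptions for illustration, not scalecast internals.

```python
import itertools
import random

def sample_grid(grid, limit_grid_size, seed=0):
    """Expand a dict of hyperparameter lists into every combination,
    then keep a random fraction of those combinations."""
    keys = list(grid)
    combos = [dict(zip(keys, values))
              for values in itertools.product(*grid.values())]
    n_keep = max(1, int(len(combos) * limit_grid_size))  # always keep at least one
    return random.Random(seed).sample(combos, n_keep)

# hypothetical grid with 5 * 2 = 10 combinations
grid = {"alpha": [0.1, 0.5, 1.0, 2.0, 5.0], "max_iter": [100, 500]}
candidates = sample_grid(grid, limit_grid_size=0.2)
print(candidates)
```

Sampling trades a small chance of missing the best combination for a large cut in tuning time, which matters when every candidate is refit on several cross-validation folds.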

Transfer Learning (new with 0.19.0): Train a model in one Forecaster object and use that model to make predictions on the data in a separate Forecaster object.

from scalecast.util import infer_apply_Xvar_selection

f = Forecaster(...)
f.auto_Xvar_select()
f.set_estimator('xgboost')
f.cross_validate()
f.auto_forecast()

f_new = Forecaster(...) # different series than f
f_new = infer_apply_Xvar_selection(infer_from=f,apply_to=f_new)
f_new.transfer_predict(transfer_from=f,model='xgboost') # transfers the xgboost model from f to f_new
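
Stripped to its core, transfer learning here means fitting parameters on one series and reusing them, unchanged, to predict another. The plain-Python sketch below does this with a one-coefficient autoregression. The helper names and both series are hypothetical, and the model is far simpler than the xgboost model transferred above.

```python
def fit_ar1(y):
    """Least-squares slope for y[t] ~ phi * y[t-1] (no intercept)."""
    num = sum(prev * cur for prev, cur in zip(y, y[1:]))
    den = sum(prev ** 2 for prev in y[:-1])
    return num / den

def predict_ar1(phi, last_value, steps):
    """Roll the fitted coefficient forward from the last observed value."""
    preds, value = [], last_value
    for _ in range(steps):
        value = phi * value
        preds.append(value)
    return preds

source_series = [1, 2, 4, 8, 16]   # series the model is trained on
target_series = [3, 6, 12]         # different series that borrows the model
phi = fit_ar1(source_series)
forecast = predict_ar1(phi, target_series[-1], steps=2)
print(phi, forecast)
```

The transfer works only to the extent that the two series share dynamics, and the target series must supply the same inputs (here, its latest value) that the model was trained on.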

Installation

Only the base package is needed to get started:
- pip install --upgrade scalecast
Optional add-ons:
- pip install tensorflow (for RNN/LSTM on Windows) or pip install tensorflow-macos (for Mac/M1)
- pip install darts
- pip install prophet
- pip install greykite (for the silverkite model)
- pip install kats (changepoint detection)
- pip install pmdarima (auto arima)
- pip install tqdm (progress bar for notebook)
- pip install ipython (widgets for notebook)
- pip install ipywidgets (widgets for notebook)
- jupyter nbextension enable --py widgetsnbextension (widgets for notebook)
- jupyter labextension install @jupyter-widgets/jupyterlab-manager (widgets for Lab)
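
To see which optional add-ons are already available in the current environment, a short standard-library check such as the following may help. It assumes each package's import name matches its pip name, which is why `ipython` (imported as `IPython`) is left out; `installed_addons` is a hypothetical helper, not part of scalecast.

```python
from importlib.util import find_spec

# assumption: these import names match the pip package names listed above
OPTIONAL_MODULES = [
    "tensorflow", "darts", "prophet", "greykite",
    "kats", "pmdarima", "tqdm", "ipywidgets",
]

def installed_addons(modules=OPTIONAL_MODULES):
    """Map each module name to True if it can be found, without importing it."""
    return {name: find_spec(name) is not None for name in modules}

for name, available in installed_addons().items():
    print(f"{name:<12} {'installed' if available else 'missing'}")
```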

Papers that use scalecast

Udemy Course

Scalecast: Machine Learning & Deep Learning

Blog posts and notebooks

Forecasting with Different Model Types

Sklearn Univariate
- Expand your Time Series Arsenal with These Models
- Notebook
Sklearn Multivariate
RNN
ARIMA
- Forecast with ARIMA in Python More Easily with Scalecast
- Notebook
Theta
- Easily Employ A Theta Model For Time Series
- Notebook
VECM
- Employ a VECM to predict FANG Stocks with an ML Framework
- Notebook
Stacking
- Stacking Time Series Models to Improve Accuracy
- Notebook
Other Notebooks

Transforming and Reverting

Confidence Intervals

Dynamic Validation

Model Input Selection

Scaled Forecasting on Many Series

Transfer Learning

Anomaly Detection

Contributing

Contributing.md
Want something that's not listed? Open an issue!

How to cite scalecast

@misc{scalecast,
  title = {{scalecast}},
  author = {Michael Keith},
  year = {2024},
  version = {<your version>},
  url = {https://scalecast.readthedocs.io/en/latest/},
}

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scalecast-0.19.10.tar.gz (1.2 MB)

Uploaded Oct 14, 2024 Source

Hashes for scalecast-0.19.10.tar.gz
Algorithm	Hash digest
SHA256	`2b182a9f8b3f4f423cecb9e8440c2e00a873e15d42c644223c0934083cf70c70`
MD5	`6973c91a6a8225eddc05b7791c35e002`
BLAKE2b-256	`168e52acf029a7454e66dd9af176a9c77a47c207299e25bd7f60de97788bae47`