Scalable machine learning based time series forecasting

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Natural Language
- English
Programming Language

Project description

mlforecast

Install

PyPI

pip install mlforecast

If you want to perform distributed training, you can instead use pip install mlforecast[distributed], which will also install dask. Note that you’ll also need to install either LightGBM or XGBoost.

conda-forge

conda install -c conda-forge mlforecast

Note that this installation comes with the required dependencies for the local interface. If you want to perform distributed training, you must install dask (conda install -c conda-forge dask) and either LightGBM or XGBoost.

How to use

The following provides a very basic overview, for a more detailed description see the documentation.

Store your time series in a pandas dataframe with an index named unique_id that identifies each time serie, a column ds that contains the datestamps and a column y with the values.

from mlforecast.utils import generate_daily_series

series = generate_daily_series(20)
series.head()

	ds	y
unique_id
id_00	2000-01-01	0.264447
id_00	2000-01-02	1.284022
id_00	2000-01-03	2.462798
id_00	2000-01-04	3.035518
id_00	2000-01-05	4.043565

Then create a TimeSeries object with the features that you want to use. These include lags, transformations on the lags and date features. The lag transformations are defined as numba jitted functions that transform an array, if they have additional arguments you supply a tuple (transform_func, arg1, arg2, …).

from mlforecast.core import TimeSeries
from window_ops.expanding import expanding_mean
from window_ops.rolling import rolling_mean

ts = TimeSeries(
    lags=[7, 14],
    lag_transforms={
        1: [expanding_mean],
        7: [(rolling_mean, 7), (rolling_mean, 14)]
    },
    date_features=['dayofweek', 'month']
)
ts

TimeSeries(freq=<Day>, transforms=['lag-7', 'lag-14', 'expanding_mean_lag-1', 'rolling_mean_lag-7_window_size-7', 'rolling_mean_lag-7_window_size-14'], date_features=['dayofweek', 'month'], num_threads=1)

Next define a model. If you want to use the local interface this can be any regressor that follows the scikit-learn API. For distributed training there are LGBMForecast and XGBForecast.

from sklearn.ensemble import RandomForestRegressor

model = RandomForestRegressor(random_state=0)

Now instantiate your forecast object with the model and the time series. There are two types of forecasters, Forecast which is local and DistributedForecast which performs the whole process in a distributed way.

from mlforecast.forecast import Forecast

fcst = Forecast(model, ts)

To compute the features and train the model using them call .fit on your Forecast object.

fcst.fit(series)

Forecast(model=RandomForestRegressor(random_state=0), ts=TimeSeries(freq=<Day>, transforms=['lag-7', 'lag-14', 'expanding_mean_lag-1', 'rolling_mean_lag-7_window_size-7', 'rolling_mean_lag-7_window_size-14'], date_features=['dayofweek', 'month'], num_threads=1))

To get the forecasts for the next 14 days call .predict(14) on the forecaster. This will automatically handle the updates required by the features.

predictions = fcst.predict(14)
predictions.head()

	ds	y_pred
unique_id
id_00	2000-08-10	5.244840
id_00	2000-08-11	6.258609
id_00	2000-08-12	0.225484
id_00	2000-08-13	1.228957
id_00	2000-08-14	2.302455

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Natural Language
- English
Programming Language

Release history Release notifications | RSS feed

0.13.0

May 9, 2024

0.12.1

Apr 8, 2024

0.12.0

Mar 4, 2024

0.11.8

Feb 16, 2024

0.11.7

Feb 15, 2024

0.11.6

Jan 19, 2024

0.11.5

Jan 8, 2024

0.11.4

Jan 2, 2024

0.11.3

Dec 14, 2023

0.11.2

Dec 7, 2023

0.11.1

Nov 24, 2023

0.11.0

Nov 6, 2023

0.10.0

Oct 3, 2023

0.9.3

Sep 12, 2023

0.9.2

Aug 29, 2023

0.9.1

Aug 15, 2023

0.9.0

Aug 1, 2023

0.8.1

Jul 21, 2023

0.8.0

Jul 20, 2023

0.7.4

Jul 5, 2023

0.7.3

May 23, 2023

0.7.2

May 16, 2023

0.7.1

Apr 27, 2023

0.7.0

Apr 11, 2023

0.6.0

Feb 3, 2023

0.5.0

Jan 31, 2023

0.4.0

Nov 25, 2022

0.3.1

Nov 9, 2022

0.3.0

Nov 1, 2022

This version

0.2.0

Aug 10, 2022

0.1.0

Jun 24, 2021

0.0.9

Jun 9, 2021

0.0.8

May 31, 2021

0.0.7

May 31, 2021

0.0.6

May 8, 2021

0.0.5

May 7, 2021

0.0.4.1

May 4, 2021

0.0.4

May 3, 2021

0.0.3

Apr 30, 2021

0.0.2

Apr 27, 2021

0.0.1

Apr 27, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlforecast-0.2.0.tar.gz (23.4 kB view hashes)

Uploaded Aug 10, 2022 Source

Built Distribution

mlforecast-0.2.0-py3-none-any.whl (32.5 kB view hashes)

Uploaded Aug 10, 2022 Python 3

Hashes for mlforecast-0.2.0.tar.gz

Hashes for mlforecast-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`152f5c92bc8097e20bb879493a4c1f25b9932f155e3d81d5e1a8b5ada0c74092`
MD5	`7eac84da1b8f5e0225959549eaaf07d3`
BLAKE2b-256	`6ab9cfdad279aee4c8836cd55cd66b828b90658dc298df30fef7435b909b3bf0`

Hashes for mlforecast-0.2.0-py3-none-any.whl

Hashes for mlforecast-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ed6c7c6d8d0f9241ac8fbb565d51c1476fc8162278d2de634a289eb84163b366`
MD5	`14e47fb777ab298966e084eae4474869`
BLAKE2b-256	`1c386b6011acf5095eb70397658884737379c861f809f077364bcfb0254bdffc`