A python library for the Bayesian dynamic linear model for time series modeling

These details have not been verified by PyPI

Project links

Homepage

Project description

PyDLM

Welcome to pydlm, a flexible time series modeling library for python. This library is based on the Bayesian dynamic linear model (Harrison and West, 1999) and optimized for fast model fitting and inference.

Updates

Version 0.1.1.13 has been released on PyPI.
- Migrated all the unnecessary print() to the default python logging operations, such as logging.info, logging.warning and logging.critical.
- Users can now set the model logging level to suppress unnecessary information during model run.
- Example to only print warning information:
```
    ...
    my_model.setLoggingLevel('WARNING')
    my_model.fit()
```
- Updated the package version manage through pip-compile and requirements.txt.

Installation

You can get the package (current version 0.1.1.11) from pypi by

  $ pip install pydlm

You can also get the latest from github

  $ git clone git@github.com:wwrechard/pydlm.git pydlm
  $ cd pydlm
  $ pip install pip-tools
  $ pip install -r requirements.txt
  $ pip install -e . --no-deps

pydlm depends on the following modules,

numpy (for core functionality)
matplotlib (for plotting results)
Sphinx (for generating documentation)
unittest (for testing)

Google data science post example

We use the example from the Google data science post as an example to show how pydlm could be used to analyze the real world data. The code and data is placed under examples/unemployment_insurance/.... The dataset contains weekly counts of initial claims for unemployment during 2004 - 2012 and is available from the R package bsts (which is a popular R package for time series modeling). The raw data is shown below (left)

We see strong annual pattern and some local trend from the data.

A simple model

Following the Google's post, we first build a simple model with only local linear trend and seasonality component.

from pydlm import dlm, trend, seasonality
# A linear trend
linear_trend = trend(degree=1, discount=0.95, name='linear_trend', w=10)
# A seasonality
seasonal52 = seasonality(period=52, discount=0.99, name='seasonal52', w=10)
# Build a simple dlm
simple_dlm = dlm(time_series) + linear_trend + seasonal52

In the actual code, the time series data is scored in the variable time_series. degree=1 indicates the trend is linear (2 stands for quadratic) and period=52 means the seasonality has a periodicy of 52. Since the seasonality is generally more stable, we set its discount factor to 0.99. For local linear trend, we use 0.95 to allow for some flexibility. w=10 is the prior guess on the variance of each component, the larger number the more uncertain. For actual meaning of these parameters, please refer to the user manual. After the model built, we can fit the model and plot the result (shown above, right figure)

# Fit the model
simple_dlm.fit()
# Plot the fitted results
simple_dlm.turnOff('data points')
simple_dlm.plot()

The blue curve is the forward filtering result, the green curve is the one-day ahead prediction and the red curve is the backward smoothed result. The light-colored ribbon around the curve is the confidence interval (you might need to zoom-in to see it). The one-day ahead prediction shows this simple model captures the time series somewhat good but loses accuracy around the peak crisis at Week 280 (which is between year 2008 - 2009). The one-day-ahead mean squared prediction error is 0.173 which can be obtained by calling

simple_dlm.getMSE()

We can decompose the time series into each of its components

# Plot each component (attribute the time series to each component)
simple_dlm.turnOff('predict plot')
simple_dlm.turnOff('filtered plot')
simple_dlm.plot('linear_trend')
simple_dlm.plot('seasonal52')

Most of the time series shape is attributed to the local linear trend and the strong seasonality pattern is easily seen. To further verify the performance, we use this simple model for long-term forecasting. In particular, we use the previous 351 week's data to forecast the next 200 weeks and the previous 251 week's data to forecast the next 200 weeks. We lay the predicted results on top of the real data

# Plot the prediction give the first 351 weeks and forcast the next 200 weeks.
simple_dlm.plotPredictN(date=350, N=200)
# Plot the prediction give the first 251 weeks and forcast the next 200 weeks.
simple_dlm.plotPredictN(date=250, N=200)

From the figure we see that after the crisis peak around 2008 - 2009 (Week 280), the simple model can accurately forecast the next 200 weeks (left figure) given the first 351 weeks. However, the model fails to capture the change near the peak if the forecasting start before Week 280 (right figure).

Dynamic linear regression

Now we build a more sophiscated model with extra variables in the data file. The extra variables are stored in the variable `features` in the actual code. To build the dynamic linear regression model, we simply add a new component

# Build a dynamic regression model
from pydlm import dynamic
regressor10 = dynamic(features=features, discount=1.0, name='regressor10', w=10)
drm = dlm(time_series) + linear_trend + seasonal52 + regressor10
drm.fit()
drm.getMSE()

# Plot the fitted results
drm.turnOff('data points')
drm.plot()

dynamic is the component for modeling dynamically changing predictors, which accepts features as its argument. The above code plots the fitted result (top left).

The one-day ahead prediction looks much better than the simple model, particularly around the crisis peak. The mean prediction error is 0.099 which is a 100% improvement over the simple model. Similarly, we also decompose the time series into the three components

drm.turnOff('predict plot')
drm.turnOff('filtered plot')
drm.plot('linear_trend')
drm.plot('seasonal52')
drm.plot('regressor10')

This time, the shape of the time series is mostly attributed to the regressor and the linear trend looks more linear. If we do long-term forecasting again, i.e., use the previous 301 week's data to forecast the next 150 weeks and the previous 251 week's data to forecast the next 200 weeks

drm.plotPredictN(date=300, N=150)
drm.plotPredictN(date=250, N=200)

The results look much better compared to the simple model

Documentation

Detailed documentation is provided in PyDLM with special attention to the User manual.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1.1.13

Aug 13, 2024

0.1.1.12

May 7, 2023

0.1.1.11

Dec 19, 2018

0.1.1.10

Feb 11, 2018

0.1.1.9

Aug 30, 2017

0.1.1.8

May 15, 2017

0.1.1.7

Nov 11, 2016

0.1.1.6

Nov 11, 2016

0.1.1.5

Nov 8, 2016

0.1.1.4

Nov 7, 2016

0.1.1.3

Nov 6, 2016

0.1.1.2

Nov 6, 2016

0.1.1.1

Nov 6, 2016

0.1.1

Nov 6, 2016

0.1.0

Sep 14, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydlm-0.1.1.13.tar.gz (2.2 MB view details)

Uploaded Aug 13, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pydlm-0.1.1.13-py2.py3-none-any.whl (64.3 kB view details)

Uploaded Aug 13, 2024 Python 2Python 3

File details

Details for the file pydlm-0.1.1.13.tar.gz.

File metadata

Download URL: pydlm-0.1.1.13.tar.gz
Upload date: Aug 13, 2024
Size: 2.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for pydlm-0.1.1.13.tar.gz
Algorithm	Hash digest
SHA256	`312071a2b086090203dde04a60c34f9a3f562e7578dd8ac594a076049718bb99`
MD5	`231318951e5ff21ffe7ba0be6f40ff3a`
BLAKE2b-256	`84b2fb3621fdf7d11b62c3acd1d0e57c6d5232177dc7c948b83401c2ca3490f0`

See more details on using hashes here.

File details

Details for the file pydlm-0.1.1.13-py2.py3-none-any.whl.

File metadata

Download URL: pydlm-0.1.1.13-py2.py3-none-any.whl
Upload date: Aug 13, 2024
Size: 64.3 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for pydlm-0.1.1.13-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`3ab47c3c05e01538ea2bf6717a4a5c5c73cc80a8538a35a61820c67c6dc01854`
MD5	`1bdbba19e57f20c6556a9721ff81190d`
BLAKE2b-256	`e560b0ccdf6b0344b013e95541093bb279ef3d4e1175ed308f9e2c95a520fa53`

See more details on using hashes here.

pydlm 0.1.1.13

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PyDLM

Updates

Installation

Google data science post example

A simple model

Dynamic linear regression

Documentation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes