Python version of Google's Causal Impact model

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: Apache Software License
Natural Language
- English
Operating System
- Unix
Programming Language
Topic
- Scientific/Engineering

Project description

Causal Impact

A Python implementation of Google's Causal Impact (causal inference) model, with all functionality fully ported and tested.

How it works

The main goal of the algorithm is to infer the expected effect a given intervention (or any action) had on some response variable by analyzing differences between expected and observed time series data.

Data is divided into two parts. The first is known as the "pre-intervention" period, where the concept of Bayesian Structural Time Series is used to fit a model that best explains what has been observed. The fitted model is then used on the second part of the data (the "post-intervention" period) to forecast what the response would have looked like had the intervention not taken place. The inferences are based on the differences between the observed response and the predicted one, which yield the absolute and relative expected effect the intervention had on the data.
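The arithmetic behind the last step can be sketched in a few lines. This is only an illustration and not the package's code: the function name summarize_effect and the numbers are hypothetical, and the relative effect is taken here as the cumulative difference divided by the cumulative prediction, which is one common definition.

```python
import numpy as np


def summarize_effect(observed, predicted):
    """Compare observed post-intervention values with counterfactual predictions."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    pointwise = observed - predicted  # effect at each time point
    return {
        "average_absolute": pointwise.mean(),
        "cumulative_absolute": pointwise.sum(),
        # Assumption: relative effect = total difference / total prediction.
        "relative": pointwise.sum() / predicted.sum(),
    }


# Hypothetical post-intervention data: the forecast expects 100 per period.
observed = np.array([104.0, 106.0, 105.0, 105.0])
predicted = np.array([100.0, 100.0, 100.0, 100.0])
effect = summarize_effect(observed, predicted)
print(effect)
```

In the package itself the prediction comes with uncertainty intervals, which this sketch leaves out.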

The model makes an assumption (which you should confirm holds in your data): that the response variable can be precisely modeled by a linear regression on what are known as "covariates" (or X), which must not be affected by the intervention that took place. For instance, if a company wants to infer what impact a given marketing campaign had on its "revenue", then its daily "visits" cannot be used as a covariate, since total visits are probably affected by the campaign.
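One rough way to check the linear-regression part of this assumption is to fit the response on the covariate over the pre-intervention period and look at how much variance is explained. The sketch below uses plain least squares from numpy on synthetic data; it is a quick diagnostic under those assumptions, not something the package runs for you, and it says nothing about whether the covariate is affected by the intervention.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic pre-intervention data (illustrative only).
X = 100 + 10 * np.sin(np.linspace(0, 6, 70))
y = 1.2 * X + rng.normal(size=70)

# Least-squares fit of y = intercept + slope * X on the pre-period.
design = np.column_stack([np.ones_like(X), X])
coef, *_ = np.linalg.lstsq(design, y, rcond=None)
fitted = design @ coef

# R^2: share of the response variance explained by the covariate.
ss_res = np.sum((y - fitted) ** 2)
ss_tot = np.sum((y - y.mean()) ** 2)
r_squared = 1 - ss_res / ss_tot
print(f"slope={coef[1]:.3f}, R^2={r_squared:.3f}")
```

A low R^2 on the pre-period would suggest the covariate explains little of the response, so the counterfactual forecast is likely to be weak.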

It is most commonly used to infer the impact that marketing interventions have on businesses, such as the expected revenue associated with a given campaign, or to assess more precisely the revenue a given channel brings in by completely turning it off (also known as "hold-out" tests). It's important to note, though, that the model can be used in many different areas and subjects; any intervention on time series data can potentially be modeled, with inferences made from the observed and predicted data.

Please refer to getting started in the examples folder for more information.

Installation

pip install pycausalimpact

Requirements

python{2.7, 3.6, 3.7, 3.8} *
numpy
scipy
statsmodels
matplotlib
jinja2

* We no longer support Python 2.7! Please refer to the tag 0.0.16 (pip install pycausalimpact==0.0.16) for the last version that supports it.

Getting Started

We recommend this presentation by Kay Brodersen (one of the creators of the causal impact implementation in R).

We also created this introductory ipython notebook with examples of how to use this package.

Simple Example

Here's a simple example (which can also be found in Google's original R implementation) running in Python:

import numpy as np
import pandas as pd
from statsmodels.tsa.arima_process import ArmaProcess
from causalimpact import CausalImpact


np.random.seed(12345)
ar = np.r_[1, 0.9]
ma = np.array([1])
arma_process = ArmaProcess(ar, ma)
X = 100 + arma_process.generate_sample(nsample=100)
y = 1.2 * X + np.random.normal(size=100)
y[70:] += 5

data = pd.DataFrame({'y': y, 'X': X}, columns=['y', 'X'])
pre_period = [0, 69]
post_period = [70, 99]

ci = CausalImpact(data, pre_period, post_period)
print(ci.summary())
print(ci.summary(output='report'))
ci.plot()

Differences Between Python and R Packages

One thing you'll notice when using this package is that its results are sometimes similar to the R package's output and at other times lead to different conclusions.

This is quite a complex topic, and we have discussed it more thoroughly in issues #34, #37 and #40, which we highly recommend reading.

In a nutshell, the Python implementation relies on statsmodels, which uses a classical Kalman filter approach to solve the state-space equations, whereas the R implementation uses a Bayesian approach (from the bsts package) with a stochastic Kalman filter technique; both algorithms are expected to converge to a similar final state-space solution (ref).

Still, despite the similarities, the two packages use different assumptions for prior initialization and for the steps involved in the optimization process: the R package relies on the user's prior knowledge, while the Python package uses classical statistical techniques that aim to maximize the likelihood function expressed in terms of the structural time series components.
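To make the likelihood-based approach more concrete, here is a minimal sketch of a Kalman filter for a local level model, used to score candidate values of the level standard deviation. It is not the statsmodels implementation: the function local_level_loglike is hypothetical, the observation noise is held fixed at 1.0 for simplicity, the initial state variance is an arbitrary large number, and a coarse grid search stands in for a numerical optimizer.

```python
import numpy as np


def local_level_loglike(y, sigma_obs, sigma_level):
    """Gaussian log-likelihood of a local level model via the Kalman filter."""
    level, var = y[0], 1e6  # vague initial state (assumption)
    loglike = 0.0
    for obs in y:
        # One-step-ahead prediction error and its variance.
        err = obs - level
        err_var = var + sigma_obs ** 2
        loglike += -0.5 * (np.log(2 * np.pi * err_var) + err ** 2 / err_var)
        # Update the level estimate, then propagate it one step ahead.
        gain = var / err_var
        level = level + gain * err
        var = var * (1 - gain) + sigma_level ** 2
    return loglike


# Synthetic series: a slowly drifting level observed with noise.
rng = np.random.default_rng(42)
true_level = np.cumsum(rng.normal(scale=0.1, size=200))
y = true_level + rng.normal(scale=1.0, size=200)

# Score a few candidate level standard deviations and keep the best one.
candidates = [0.01, 0.03, 0.1, 0.3, 1.0]
scores = [local_level_loglike(y, sigma_obs=1.0, sigma_level=sd) for sd in candidates]
best_sd = candidates[int(np.argmax(scores))]
print(f"level sd maximizing the likelihood on this grid: {best_sd}")
```

The point of the sketch is only that the level standard deviation is chosen from the data by comparing likelihoods, rather than being supplied by the user as prior knowledge.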

As we discuss in the issues mentioned above, it's hard to tell which is right or "more right"; each package has its own assumptions and techniques, leaving it up to the end user to decide what is appropriate. We recommend comparing results from both packages on your use cases to get a better idea of whether their conclusions converge.

As a final note, when using this Python package, we highly recommend setting the prior as None like so:

ci = CausalImpact(data, pre_period, post_period, prior_level_sd=None)

This will let statsmodels itself optimize the prior on the local level component. If you are confident that your local level prior should be a specific value (say 0.01), then it's probably OK to use it there; otherwise you run the risk of obtaining sub-optimal solutions.

Contributing, Bugs, Questions

Contributions are more than welcome! If you want to propose changes, fix bugs or improve something, feel free to fork the repository and send us a Pull Request. You can also open new Issues to report bugs and general problems.

Release history

- 0.1.1: May 11, 2020 (this version)
- 0.1.0: May 11, 2020
- 0.0.16: Jan 30, 2020
- 0.0.15: Oct 30, 2019
- 0.0.14: Oct 13, 2019
- 0.0.13: Aug 23, 2019
- 0.0.12: Apr 26, 2019
- 0.0.10: Feb 25, 2019
- 0.0.8: Dec 31, 2018
- 0.0.7: Nov 27, 2018
- 0.0.6: Nov 12, 2018
- 0.0.5: Oct 18, 2018
- 0.0.4: Oct 18, 2018

Download files

Source Distribution

pycausalimpact-0.1.1.tar.gz (37.3 kB view details)

Uploaded May 11, 2020 Source

Built Distribution

pycausalimpact-0.1.1-py2.py3-none-any.whl (30.3 kB view details)

Uploaded May 11, 2020 Python 2, Python 3

File details

Details for the file pycausalimpact-0.1.1.tar.gz.

File metadata

Download URL: pycausalimpact-0.1.1.tar.gz
Upload date: May 11, 2020
Size: 37.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.9

File hashes

Hashes for pycausalimpact-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`5e6d072e75369ad8bb4559f332c49f32af4f74509efd65673e3bc73ee40455fa`
MD5	`4d091ded48b08e334557cac7b6e2c68d`
BLAKE2b-256	`e3624b471c8ceb8f9a2115892bf80a438b7e2567a8a4fe0d9f95544a1fc53918`

File details

Details for the file pycausalimpact-0.1.1-py2.py3-none-any.whl.

File metadata

Download URL: pycausalimpact-0.1.1-py2.py3-none-any.whl
Upload date: May 11, 2020
Size: 30.3 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.9

File hashes

Hashes for pycausalimpact-0.1.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`360c56b277a62fd4b0c2c1c6d9182593d4f9c82c48b67aa39a5feb50bdc36367`
MD5	`69322a91167f06b6f98cd2d010867a51`
BLAKE2b-256	`e2cd2f9b327f58d5918c5c193434b733a6eafaf8a876ce30182f16216cba7001`
