Skip to main content

Python version of Google's Causal Impact model

Project description

# Causal Impact [![Build Status](https://travis-ci.com/dafiti/causalimpact.svg?branch=master)](https://travis-ci.com/dafiti/causalimpact) [![Coverage Status](https://coveralls.io/repos/github/dafiti/causalimpact/badge.svg?branch=master)](https://coveralls.io/github/dafiti/causalimpact?branch=master) [![PyPI version](https://badge.fury.io/py/pycausalimpact.svg)](https://badge.fury.io/py/pycausalimpact) [![Pyversions](https://img.shields.io/pypi/pyversions/pycausalimpact.svg)](https://pypi.python.org/pypi/pycausalimpact) [![GitHub license](https://img.shields.io/github/license/dafiti/causalimpact.svg)](https://github.com/dafiti/causalimpact/blob/master/LICENSE)

This repository is a Python version of [Google’s Causal Impact](https://github.com/google/CausalImpact) model with all functionalities fully ported and tested.

## How it works The main goal of the algorithm is to infer the expected effect a given intervention (or any action) had on some response variable by analyzing differences between expected and observed time series data.

Data is divided in two parts: the first one is what is known as the “pre-intervention” period and the concept of [Bayesian Structural Time Series](https://en.wikipedia.org/wiki/Bayesian_structural_time_series) is used to fit a model that best explains what has been observed. The fitted model is used in the second part of data (“post-intervention” period) to forecast what the response would look like had the intervention not taken place. The inferences are based on the differences between observed response to the predicted one which yields the absolute and relative expected effect the intervention caused on data.

The model makes as assumption (which is recommended to be confirmed in your data) that the response variable can be precisely modeled by a linear regression with what is known as “covariates” (or X) that must not be affected by the intervention that took place (for instance, if a company wants to infer what impact a given marketing campaign will have on its “revenue”, then its daily “visits” cannot be used as a covariate as probably the total visits might be affected by the campaign.

The model is more commonly used to infer the impact that marketing interventions have on businesses such as the expected revenue associated to a given campaign or even to assert more precisely the revenue a given channel brings in by completely turning it off (also known as “hold-out” tests). It’s important to note though that the model can be extensively used in different areas and subjects; any intervention on time series data can potentially be modeled and inferences be made upon observed and predicted data.

Please refer to <a href=http://nbviewer.jupyter.org/github/dafiti/causalimpact/blob/master/examples/getting_started.ipynb>getting started</a> in the examples folder for more information.

## Instalation

pip install pycausalimpact

or (recommended):

pipenv install pycausalimpact

## Requirements

  • python3

  • numpy

  • scipy

  • statsmodels 0.9.0

  • matplotlib

  • jinja2

## Getting Started We recommend this [presentation](https://www.youtube.com/watch?v=GTgZfCltMm8) by Kay Brodersen (one of the creators of the causal impact implementation in R).

We also created this introductory [ipython notebook](http://nbviewer.jupyter.org/github/dafiti/causalimpact/blob/master/examples/getting_started.ipynb) with examples of how to use this package.

### Simple Example Here’s a simple example (which can also be found in the original Google’s R implementation) running in python:

`python import numpy as np import pandas as pd from statsmodels.tsa.arima_process import ArmaProcess from causalimpact import CausalImpact np.random.seed(12345) ar = np.r_[1, 0.9] ma = np.array([1]) arma_process = ArmaProcess(ar, ma) X = 100 + arma_process.generate_sample(nsample=100) y = 1.2 * X + np.random.normal(size=100) y[70:] += 1 data = pd.DataFrame({'y': y, 'X': X}, columns=['y', 'X']) pre_period = [0, 69] post_period = [70, 99] ci = CausalImpact(data, pre_period, post_period) print(ci.summary()) ci.plot() `

![alt text](https://raw.githubusercontent.com/dafiti/causalimpact/master/examples/ci_plot.png)

## Contributing, Bugs, Questions Contributions are more than welcome! If you want to propose new changes, fix bugs or improve something feel free to fork the repository and send us a Pull Request. You can also open new Issues for reporting bugs and general problems.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycausalimpact-0.0.5.tar.gz (42.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pycausalimpact-0.0.5-py2.py3-none-any.whl (27.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pycausalimpact-0.0.5.tar.gz.

File metadata

  • Download URL: pycausalimpact-0.0.5.tar.gz
  • Upload date:
  • Size: 42.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.27.0 CPython/3.6.5

File hashes

Hashes for pycausalimpact-0.0.5.tar.gz
Algorithm Hash digest
SHA256 565818721bbe2e8faa1da4e362a2b6f744542b03a2b2898fc999acc743adb74f
MD5 06df659885f637b2d8e7d1d45eb777cd
BLAKE2b-256 6b582e69090e62372c04f296b54ea03b68ac6a95f2a1581a83be76825a7b1439

See more details on using hashes here.

File details

Details for the file pycausalimpact-0.0.5-py2.py3-none-any.whl.

File metadata

  • Download URL: pycausalimpact-0.0.5-py2.py3-none-any.whl
  • Upload date:
  • Size: 27.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.27.0 CPython/3.6.5

File hashes

Hashes for pycausalimpact-0.0.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 09a347819f365db46128944a11d1d9d45db0997b025005d088a38f90a82012ae
MD5 08ce1c858e88fb33cb04736db3493452
BLAKE2b-256 2eec000b243434986e8d54566df449c6f2736b4112a08561de1ee6a7337068c3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page