Tool package for epidemiologic analyses
Project description
zEpid
zEpid is an epidemiology analysis package that provides easy-to-use tools for epidemiologists coding in Python 3.5+. The purpose of this library is to provide a toolset to make epidemiology e-z. Its functions generate a variety of calculations and plots. For a sample walkthrough of what this library can do, see the tutorials available at https://github.com/pzivich/Python-for-Epidemiologists
A few highlights: basic epidemiology calculations, easy creation of functional form assessment plots and effect measure plots, and causal inference tools. Implemented estimators include: inverse probability of treatment weights, inverse probability of censoring weights, inverse probability of missing weights, augmented inverse probability of treatment weights, time-fixed g-formula, Monte Carlo g-formula, iterative conditional g-formula, and targeted maximum likelihood estimation (TMLE). Additionally, generalizability/transportability tools are available, including inverse probability of sampling weights, the g-transport formula, and doubly robust generalizability/transportability formulas.
If you would like a feature to be included, please contact me and I will work on adding it. You can contact me through GitHub (https://github.com/pzivich), email (gmail: zepidpy), or Twitter (@zepidpy).
Installation
Installing:
You can install zEpid using pip install zepid
Dependencies:
pandas >= 0.18.0, numpy, statsmodels >= 0.7.0, matplotlib >= 2.0, scipy, tabulate
Module Features
Measures
Measures can be calculated directly from a pandas DataFrame object or from summary data. Implemented measures include: risk ratio, risk difference, odds ratio, incidence rate ratio, incidence rate difference, number needed to treat, sensitivity, specificity, population attributable fraction, and attributable community risk.
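As a rough illustration of what the basic measures compute, the sketch below derives the risk difference, risk ratio, and odds ratio with Wald-type 95% confidence intervals from the four cells of a 2x2 table. It uses only the standard library and is not zEpid's API; the function name `two_by_two_measures` and the example counts are hypothetical.

```python
import math


def two_by_two_measures(a, b, c, d, z=1.96):
    """Risk difference, risk ratio, and odds ratio from 2x2 cell counts.

    a: exposed with the outcome      b: exposed without the outcome
    c: unexposed with the outcome    d: unexposed without the outcome
    Each measure is returned as (estimate, lower, upper) using Wald intervals.
    """
    n1, n0 = a + b, c + d
    r1, r0 = a / n1, c / n0

    # Risk difference: the interval is built on the natural scale.
    rd = r1 - r0
    se_rd = math.sqrt(r1 * (1 - r1) / n1 + r0 * (1 - r0) / n0)

    # Ratio measures: the interval is built on the log scale, then exponentiated.
    rr = r1 / r0
    se_log_rr = math.sqrt(1 / a - 1 / n1 + 1 / c - 1 / n0)
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)

    def ratio_ci(estimate, se_log):
        log_est = math.log(estimate)
        return (estimate, math.exp(log_est - z * se_log), math.exp(log_est + z * se_log))

    return {
        "risk_difference": (rd, rd - z * se_rd, rd + z * se_rd),
        "risk_ratio": ratio_ci(rr, se_log_rr),
        "odds_ratio": ratio_ci(odds_ratio, se_log_or),
    }


# Hypothetical counts: 20 of 100 exposed and 10 of 100 unexposed had the outcome.
results = two_by_two_measures(a=20, b=80, c=10, d=90)
for name, (estimate, lower, upper) in results.items():
    print(f"{name}: {estimate:.3f} (95% CI {lower:.3f}, {upper:.3f})")
```

The sketch assumes every cell count is greater than zero; a zero cell would require a continuity correction or an exact method.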
Other handy features include: splines, Table 1 generator, interaction contrast, interaction contrast ratio, positive predictive value, negative predictive value, screening cost analyzer, counternull p-values, and conversion between odds and proportions.
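Two of the helpers listed above, predictive values and odds/proportion conversion, reduce to short formulas. The sketch below shows the arithmetic with the standard library only; the function names and input values are hypothetical and are not taken from zEpid.

```python
def odds_to_proportion(odds):
    """Convert odds to a proportion: p = odds / (1 + odds)."""
    return odds / (1 + odds)


def proportion_to_odds(proportion):
    """Convert a proportion to odds: odds = p / (1 - p)."""
    return proportion / (1 - proportion)


def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given outcome prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Hypothetical test: 90% sensitivity, 80% specificity, 10% prevalence.
ppv, npv = predictive_values(sensitivity=0.9, specificity=0.8, prevalence=0.1)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
print(f"odds of 3 as a proportion = {odds_to_proportion(3):.2f}")
```

Even with a fairly accurate test, the positive predictive value stays low when the outcome is rare, which is why prevalence is a required input.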
For guided tutorials with Jupyter Notebooks: https://github.com/pzivich/Python-for-Epidemiologists/blob/master/3_Epidemiology_Analysis/a_basics/1_basic_measures.ipynb
Graphics
Uses matplotlib in the background to generate some useful plots. Implemented plots include: functional form assessment (with statsmodels output), p-value function plots, spaghetti plots, effect measure plots (forest plots), receiver operating characteristic (ROC) curves, dynamic risk plots, and L'Abbe plots.
For examples see: http://zepid.readthedocs.io/en/latest/Graphics.html
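To give a sense of what a p-value function plot displays, the sketch below computes the two-sided p-value for a range of candidate null values, given a point estimate and its standard error under a normal approximation. It produces only the values that would be plotted (no matplotlib call is made), and the function name `p_value_function` and the inputs are hypothetical rather than zEpid's own interface.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1 + math.erf(x / math.sqrt(2)))


def p_value_function(estimate, se, null_values):
    """Two-sided p-value of the estimate against each candidate null value."""
    return [2 * (1 - normal_cdf(abs(estimate - null) / se)) for null in null_values]


# Hypothetical risk difference of -0.05 with a standard error of 0.03.
estimate, se = -0.05, 0.03
grid = [-0.15 + 0.01 * i for i in range(21)]  # candidate null values
p_values = p_value_function(estimate, se, grid)
for null, p in zip(grid[::5], p_values[::5]):
    print(f"null = {null:+.2f}  p = {p:.3f}")
```

The curve peaks at p = 1 where the null value equals the point estimate, and the null values at which it crosses 0.05 correspond to the 95% confidence limits.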
Causal
The causal branch includes various estimators for causal inference with observational data. Details on currently implemented estimators are below:
G-Computation Algorithm
Current implementation includes: time-fixed exposure g-formula, Monte Carlo g-formula, and iterative conditional g-formula.
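The time-fixed g-formula can be illustrated with a single binary confounder, where it reduces to standardization: average the stratum-specific risk under a chosen treatment over the confounder distribution of the whole sample. The sketch below is a minimal, assumption-laden version that uses stratum proportions in place of a regression model; the data, the `expand` helper, and `g_formula_risk` are hypothetical and not part of zEpid.

```python
from collections import defaultdict


def expand(counts):
    """Turn {(a, l): (n, events)} into individual (a, l, y) records."""
    records = []
    for (a, l), (n, events) in counts.items():
        records += [(a, l, 1)] * events + [(a, l, 0)] * (n - events)
    return records


def g_formula_risk(records, treatment):
    """Standardized risk had everyone received `treatment` (0 or 1)."""
    stratum_n = defaultdict(int)   # people in each confounder stratum
    cell_n = defaultdict(int)      # people in the stratum with A == treatment
    cell_events = defaultdict(int)
    for a, l, y in records:
        stratum_n[l] += 1
        if a == treatment:
            cell_n[l] += 1
            cell_events[l] += y
    total = len(records)
    risk = 0.0
    for l, count in stratum_n.items():
        if cell_n[l] == 0:
            raise ValueError(f"no one with treatment={treatment} in stratum {l}")
        # P(L = l) * P(Y = 1 | A = treatment, L = l)
        risk += (count / total) * (cell_events[l] / cell_n[l])
    return risk


# Hypothetical data, keyed by (treatment, confounder): (sample size, events).
records = expand({(1, 0): (40, 8), (0, 0): (60, 6), (1, 1): (60, 30), (0, 1): (40, 12)})
risk_treated = g_formula_risk(records, treatment=1)
risk_untreated = g_formula_risk(records, treatment=0)
print(f"standardized risk difference = {risk_treated - risk_untreated:.3f}")
```

In these made-up data the crude risk difference is 0.20 (38/100 versus 18/100), while the standardized difference is smaller because the confounder is more common among the treated.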
Inverse Probability Weights
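Current implementation includes: inverse probability of treatment weights (IPTW), inverse probability of censoring weights (IPCW), and inverse probability of missing weights (IPMW). Diagnostics are also available for IPTW. IPMW supports monotone missing data.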
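The idea behind inverse probability of treatment weighting can be sketched in a few lines: weight each person by the inverse of the probability of the treatment they actually received, given confounders, then compare weighted risks. The example below estimates that probability from stratum proportions of one binary confounder rather than from a fitted regression model, so it is a simplification; the data and the names `expand` and `iptw_risk` are hypothetical and not zEpid's API.

```python
from collections import defaultdict


def expand(counts):
    """Turn {(a, l): (n, events)} into individual (a, l, y) records."""
    records = []
    for (a, l), (n, events) in counts.items():
        records += [(a, l, 1)] * events + [(a, l, 0)] * (n - events)
    return records


def iptw_risk(records, treatment):
    """Weighted risk among those with A == treatment, using unstabilized weights."""
    # Estimate P(A = 1 | L) as the treated proportion within each stratum of L.
    stratum_n = defaultdict(int)
    stratum_treated = defaultdict(int)
    for a, l, _ in records:
        stratum_n[l] += 1
        stratum_treated[l] += a

    weighted_events = 0.0
    weight_total = 0.0
    for a, l, y in records:
        if a != treatment:
            continue
        p_treated = stratum_treated[l] / stratum_n[l]
        p_received = p_treated if a == 1 else 1 - p_treated
        weight = 1 / p_received
        weighted_events += weight * y
        weight_total += weight
    return weighted_events / weight_total


# Hypothetical data, keyed by (treatment, confounder): (sample size, events).
records = expand({(1, 0): (40, 8), (0, 0): (60, 6), (1, 1): (60, 30), (0, 1): (40, 12)})
risk_treated = iptw_risk(records, treatment=1)
risk_untreated = iptw_risk(records, treatment=0)
print(f"IPTW risk difference = {risk_treated - risk_untreated:.3f}")
```

Because the treatment model here is saturated, the weighted estimate coincides with direct standardization over the confounder; with a parametric model the two would generally differ slightly.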
Augmented Inverse Probability Weights
Current implementation includes the augmented-IPTW estimator described by Funk et al 2011 AJE
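The double robustness of augmented IPTW can be illustrated with the standard estimating equation, which combines a treatment model with an outcome model and stays consistent if either one is correct. In the sketch below both nuisance functions are hard-coded for a hypothetical data set instead of being estimated by regression, and each is deliberately misspecified in turn; the names `expand` and `aipw_risk` are assumptions for illustration, not zEpid's API.

```python
def expand(counts):
    """Turn {(a, l): (n, events)} into individual (a, l, y) records."""
    records = []
    for (a, l), (n, events) in counts.items():
        records += [(a, l, 1)] * events + [(a, l, 0)] * (n - events)
    return records


def aipw_risk(records, treatment, propensity, outcome_model):
    """Augmented IPTW estimate of the risk had everyone received `treatment`.

    propensity(l)       -> P(A = 1 | L = l)
    outcome_model(a, l) -> predicted P(Y = 1 | A = a, L = l)
    """
    total = 0.0
    for a, l, y in records:
        p_a = propensity(l) if treatment == 1 else 1 - propensity(l)
        received = 1 if a == treatment else 0
        predicted = outcome_model(treatment, l)
        # Weighted outcome minus the augmentation term.
        total += received * y / p_a - (received - p_a) / p_a * predicted
    return total / len(records)


# Hypothetical data, keyed by (treatment, confounder): (sample size, events).
records = expand({(1, 0): (40, 8), (0, 0): (60, 6), (1, 1): (60, 30), (0, 1): (40, 12)})

# Nuisance functions that match the data above, and versions that ignore L.
true_propensity = lambda l: {0: 0.4, 1: 0.6}[l]
wrong_propensity = lambda l: 0.5
true_outcome = lambda a, l: {(1, 0): 0.2, (0, 0): 0.1, (1, 1): 0.5, (0, 1): 0.3}[(a, l)]
wrong_outcome = lambda a, l: 0.38 if a == 1 else 0.18

rd_wrong_outcome = (aipw_risk(records, 1, true_propensity, wrong_outcome)
                    - aipw_risk(records, 0, true_propensity, wrong_outcome))
rd_wrong_propensity = (aipw_risk(records, 1, wrong_propensity, true_outcome)
                       - aipw_risk(records, 0, wrong_propensity, true_outcome))
print(f"wrong outcome model:    {rd_wrong_outcome:.3f}")
print(f"wrong propensity model: {rd_wrong_propensity:.3f}")
```

Both calls recover the same confounder-adjusted risk difference, whereas misspecifying both models at once would not.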
Targeted Maximum Likelihood Estimator
TMLE can be estimated through standard logistic regression models or through user-input functions. Alternatively, users can input machine learning algorithms to estimate probabilities. Supported machine learning algorithms include those from sklearn.
Generalizability / Transportability
For generalizing results or transporting them to a different target population, several estimators are available. These include inverse probability of sampling weights, the g-transport formula, and doubly robust formulas.
Tutorials for the usage of these estimators are available at: https://github.com/pzivich/Python-for-Epidemiologists/tree/master/3_Epidemiology_Analysis/c_causal_inference
G-estimation of Structural Nested Mean Models
Single time-point g-estimation of structural nested mean models is supported.
Sensitivity Analyses
Includes a trapezoidal distribution generator and a corrected risk ratio.
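A trapezoidal distribution is often used in probabilistic sensitivity analysis to express uncertainty about a bias parameter: values between two modes are equally likely, with density tapering linearly to zero at the minimum and maximum. The sketch below draws from it by inverse-CDF sampling with the standard library only; the function names and the parameter values are hypothetical and do not reproduce zEpid's generator.

```python
import math
import random


def trapezoid_inverse_cdf(u, low, mode1, mode2, high):
    """Quantile function of a trapezoidal distribution for u in [0, 1]."""
    # Height of the flat section, chosen so the total area equals 1.
    height = 2 / (high + mode2 - low - mode1)
    cdf_mode1 = height * (mode1 - low) / 2
    cdf_mode2 = cdf_mode1 + height * (mode2 - mode1)
    if u < cdf_mode1:      # rising edge
        return low + math.sqrt(2 * u * (mode1 - low) / height)
    if u <= cdf_mode2:     # flat section
        return mode1 + (u - cdf_mode1) / height
    # falling edge
    return high - math.sqrt(2 * (1 - u) * (high - mode2) / height)


def sample_trapezoid(n, low, mode1, mode2, high, seed=None):
    """Draw n values by feeding uniform random numbers through the quantile function."""
    rng = random.Random(seed)
    return [trapezoid_inverse_cdf(rng.random(), low, mode1, mode2, high) for _ in range(n)]


# Hypothetical prior for a sensitivity parameter: between 0.70 and 0.90,
# most plausibly between 0.75 and 0.85.
draws = sample_trapezoid(10_000, low=0.70, mode1=0.75, mode2=0.85, high=0.90, seed=0)
print(f"min = {min(draws):.3f}, max = {max(draws):.3f}, mean = {sum(draws) / len(draws):.3f}")
```

Each draw can then be plugged into a bias-correction formula, and the spread of the corrected estimates summarizes how sensitive the result is to the assumed parameter.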
Tutorials are available at: https://github.com/pzivich/Python-for-Epidemiologists/tree/master/3_Epidemiology_Analysis/d_sensitivity_analyses
File details
Details for the file zepid-0.9.1.tar.gz.
File metadata
- Download URL: zepid-0.9.1.tar.gz
- Upload date:
- Size: 535.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.22.0 setuptools/57.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.6.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | a288c14b6fcf8fbd2f5fa5dfcbd1b2a62e8650b9ffa890108e6a62be0451de55
MD5 | 416011e151e7f7ccf27e59648d876c77
BLAKE2b-256 | 2bd8a357673e807d3cc644f8a470a3c28d287788a128646b8fa7085e62f62f6c
File details
Details for the file zepid-0.9.1-py2.py3-none-any.whl.
File metadata
- Download URL: zepid-0.9.1-py2.py3-none-any.whl
- Upload date:
- Size: 555.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.22.0 setuptools/57.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.6.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | c38f6a35a697d6b25d412857bad65342cc0275ab0b8722846976c3fa68d00940
MD5 | 241fcb5fe7eb51aad20fafc06678eaff
BLAKE2b-256 | ad102596e5d37a1e07d75917be5871020f6fe5ee01ef80281785e5a485e3ffaa