Skip to main content

Statistical computations and models for Python

Project description

Travis Build Status Azure CI Build Status Appveyor Build Status Coveralls Coverage

About Statsmodels

Statsmodels is a Python package that provides a complement to scipy for statistical computations including descriptive statistics and estimation and inference for statistical models.

Documentation

The documentation for the latest release is at

https://www.statsmodels.org/stable/

The documentation for the development version is at

https://www.statsmodels.org/dev/

Recent improvements are highlighted in the release notes

https://www.statsmodels.org/stable/release/version0.9.html

Backups of documentation are available at https://statsmodels.github.io/stable/ and https://statsmodels.github.io/dev/.

Main Features

  • Linear regression models:

    • Ordinary least squares

    • Generalized least squares

    • Weighted least squares

    • Least squares with autoregressive errors

    • Quantile regression

    • Recursive least squares

  • Mixed Linear Model with mixed effects and variance components

  • GLM: Generalized linear models with support for all of the one-parameter exponential family distributions

  • Bayesian Mixed GLM for Binomial and Poisson

  • GEE: Generalized Estimating Equations for one-way clustered or longitudinal data

  • Discrete models:

    • Logit and Probit

    • Multinomial logit (MNLogit)

    • Poisson and Generalized Poisson regression

    • Negative Binomial regression

    • Zero-Inflated Count models

  • RLM: Robust linear models with support for several M-estimators.

  • Time Series Analysis: models for time series analysis

    • Complete StateSpace modeling framework

      • Seasonal ARIMA and ARIMAX models

      • VARMA and VARMAX models

      • Dynamic Factor models

      • Unobserved Component models

    • Markov switching models (MSAR), also known as Hidden Markov Models (HMM)

    • Univariate time series analysis: AR, ARIMA

    • Vector autoregressive models, VAR and structural VAR

    • Vector error correction modle, VECM

    • exponential smoothing, Holt-Winters

    • Hypothesis tests for time series: unit root, cointegration and others

    • Descriptive statistics and process models for time series analysis

  • Survival analysis:

    • Proportional hazards regression (Cox models)

    • Survivor function estimation (Kaplan-Meier)

    • Cumulative incidence function estimation

  • Multivariate:

    • Principal Component Analysis with missing data

    • Factor Analysis with rotation

    • MANOVA

    • Canonical Correlation

  • Nonparametric statistics: Univariate and multivariate kernel density estimators

  • Datasets: Datasets used for examples and in testing

  • Statistics: a wide range of statistical tests

    • diagnostics and specification tests

    • goodness-of-fit and normality tests

    • functions for multiple testing

    • various additional statistical tests

  • Imputation with MICE, regression on order statistic and Gaussian imputation

  • Mediation analysis

  • Graphics includes plot functions for visual analysis of data and model results

  • I/O

    • Tools for reading Stata .dta files, but pandas has a more recent version

    • Table output to ascii, latex, and html

  • Miscellaneous models

  • Sandbox: statsmodels contains a sandbox folder with code in various stages of developement and testing which is not considered “production ready”. This covers among others

    • Generalized method of moments (GMM) estimators

    • Kernel regression

    • Various extensions to scipy.stats.distributions

    • Panel data models

    • Information theoretic measures

How to get it

The master branch on GitHub is the most up to date code

https://www.github.com/statsmodels/statsmodels

Source download of release tags are available on GitHub

https://github.com/statsmodels/statsmodels/tags

Binaries and source distributions are available from PyPi

https://pypi.org/project/statsmodels/

Binaries can be installed in Anaconda

conda install statsmodels

Installing from sources

See INSTALL.txt for requirements or see the documentation

https://statsmodels.github.io/dev/install.html

License

Modified BSD (3-clause)

Discussion and Development

Discussions take place on our mailing list.

http://groups.google.com/group/pystatsmodels

We are very interested in feedback about usability and suggestions for improvements.

Bug Reports

Bug reports can be submitted to the issue tracker at

https://github.com/statsmodels/statsmodels/issues

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

statsmodels_dq-3.0.tar.gz (14.1 MB view details)

Uploaded Source

Built Distribution

statsmodels_dq-3.0-cp37-cp37m-win_amd64.whl (8.0 MB view details)

Uploaded CPython 3.7m Windows x86-64

File details

Details for the file statsmodels_dq-3.0.tar.gz.

File metadata

  • Download URL: statsmodels_dq-3.0.tar.gz
  • Upload date:
  • Size: 14.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for statsmodels_dq-3.0.tar.gz
Algorithm Hash digest
SHA256 99737ee68b6720607db343b1e18b3bb7ed1e056141deb19119ef182825ccc5f4
MD5 95e493f4c8d7ea8225046008ed9c894c
BLAKE2b-256 0e0d09fa015728e1abb7de02aa61d09af59923dd528158c516de5e137c840eb0

See more details on using hashes here.

File details

Details for the file statsmodels_dq-3.0-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: statsmodels_dq-3.0-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 8.0 MB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for statsmodels_dq-3.0-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 7a050f9ab473367822ffb399cc364ab77300dcb8e304b633dd2ea059aeb63d28
MD5 7e851883e8a73fb3892a8846daaee2ce
BLAKE2b-256 d2adad358a299d9c652cdf1fbf89c93aedadc39c0cd7e44c647a810f14b25427

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page