Statistical computations and models for use with SciPy

Statsmodels is a Python package that provides a complement to scipy for
statistical computations including descriptive statistics and
estimation of statistical models.

scikits.statsmodels provides classes and functions for the estimation of
several categories of statistical models. These currently include linear
regression models (OLS, GLS, WLS and GLS with AR(p) errors); generalized
linear models for six distribution families; M-estimators for robust
linear models; regression with discrete dependent variables (Logit,
Probit, MNLogit and Poisson), based on maximum likelihood estimators; and
time series models (ARMA, AR and VAR). An extensive list of result statistics
is available for each estimation problem. Statsmodels also contains
descriptive statistics, a wide range of statistical tests, tools for density
estimation and more.

We welcome feedback on our mailing list http://groups.google.com/group/pystatsmodels.
Report problems on our bug tracker https://github.com/statsmodels/statsmodels/issues.

For updated versions between releases, we recommend our repository on GitHub
https://github.com/statsmodels/statsmodels.

Main changes for 0.3.0
----------------------

*Changes that break backwards compatibility*

Added api.py for importing. The new convention for importing is ::

    import scikits.statsmodels.api as sm

Importing from modules directly now avoids unnecessary imports and increases
the import speed if a library or user only needs specific functions.
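As a rough illustration of the direct-import style, the sketch below fetches
only the OLS class from its new module location instead of going through
api.py. The helper name ``import_ols`` is hypothetical, and the guard around
the import is an assumption added so the snippet also runs on systems where
scikits.statsmodels is not installed.

```python
import importlib


def import_ols():
    """Return the OLS class via a direct module import.

    Returns None if scikits.statsmodels is not installed (hypothetical
    helper, used only for this illustration).
    """
    try:
        # Module path follows the 0.3.0 layout: regression/linear_model.py
        linear_model = importlib.import_module(
            "scikits.statsmodels.regression.linear_model")
    except ImportError:
        return None
    return linear_model.OLS


OLS = import_ols()
print("OLS available:", OLS is not None)
```

A library that needs only one estimator can import it this way and skip the
extra imports that api.py pulls in.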

* sandbox/output.py -> iolib/table.py
* lib/io.py -> iolib/foreign.py (Now contains Stata .dta format reader)
* family -> families
* families.links.inverse -> families.links.inverse_power
* Datasets' Load class is now load function.
* regression.py -> regression/linear_model.py
* discretemod.py -> discrete/discrete_model.py
* rlm.py -> robust/robust_linear_model.py
* glm.py -> genmod/generalized_linear_model.py
* model.py -> base/model.py
* t() method -> tvalues attribute (t() still exists but raises a warning)
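The last rename keeps the old method working while steering users to the new
attribute. A minimal sketch of that pattern is shown below. The class
``ResultsSketch`` is hypothetical and is not the statsmodels implementation,
and the choice of ``DeprecationWarning`` is an assumption, since the notes
above do not say which warning category is raised.

```python
import warnings


class ResultsSketch:
    """Hypothetical results class illustrating the t() -> tvalues rename."""

    def __init__(self, params, bse):
        self.params = list(params)  # estimated coefficients
        self.bse = list(bse)        # their standard errors

    @property
    def tvalues(self):
        # New-style access: t-statistics exposed as an attribute.
        return [p / s for p, s in zip(self.params, self.bse)]

    def t(self):
        # Old-style access: still works, but warns and delegates.
        warnings.warn("t() is deprecated; use the tvalues attribute",
                      DeprecationWarning, stacklevel=2)
        return self.tvalues


sketch = ResultsSketch(params=[2.0, -1.5], bse=[0.5, 0.5])
print(sketch.tvalues)
```

Code written against the old interface keeps running, and the warning points
to the replacement.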

*Main changes and additions*

* Numerous bugfixes.
* Time Series Analysis models (tsa)

  - Vector Autoregression Models VAR (tsa.VAR)
  - Autoregressive Models AR (tsa.AR)
  - Autoregressive Moving Average Models ARMA (tsa.ARMA):
    optionally uses Cython for Kalman filtering;
    to enable it, run setup.py install with the option --with-cython
  - Baxter-King band-pass filter (tsa.filters.baxter_king)
  - Hodrick-Prescott filter (tsa.filters.hpfilter)
  - Christiano-Fitzgerald filter (tsa.filters.cffilter)

* Improved maximum likelihood framework that can use all available scipy.optimize solvers
* Refactor of the datasets sub-package.
* Added more datasets for examples.
* Removed RPy dependency for running the test suite.
* Refactored the test suite.
* Refactored codebase/directory structure.
* Support for offset and exposure in GLM.
* Removed data_weights argument to GLM.fit for Binomial models.
* New statistical tests, especially diagnostic and specification tests
* Multiple test correction
* Generalized Method of Moments (GMM) framework in sandbox
* Improved documentation
* and other additions
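To make the "multiple test correction" item concrete, here is a minimal
pure-Python sketch of two standard p-value adjustments, Bonferroni and Holm
step-down. The list above does not say which corrections the release
implements, so the choice of methods and the function names
``bonferroni_adjust`` and ``holm_adjust`` are assumptions made for
illustration, not the statsmodels API.

```python
def bonferroni_adjust(pvalues):
    """Multiply each p-value by the number of tests, capped at 1."""
    m = len(pvalues)
    return [min(1.0, m * p) for p in pvalues]


def holm_adjust(pvalues):
    """Return Holm step-down adjusted p-values in the original order."""
    m = len(pvalues)
    # Indices of the p-values, from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest p-value is scaled by (m - k + 1).
        candidate = min(1.0, (m - rank) * pvalues[idx])
        # Enforce monotonicity of the adjusted values.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


raw = [0.01, 0.04, 0.03, 0.005]
print(bonferroni_adjust(raw))
print(holm_adjust(raw))
```

Holm is never more conservative than Bonferroni, so each of its adjusted
p-values is at most the matching Bonferroni value.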


Main Changes in 0.2.0
---------------------

* Improved and expanded documentation, with more examples.
* Added four discrete choice models: Poisson, Probit, Logit, and Multinomial Logit.
* Added PyDTA, a set of tools for reading Stata binary datasets (*.dta) and
  putting them into numpy arrays.
* Added four new datasets for examples and tests.
* Results classes have been refactored to use lazy evaluation.
* Improved support for maximum likelihood estimation.
* Bugfixes.
* Renames for more consistency:

  - RLM.fitted_values -> RLM.fittedvalues
  - GLMResults.resid_dev -> GLMResults.resid_deviance
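The lazy evaluation mentioned above means that a result statistic is computed
only when it is first requested and is then reused. The sketch below shows
one way this can be done with a cached property. The class
``LazyResultsSketch`` and its attributes are hypothetical and do not
reproduce the actual statsmodels results classes.

```python
class LazyResultsSketch:
    """Hypothetical results class that computes a statistic on demand."""

    def __init__(self, residuals):
        self.residuals = list(residuals)
        self._cache = {}
        self.evaluations = 0  # counts how often the statistic is computed

    @property
    def ssr(self):
        # Sum of squared residuals, computed on first access only.
        if "ssr" not in self._cache:
            self.evaluations += 1
            self._cache["ssr"] = sum(r * r for r in self.residuals)
        return self._cache["ssr"]


lazy_res = LazyResultsSketch([1.0, -2.0, 0.5])
first = lazy_res.ssr   # triggers the computation
second = lazy_res.ssr  # served from the cache
print(first, second, lazy_res.evaluations)
```

Statistics that are never requested cost nothing, which matters when a
results object exposes many of them.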


Python 3
--------

scikits.statsmodels has been ported to and tested with Python 3.2. A Python 3
version of the code can be obtained by running 2to3.py over the entire
statsmodels source. The numerical core of statsmodels worked almost without
changes; however, there can be problems with data input and plotting.
The Stata file reader and writer in iolib.foreign have not been ported yet,
and there are still some problems with the matplotlib version for Python 3
that was used in testing. Running the test suite with Python 3.2 shows some
errors related to foreign and matplotlib.


Sandbox
-------

We are continuing to work on support for systems of equations models, panel data
models, time series analysis, and information and entropy econometrics in the
sandbox. This code is often merged into trunk as it becomes more robust.


Windows Help
------------
The source distribution for Windows includes an HTML Help file (statsmodels.chm).
This can be opened from the Python interpreter ::

    >>> import scikits.statsmodels.api as sm
    >>> sm.open_help()
