Skip to main content

Python version of glmnet, originally from Stanford University, modified by Han Fang

Project description

# Glmnet for python

## Contact

Han Fang
hanfang.cshl@gmail.com

## Install

Using pip (recommended)

pip install glmnet_py

Complied from source

git clone https://github.com/hanfang/glmnet_py.git
cd glmnet_py
python setup.py install

Requirement: Python3, Linux

Currently, the checked-in version of GLMnet.so is compiled for the following config:

**Linux:** Linux version 2.6.32-573.26.1.el6.x86_64 (gcc version 4.4.7 20120313 (Red Hat 4.4.7-16) (GCC) )
**OS:** CentOS 6.7 (Final)
**Hardware:** 8-core Intel(R) Core(TM) i7-2630QM
**gfortran:** version 4.4.7 20120313 (Red Hat 4.4.7-17) (GCC)


## Usage
import glmnet_py
from glmnet import glmnet

For more examples, see https://github.com/hanfang/glmnet_python/tree/master/test


## Introduction

This is a python version of the popular `glmnet` library (beta release). Glmnet fits the entire lasso or elastic-net regularization path for `linear` regression, `logistic` and `multinomial` regression models, `poisson` regression and the `cox` model.

The underlying fortran codes are the same as the `R` version, and uses a cyclical path-wise coordinate descent algorithm as described in the papers linked below.

Currently, `glmnet` library methods for gaussian, multi-variate gaussian, binomial, multinomial, poisson and cox models are implemented for both normal and sparse matrices.

Additionally, cross-validation is also implemented for gaussian, multivariate gaussian, binomial, multinomial and poisson models. CV for cox models is yet to be implemented.

CV can be done in both serial and parallel manner. Parallellization is done using `multiprocessing` and `joblib` libraries.

During installation, the fortran code is compiled in the local machine using `gfortran`, and is called by the python code.

````diff
+Getting started:
````
*The best starting point to use this library is to start with the Jupyter notebooks in the `test` directory (glmnet_examples.ipynb). Detailed explanations of function calls and parameter values along with plenty of examples are provided there to get you started.*

## Authors:

Algorithm was designed by Jerome Friedman, Trevor Hastie and Rob Tibshirani. Fortran code was written by Jerome Friedman. R wrapper (from which the MATLAB wrapper was adapted) was written by Trevor Hastie.

The original MATLAB wrapper was written by Hui Jiang (14 Jul 2009), and was updated and is maintained by Junyang Qian (30 Aug 2013).

This python wrapper (which was adapted from the MATLAB and R wrappers) was originally written by B. J. Balakumar (5 Sep 2016), later modified by Han Fang.

B. J. Balakumar, bbalasub@stanford.edu (5 Sep 2016).
Department of Statistics, Stanford University, Stanford, California, USA.

REFERENCES:
* Friedman, J., Hastie, T. and Tibshirani, R. (2008) Regularization Paths for Generalized Linear Models via Coordinate Descent,
http://www.jstatsoft.org/v33/i01/
*Journal of Statistical Software, Vol. 33(1), 1-22 Feb 2010*

* Simon, N., Friedman, J., Hastie, T., Tibshirani, R. (2011) Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent,
http://www.jstatsoft.org/v39/i05/
*Journal of Statistical Software, Vol. 39(5) 1-13*

* Tibshirani, Robert., Bien, J., Friedman, J.,Hastie, T.,Simon, N.,Taylor, J. and Tibshirani, Ryan. (2010) Strong Rules for Discarding Predictors in Lasso-type Problems,
http://www-stat.stanford.edu/~tibs/ftp/strong.pdf
*Stanford Statistics Technical Report*



Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glmnet_py-0.1.0b2.tar.gz (1.2 MB view details)

Uploaded Source

Built Distributions

glmnet_py-0.1.0b2-py3-none-any.whl (378.3 kB view details)

Uploaded Python 3

glmnet_py-0.1.0b2-py2.py3-none-any.whl (378.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file glmnet_py-0.1.0b2.tar.gz.

File metadata

File hashes

Hashes for glmnet_py-0.1.0b2.tar.gz
Algorithm Hash digest
SHA256 b03f4205ba1556c532f5f42bf3a220ca877c899edd9a53bb8866d6d3c14f7c2f
MD5 887f736fff59f9568b86cc5044727687
BLAKE2b-256 f51a55b708789a66f2405783bce0f53e02b86128ea3f21d329c87dc6dbef8e51

See more details on using hashes here.

File details

Details for the file glmnet_py-0.1.0b2-py3-none-any.whl.

File metadata

File hashes

Hashes for glmnet_py-0.1.0b2-py3-none-any.whl
Algorithm Hash digest
SHA256 39f6c92088401b47ea0fc13c7a8528aaaf2d0f532eb157ebfd05caa7968ce742
MD5 5fc7b7784e01d85bf5a797b06763f0bd
BLAKE2b-256 8219e1edbc0599f386816551fc6de2653be42cd62ff628ce8b30514392650277

See more details on using hashes here.

File details

Details for the file glmnet_py-0.1.0b2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for glmnet_py-0.1.0b2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 55e5e2704fcba2fc65081780ef4feca94aadd03eac1369ca6e412945dbd11232
MD5 ca06ee8815057eec907d0249acc0b4e7
BLAKE2b-256 26cbf87b3ad5a3136bd35767adc2043d23ff11c5d33cf0dd01f087ad95b90346

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page