A Python module for regression and classification with the Partial Least Squares algorithm
Project description
Version 1.0.3 is a quick release that fixes the problem with relative imports in the previous version. Python3 does not like relative imports.
Version 1.0.2 fixes the module packaging errors that had crept into the previous version.
Version 1.0.1 includes a couple of CSV data files in the Examples directory that were inadvertently left out of Version 1.0 packaging of the module.
You may need this module if (1) you are trying to make multidimensional predictions from multidimensional observations; (2) the dimensionality of the observation space is large; and (3) the data you have available for constructing a prediction model is rather limited. The more traditional multiple linear regression (MLR) algorithms are likely to become numerically unstable under these conditions.
In addition to presenting an implementation of the main Partial Least Squares (PLS) algorithm that can be used to make a multidimensional prediction from a multidimensional observation, this module also includes what is known as the PLS1 algorithm for the case when the predicted entity is just onedimensional (as in, say, face recognition in computer vision).
Typical usage syntax:
In the notation that is typically used for describing PLS, X denotes the matrix formed by multidimensional observations, with each row of X standing for the values taken by all the predictor variables. And Y denotes the matrix formed by the values for the predicted variables. Each row of Y corresponds to the prediction that can be made on the basis of the corresponding row of X. Let's say that you have some previously collected data for the X and Y matrices in the form of CSV records in disk files. Given these X and Y, you would want to calculate the matrix B of regression coefficients with this module. Toward that end, you can make the following calls in your script: import PartialLeastSquares as PLS XMatrix_file = "X_data.csv" YMatrix_file = "Y_data.csv" pls = PLS.PartialLeastSquares( XMatrix_file = XMatrix_file, YMatrix_file = YMatrix_file, epsilon = 0.0001, ) pls.get_XMatrix_from_csv() pls.get_YMatrix_from_csv() B = pls.PLS() The object B returned by the last call will be a numpy matrix consisting of the calculated regression coefficients. Let's say that you now have a matrix Xtest of new data for the predictor variables. All you have to do to calculate the values for the predicted variables is Ytest = Xtest * B
Project details
Release history Release notifications
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Filename, size  File type  Python version  Upload date  Hashes 

Filename, size PartialLeastSquares1.0.3.tar.gz (1.3 MB)  File type Source  Python version None  Upload date  Hashes View hashes 
Hashes for PartialLeastSquares1.0.3.tar.gz
Algorithm  Hash digest  

SHA256  8fdbc0233ce549838d9cb260ba18562052a3d379c968b7d72d342f3baca40bee 

MD5  79a59ff19bc301faaa77ffc65faa5844 

BLAKE2256  2a53b829fbcfa91da62de8d4ebc1282b4265b5c060aa34b4208e49a849aeb31a 