A library implementing the Partial Least Squares Path Model algorithm

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

PLSPM: A library implementing Partial Least Squares Path Modeling

Please note: This is not an officially supported Google product.

plspm is a Python 3 package dedicated to Partial Least Squares Path Modeling (PLS-PM) analysis. It is a port of the R package plspm, with additional features adopted from the R package seminr

PLSPM (partial least squares path modeling) is a correlation-based structural equation modeling (SEM) algorithm. It allows for estimation of complex cause-effect or prediction models using latent/manifest variables.

PLSPM may be preferred to other SEM methods for several reasons: it is a method that is appropriate for exploratory research, can be used with small-to-medium sample sizes (as well as large data sets), and does not require assumptions of multivariate normality. (See Hulland, J. (1999). Use of partial least squares (PLS) in strategic management research: a review of four recent studies. Strategic management journal, 20(2), 195-204.) In contrast to covariance-based SEM (CBSEM), goodness of fit is less important, because the purpose of the algorithm is to optimize for prediction of the dependent variable vs. fit of data to a predetermined model. (See "goodness of fit" vs "goodness of model" in Chin, W. W. (2010). How to write up and report PLS analyses. In Handbook of partial least squares (pp. 655-690). Springer, Berlin, Heidelberg.)

Features

Uses variance-based PLS esimation to model composite constructs using Mode A and Mode B
Uses a natural-feeling, domain specific language to build and estimate structural equation models, including second-order constructs
Supports centroid, factorial, and path schemes
Supports metric and non-metric numerical data (including nominal and ordinal)
Handles missing data
Bootstrapping with multi-core support
Tested against seminr, which is, in turn, tested against SmartPLS (Ringle et al., 2015) and ADANCO (Henseler and Dijkstra, 2015), as well as other R packages such as semPLS (Monecke and Leisch, 2012) and matrixpls (Rönkkö, 2016).

Planned but not yet implemented

Native modeling of moderation
Improved assessment measures such as HTMT, VIF, f^2, Q^2, and q^2
Modeling formative constructs using the PLS consistent (PLSc) algorithm

Installation

You can install the latest version of this package using pip:

python3 -m pip install --user plspm

It's hosted on pypi: https://pypi.org/project/plspm/

Use

plspm expects to get a Pandas DataFrame containing your data. You start by creating a Config object with the details of the model, and then pass it, along with the data and optionally some further configuration, to an instance of Plspm. Use the examples below to get started, or browse the documentation (start with Config and Plspm)

Examples

PLS-PM with metric data

Typical example with a Customer Satisfaction Model

#!/usr/bin/env python3
import pandas as pd, plspm.config as c
from plspm.plspm import Plspm
from plspm.scheme import Scheme
from plspm.mode import Mode

satisfaction = pd.read_csv("file:tests/data/satisfaction.csv", index_col=0)

structure = c.Structure()
structure.add_path(["IMAG"], ["EXPE", "SAT", "LOY"])
structure.add_path(["EXPE"], ["QUAL", "VAL", "SAT"])
structure.add_path(["QUAL"], ["VAL", "SAT"])
structure.add_path(["VAL"], ["SAT"])
structure.add_path(["SAT"], ["LOY"])

config = c.Config(structure.path(), scaled=False)
config.add_lv_with_columns_named("IMAG", Mode.A, satisfaction, "imag")
config.add_lv_with_columns_named("EXPE", Mode.A, satisfaction, "expe")
config.add_lv_with_columns_named("QUAL", Mode.A, satisfaction, "qual")
config.add_lv_with_columns_named("VAL", Mode.A, satisfaction, "val")
config.add_lv_with_columns_named("SAT", Mode.A, satisfaction, "sat")
config.add_lv_with_columns_named("LOY", Mode.A, satisfaction, "loy")

plspm_calc = Plspm(satisfaction, config, Scheme.CENTROID)
print(plspm_calc.inner_summary())
print(plspm_calc.path_coefficients())

This will produce the output:

            type  r_squared  block_communality  mean_redundancy       ave
EXPE  Endogenous   0.335194           0.616420         0.206620  0.616420
IMAG   Exogenous   0.000000           0.582269         0.000000  0.582269
LOY   Endogenous   0.509923           0.639052         0.325867  0.639052
QUAL  Endogenous   0.719688           0.658572         0.473966  0.658572
SAT   Endogenous   0.707321           0.758891         0.536779  0.758891
VAL   Endogenous   0.590084           0.664416         0.392061  0.664416

          IMAG      EXPE      QUAL       VAL       SAT  LOY
IMAG  0.000000  0.000000  0.000000  0.000000  0.000000    0
EXPE  0.578959  0.000000  0.000000  0.000000  0.000000    0
QUAL  0.000000  0.848344  0.000000  0.000000  0.000000    0
VAL   0.000000  0.105478  0.676656  0.000000  0.000000    0
SAT   0.200724 -0.002754  0.122145  0.589331  0.000000    0
LOY   0.275150  0.000000  0.000000  0.000000  0.495479    0

Specifying higher-order constructs

Example using seminr's mobile industry data set:

mobi = pd.read_csv("file:tests/data/mobi.csv", index_col=0)

structure = c.Structure()
structure.add_path(["Expectation", "Quality"], ["Satisfaction"])
structure.add_path(["Satisfaction"], ["Complaints", "Loyalty"])

config = c.Config(structure.path(), default_scale=Scale.NUM)
config.add_higher_order("Satisfaction", Mode.A, ["Image", "Value"])
config.add_lv_with_columns_named("Expectation", Mode.A, mobi, "CUEX")
config.add_lv_with_columns_named("Quality", Mode.B, mobi, "PERQ")
config.add_lv_with_columns_named("Loyalty", Mode.A, mobi, "CUSL")
config.add_lv_with_columns_named("Image", Mode.A, mobi, "IMAG")
config.add_lv_with_columns_named("Complaints", Mode.A, mobi, "CUSCO")
config.add_lv_with_columns_named("Value", Mode.A, mobi, "PERV")

mobi_pls = Plspm(mobi, config, Scheme.PATH, 100, 0.00000001)

print(plspm_calc.inner_model())

This will produce the output:

                                     from            to  estimate  std error          t         p>|t|
index                                                                                                
Quality -> Satisfaction           Quality  Satisfaction  0.743041   0.046318  16.042102  3.633866e-40
Expectation -> Satisfaction   Expectation  Satisfaction  0.089572   0.046318   1.933832  5.427626e-02
Satisfaction -> Loyalty      Satisfaction       Loyalty  0.627940   0.049420  12.706272  7.996788e-29
Satisfaction -> Complaints   Satisfaction    Complaints  0.486696   0.055472   8.773752  2.841768e-16

PLS-PM with nonmetric data

Example with the classic Russett data (original data set)

#!/usr/bin/env python3
import pandas as pd, plspm.config as c
from plspm.plspm import Plspm
from plspm.scale import Scale
from plspm.scheme import Scheme
from plspm.mode import Mode

russa = pd.read_csv("file:tests/data/russa.csv", index_col=0)

structure = c.Structure()
structure.add_path(["AGRI", "IND"], ["POLINS"])

config = c.Config(structure.path(), default_scale=Scale.NUM)
config.add_lv("AGRI", Mode.A, c.MV("gini"), c.MV("farm"), c.MV("rent"))
config.add_lv("IND", Mode.A, c.MV("gnpr"), c.MV("labo"))
config.add_lv("POLINS", Mode.A, c.MV("ecks"), c.MV("death"), c.MV("demo"), c.MV("inst"))

plspm_calc = Plspm(russa, config, Scheme.CENTROID, 100, 0.0000001)
print(plspm_calc.inner_summary())
print(plspm_calc.effects())

This will produce the output:

              type  r_squared  block_communality  mean_redundancy       ave
AGRI     Exogenous   0.000000           0.739560         0.000000  0.739560
IND      Exogenous   0.000000           0.907524         0.000000  0.907524
POLINS  Endogenous   0.592258           0.565175         0.334729  0.565175

   from      to    direct  indirect     total
0  AGRI  POLINS  0.225639       0.0  0.225639
1   IND  POLINS  0.671457       0.0  0.671457

Example 2: Different Scaling

PLS-PM using data set russa, and different scaling

#!/usr/bin/python3
import pandas as pd, plspm.config as c, plspm.util as util
from plspm.plspm import Plspm
from plspm.scale import Scale
from plspm.scheme import Scheme
from plspm.mode import Mode

russa = pd.read_csv("file:tests/data/russa.csv", index_col=0)

structure = c.Structure()
structure.add_path(["AGRI", "IND"], ["POLINS"])
config = c.Config(structure.path(), default_scale=Scale.NUM)
config.add_lv("AGRI", Mode.A, c.MV("gini"), c.MV("farm"), c.MV("rent"))
config.add_lv("IND", Mode.A, c.MV("gnpr", Scale.ORD), c.MV("labo", Scale.ORD))
config.add_lv("POLINS", Mode.A, c.MV("ecks"), c.MV("death"), c.MV("demo", Scale.NOM), c.MV("inst"))

plspm_calc = Plspm(russa, config, Scheme.CENTROID, 100, 0.0000001)

Example 3: Missing Data

#!/usr/bin/env python3
import pandas as pd, plspm.config as c
from plspm.plspm import Plspm
from plspm.scale import Scale
from plspm.scheme import Scheme
from plspm.mode import Mode

russa = pd.read_csv("file:tests/data/russa.csv", index_col=0)
russa.iloc[0, 0] = np.NaN
russa.iloc[3, 3] = np.NaN
russa.iloc[5, 5] = np.NaN

structure = c.Structure()
structure.add_path(["AGRI", "IND"], ["POLINS"])
config = c.Config(structure.path(), default_scale=Scale.NUM)
config.add_lv("AGRI", Mode.A, c.MV("gini"), c.MV("farm"), c.MV("rent"))
config.add_lv("IND", Mode.A, c.MV("gnpr"), c.MV("labo"))
config.add_lv("POLINS", Mode.A, c.MV("ecks"), c.MV("death"), c.MV("demo"), c.MV("inst"))

plspm_calc = Plspm(russa, config, Scheme.CENTROID, 100, 0.0000001)

Maintainers

Jez Humble (humble at google.com)

Nicole Forsgren (nicolefv at github.com)

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.5.7

Jun 22, 2024

0.5.6

Aug 6, 2020

0.5.5

Jul 19, 2020

0.5.4

Jul 18, 2020

0.5.3

Jul 5, 2020

0.5.2

Jun 27, 2020

0.5.1

Jun 25, 2020

0.5.0

Jun 25, 2020

0.4.7

Jun 24, 2020

0.4.6

Apr 8, 2020

0.4.5

Oct 30, 2019

0.4.4

Oct 30, 2019

0.4.3

Oct 29, 2019

0.4.2

May 1, 2019

0.4.1

Mar 10, 2019

0.4.0

Mar 9, 2019

0.3.0

Mar 7, 2019

0.2.2

Mar 4, 2019

0.2.1

Mar 3, 2019

0.2.0

Mar 3, 2019

0.1.1

Mar 2, 2019

0.1.0

Feb 24, 2019

0.0.2

Feb 15, 2019

0.0.1

Feb 13, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

plspm-0.5.7.tar.gz (40.2 kB view details)

Uploaded Jun 22, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

plspm-0.5.7-py3-none-any.whl (42.9 kB view details)

Uploaded Jun 22, 2024 Python 3

File details

Details for the file plspm-0.5.7.tar.gz.

File metadata

Download URL: plspm-0.5.7.tar.gz
Upload date: Jun 22, 2024
Size: 40.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for plspm-0.5.7.tar.gz
Algorithm	Hash digest
SHA256	`e40468cb47beff3153f64ec12b24614db98610f707c96ed2e667f59182950c3b`
MD5	`0137d14255989c0abb66a5661bab079c`
BLAKE2b-256	`e6a52c7cc0a557d1fab7374c34a4307687a7766ac05263d46830ff2dba786737`

See more details on using hashes here.

File details

Details for the file plspm-0.5.7-py3-none-any.whl.

File metadata

Download URL: plspm-0.5.7-py3-none-any.whl
Upload date: Jun 22, 2024
Size: 42.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for plspm-0.5.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1abe2a34c2f5da56df617f49363bd3abfd9d46fcaed0c533f2df2c3bbe3e1888`
MD5	`9ad6758320582092aeb5359b9a6796da`
BLAKE2b-256	`77fc2b5252c8d0319f08df87b49d805e51818a812cb9fe600e686361371b9ca1`

See more details on using hashes here.

plspm 0.5.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PLSPM: A library implementing Partial Least Squares Path Modeling

Features

Planned but not yet implemented

Installation

Use

Examples

PLS-PM with metric data

Specifying higher-order constructs

PLS-PM with nonmetric data

Example 2: Different Scaling

Example 3: Missing Data

Maintainers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes