Python version of R package SLIDE

These details have not been verified by PyPI

Project links

Project description

SLIDE_py

A bunch of python wrappers for R code

Overview

SLIDE combines the LOVE (Latent Model-Based Clustering for Biological Discovery) clustering algorithm with knockoff-based statistical inference to identify significant standalone and interacting latent factors. Link to R package: [!https://github.com/jishnu-lab/SLIDE]

Quick Start

Basic Usage

If running into trouble, feel free to use or clone the environment here: /ix3/djishnu/alw399/envs/rhino

From command line

Use the full path if you are not calling slide.py from the same directory

python slide.py \
    --x_path /path/to/your/features.csv \
    --y_path /path/to/your/labels.csv \
    --out_path /path/to/output/directory

In a notebook

import sys
sys.path.append('src/SLIDE')

from slide import OptimizeSLIDE

# Configure input parameters
input_params = {
    'x_path': '/path/to/your/features.csv',
    'y_path': '/path/to/your/labels.csv',
    'fdr': 0.1,
    'thresh_fdr': 0.1,
    'spec': 0.2,
    'y_factor': True,
    'niter': 500,
    'SLIDE_top_feats': 20,
    'rep_CV': 50,
    'pure_homo': True,
    'delta': [0.01],
    'lambda': [0.5, 0.1],
    'out_path': '/path/to/output/directory'
}

# Initialize and run SLIDE
slider = OptimizeSLIDE(input_params)
slider.run_pipeline(verbose=True, n_workers=1)

Pipeline Overview

The run_pipeline() has three main parts:

Stage 1: Latent Factor Discovery

LOVE Algorithm: Runs the overlapping clustering algorithm to identify latent factors
Output: Generates the latent factors (z_matrix) representing underlying data structure

Stage 2: Statistical Inference with SLIDE

2a) Standalone Factor Analysis: Uses knockoffs to identify statistically significant standalone latent factors
2b) Interaction Analysis: Applies knockoffs to discover significant interacting latent factor pairs
Feature Selection: Controls false discovery rate (FDR) while maintaining statistical power

Stage 3: Visualization

Control Plots: Generates diagnostic plots to assess model performance and statistical validity
Latent Factor Genes: For each latent factor, plots the top features with loadings > abs(0.05)

Parameter Configuration

Parameter	Type	Description	Default/Example
`x_path`	str	Path to feature matrix CSV file	Required
`y_path`	str	Path to response labels CSV file	Required
`fdr`	float	False discovery rate threshold (Knockoffs)	0.1
`thresh_fdr`	float	FDR threshold for feature selection (LOVE)	0.1
`spec`	float	minimum % times an LF found to be significant in order to be included	0.2
`y_factor`	bool	Treat response as factor variable	True
`niter`	int	Number of iterations	500
`SLIDE_top_feats`	int	Number of top features to display	20
`pure_homo`	bool	Use homogeneous loadings for pure variables	True
`delta`	list	Regularization parameter(s)	[0.5, 0.1]
`lambda`	list	Penalty parameter(s)	[0.1]
`out_path`	str	Output directory path	Required

Advanced Configuration

pure_homo=True: Forces pure variable loadings to be 1 (recommended)
pure_homo=False: Relaxes the pure variable loading constraint being 1 without losing any guarantees. However, it is difficult to find the right delta parameter
n_workers: Controls parallelization (1 for sequential processing), but CURRENTLY NOTHING IS PARALLELIZED
verbose: Enables detailed progress reporting (just a bunch of print statements)

Project Structure

SLIDE_py/
├── src/
│   ├── SLIDE/              # Core SLIDE implementation
│   │   ├── slide.py        # Main Python interface
│   │   └── ...            # Supporting R functions
│   └── LOVE-master/        # Original LOVE algorithm
│       ├── ...            # Original LOVE code (do not use)
│       ├── ...            # pure_homo LOVE code (use carefully)
|   └── LOVE-SLIDE/        # SLIDE implementation of LOVE

Implementation Details

LOVE Algorithm Integration

Primary Implementation: Located in src/SLIDE/get_Latent_Factors.R
Alternative Version: Available in LOVE-master when pure_homo=False
Note: The original LOVE code in LOVE-master may yield different results than the SLIDE implementation and is provided for reference

To-do list

These files

Yaml conversion: Since people already have pipelines set up, it would be convenient to have a function to read yamls into dictionaries
Other y_factor: Currently only binary y is accomodated.
Parallelization: Knockoffs can be made much faster. Please see select_short_freq in src/SLIDE/knockoffs.py. I was trying to use concurrent futures/ pqdm but I couldn't figure out the errors and gave up.
Correlation networks: I think networkx can make similar graph-like figures, but I'm not familiar with making them

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.7

Dec 10, 2025

0.0.6

Jun 11, 2025

0.0.5

Jun 11, 2025

0.0.4

Jun 10, 2025

0.0.3

Jun 10, 2025

0.0.2

Jun 10, 2025

This version

0.0.1

Jun 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

loveslide-0.0.1.tar.gz (30.7 kB view details)

Uploaded Jun 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

loveslide-0.0.1-py3-none-any.whl (32.6 kB view details)

Uploaded Jun 10, 2025 Python 3

File details

Details for the file loveslide-0.0.1.tar.gz.

File metadata

Download URL: loveslide-0.0.1.tar.gz
Upload date: Jun 10, 2025
Size: 30.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.21

File hashes

Hashes for loveslide-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`42b144132582cfce69e349f38e358ecfd047ce28ad2a3480a9fb0ead2523ffe6`
MD5	`1a1a92d12389cbd93add50744c290ccc`
BLAKE2b-256	`ff556ac01a23a78a1ba52b5b5b7e7cd0a6cb9ec96fbd6de7db5109ff3f28294e`

See more details on using hashes here.

File details

Details for the file loveslide-0.0.1-py3-none-any.whl.

File metadata

Download URL: loveslide-0.0.1-py3-none-any.whl
Upload date: Jun 10, 2025
Size: 32.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.21

File hashes

Hashes for loveslide-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5b2a93e101c6720fb7d3a3006a66fca656a221f28f08d4208f77cb69cb0ceb91`
MD5	`7daf1691ef2089e939a610d9e7eb0272`
BLAKE2b-256	`12882a32d486b706109109d696fbed1d6b96cf1b7fdf235d83f4ac4c571fbfba`

See more details on using hashes here.

loveslide 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SLIDE_py

Overview

Quick Start

Basic Usage

From command line

In a notebook

Pipeline Overview

Stage 1: Latent Factor Discovery

Stage 2: Statistical Inference with SLIDE

Stage 3: Visualization

Parameter Configuration

Advanced Configuration

Project Structure

Implementation Details

LOVE Algorithm Integration

To-do list

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes