secuer

Secuer: ultrafast, scalable and accurate clustering of single-cell RNA-seq data

Project description

# Secuer: ultrafast, scalable and accurate clustering of single-cell RNA-seq data

Secuer is a superfast and scalable clustering algorithm for (ultra-)large scRNA-seq data analysis based on spectral clustering. Secuer-consensus is a consensus clustering algorithm with Secuer as a subroutine. In addition, Secuer can also be applied to other large-scale omics data with two-dimensional (features by observations). For more details see [secuer](https://arxiv.org/abs/2205.12432v2).

The workflow of Secuer:

## Installation

Secuer is available in [python](https://www.python.org).

`python pip install secuer `

## Run Seucer (usage)

#### Essential parameters

To run Secuer with default parameters, you only need to specify:

-i INPUTFILE

scRNA-seq data (cells by genes) file for clustering.

–yaml

The parameters of data preprocessing. see [config.yaml](https://github.com/nanawei11/Secuer/blob/main/config.yaml) for more details.

#### options You can also specify the following options:

-p

The number of anchors, default by 1000.
-o

Output file directory and file name, default by output.
–knn

The number of k nearest neighbors anchors, default by 7.
–distance

The metrics measuring the dissimilarity between cells or anchors, default by euclidean.
–transpose

Require it if your data is a .csv, .txt or tsv file with features by observations.
–eskMethod

Specify the method used for estimated the number of cluster, default by subGraph.

–eskResolution

Specify the resolution when –eskMethod is subGraph, default by 0.8.
–gapth

Specify the gapth largest value when –eskMethod is not subGraph.

Example for run Secuer with custom parameters:

`sh $ Secuer S -i ./example_data/Biase_k3_FPKM_scRNA --yaml ./config.yaml -o ./Biase_result -p 1000 --knn 5 --transpose `

## Output files

output/SecuerResult.txt is the clustering result.
output/SecuerResult.h5ad is the preprocessed data with the clustering result.

## Run Seucer-consensus (usage)

#### Essential parameters

To run Secuer-consensus with default parameters, you only need to specify:

-i

two-dimensional data (observations by features) file for clustering.

–yaml

The parameters of data preprocessing. see [config.yaml](https://github.com/nanawei11/Secuer/blob/main/config.yaml) for more details.

#### options You can also specify the following options:

-p

The number of anchors, default by 1000.
-o

Output file directory and file name, default by outputCon.

-M

The times to run secuer.
--knn

The number of k nearest neighbors anchors, default by 7.

–transpose Require it if your data is a .csv, .txt or tsv file with genes by cells, default by False.

Example for run Secuer-consensus: `sh $ Secuer C -i ./example_data/Biase_k3_FPKM_scRNA --yaml ./config.yaml -o ./Biase_conresult -p 900 --knn 5 -M 7 --transpose `

## Output files

output/SecuerConsensusResult.txt is the clustering result.
output/SecuerConsensusResult.h5ad is the preprocessed data with the clustering result.

## Citation

Project details

Release history Release notifications | RSS feed

1.1

Mar 24, 2023

1.0.11

Nov 4, 2022

This version

1.0.10

Nov 4, 2022

1.0.9

Nov 4, 2022

1.0.7

Jul 23, 2022

1.0.6

Jul 18, 2022

1.0.5

Jul 18, 2022

1.0.2

Jul 17, 2022

1.0.1

Jul 17, 2022

1.0

Jul 17, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secuer-1.0.10.tar.gz (20.5 MB view hashes)

Uploaded Nov 4, 2022 Source

Built Distribution

secuer-1.0.10-py3-none-any.whl (15.9 kB view hashes)

Uploaded Nov 4, 2022 Python 3

Hashes for secuer-1.0.10.tar.gz

Hashes for secuer-1.0.10.tar.gz
Algorithm	Hash digest
SHA256	`5879fa68d5076da0042841470c2bd79a62b5ecb04ab84262c3ecb036d2154a57`
MD5	`3e0925d5e358a3ea8e7c153c4ab71bc8`
BLAKE2b-256	`faa8cf789f0fc1bd14c454d6eac8e1ea0e3db64342829c290385762b98e01959`

Hashes for secuer-1.0.10-py3-none-any.whl

Hashes for secuer-1.0.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6b733d9c5955c6ef2a36a9858dddd22192f0d7b7504922d8ac272e423543f9bb`
MD5	`e411b637e1a3fc5319e97f213ca1c332`
BLAKE2b-256	`b985e1c53e4a426327d7420dc536b9aebc358a301f027f8b1d6f4ec92cef3e77`