Skip to main content

cellsnake

Project description

cellsnake

A command line tool for easy and scalable single cell analysis

Installation

Method for installing directly from the GitHub repo:

git clone https://github.com/sinanugur/cellsnake.git
cd cellsnake
pip install .

Check if the installation works by calling the main script:

cellsnake --help

then install and check if all the R packages are installed by typing:

cellsnake --install-packages

You should see this message if all the packages are available:
[1] "All packages were installed...OK"

Note: You have to install dependencies if you prefer Github installation. We strongly recommend Bioconda and Mamba.

--install-packages argument must be always called to check if the R packages are available and installed correctly.

Quick start examples

Run cellsnake in a clean directory and cellsnake will create the required directories while running. You may download publicly available fetal brain dataset to test your cellsnake installation. The link is here.

After downloading the dataset, just point the data folder which contains the two dataset folders, this will trigger a standard cellsnake workflow:

cellsnake data

After the pipeline finishes, you may also integrate these two samples:

cellsnake data --option integration

To determine a manual resolution parameter, you can also create only a ClusTree:

cellsnake data --option integration --option clustree

Let's say you want a resolution of 0.1, then you can trigger a run with this resolution:

cellsnake data --option integration --resolution 0.1

It is also possible to use automatic resolution selection, however this might be very slow in large datasets:

cellsnake data --option integration --resolution auto

See our documentation for detailed explanations and to read full features: https://cellsnake.readthedocs.io/

Options and Arguments

Usage:
    cellsnake <INPUT> [--resolution <text>] [--percent_mt <text>] [--configfile <text>] [--gene <text>] [--jobs <integer>] [--option <text>]... [--release-the-kraken <text>] [--taxa <text>] [--unlock|--remove] [--dry]
    cellsnake <INPUT> [--unlock|--remove] [--dry]
    cellsnake --generate-template
    cellsnake --install-packages
    cellsnake (-h | --help)
    cellsnake --version
    cellsnake --init

Arguments:
    INPUT                                   Input directory or a file to process (if a directory given, batch mode is ON).
    -c <text>, --configfile <text>          Config file name (if not supplied, it will use default settings, you may generate a template, change it and use it in your runs).
    --resolution <text>                     Resolution for cluster detection, write "auto" for auto detection [default: 0.8].
    --percent_mt <text>                     Maximum mitochondrial gene percentage cutoff, for example, 5 or 10, write "auto" for auto detection [default: 10].
    --gene <text>                           Create publication ready plots for a gene or a list of genes from a text file.
    --option <text>                         cellsnake run options: "minimal", "standard", "clustree", "advanced" [default: standard]. "integration" is to integrate and run on integrated samples.
    --release-the-kraken <text>             Kraken database folder.
    --taxa <text>                           Microbiome taxonomic level collapse to "domain", "kingdom", "phylum", "class", "order", "family", "genus", "species" [default: genus]
    -j <integer>, --jobs <integer>          Total CPUs. [default: 2]

Options:
    --generate-template                Generate config file template in the current directory.
    --install-packages                 Install, reinstall or check required R packages.
    -u, --unlock                       Rescue stalled jobs (Try this if the previous job ended prematurely or currently failing).
    -r, --remove                       Delete all output files (this won't affect input files).
    -d, --dry                          Dry run, nothing will be generated.
    -h, --help                         Show this screen.
    --version                          Show version.

Output

The cellsnake main executable will generate two main folders: analyses and results.
If an integrated dataset available, analyses_integrated and results_integrated.

Logs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cellsnake-0.2.0.dev7.tar.gz (1.4 MB view details)

Uploaded Source

File details

Details for the file cellsnake-0.2.0.dev7.tar.gz.

File metadata

  • Download URL: cellsnake-0.2.0.dev7.tar.gz
  • Upload date:
  • Size: 1.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for cellsnake-0.2.0.dev7.tar.gz
Algorithm Hash digest
SHA256 a39201cec1df95fdccc00b13fe1e59026d2147f1d0ade5c07f96a853490d07d0
MD5 06a08ba878920995ae902b3989141dd7
BLAKE2b-256 2cc0111fca7ab223c43563aaccbfb800f7cb648310763af9710b78cbd8e06501

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page