Skip to main content

Not your grandma's NGS analysis - software for analyzing FASTQs from SRSLY libraries

Project description

ClaretBio's SRSLY Library processing software

This software is for the basic informatic processing of sequencing data generated using ClaretBio's SRSLY library prep kit with or without using UMIs.

Installation

This sofware can be installed as a python package using the command pip install srslyrun

Usage

The basic analysis can be run with srsly runsamples for standard libraries or srsly runsamples --umi for libraries with unique molecular identifiers (UMIs). This software takes in raw fastqs and trims adapters, aligns to a user-specified reference genome, and marks duplicates. For UMI aware demultiplexing of SRSLY libraries please use our SRSLYumi python package (more info at https://github.com/claretbio/SRSLYumi).

In order run, this software requires an installation of conda- installation instructions can be found here.

Required Arguments

`--fastqdir` : the directory containing the raw fastqs you wish to process (defaults to current working directory)

`--resultsdir` : the directory you would like the output to be in (defaults to current working directory)

`--reference` : a path to the reference genome you wish to align to (no default, this must be provided, and must be a file ending in `.fasta` or `.fa`)

`--libraries` or `--libfile`: the library IDs you would like analyzed in comma separated format and in a file with one id per line, respectively

The library IDs provided should match the beginning of the fastq files, for example the library ID for fastq files named lib1_R1.fastq.gz and lib1_R2.fastq.gz would be lib1. This can be provided directly on the command line with a comma separated list like --libraries lib1,lib2 or as a file that lists one library ID per line with --libfile libfile.txt.

an example command:

srsly runsamples --fastqdir /home/user/fastqfiles --resultsdir /home/user/srslyanalysis --reference /home/user/genomes/hg19.fa --libraries lib1,lib2,lib3

For reproducibility's sake and to ensure appropriate versions we use snakemake wrappers for many of the tools in this pipeline, which are often slow to create the first time they are used. As a result, your first time running the software may take a long time - don't worry, this is totally normal!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

srslyrun-0.1.0.tar.gz (12.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

srslyrun-0.1.0-py3-none-any.whl (12.0 kB view details)

Uploaded Python 3

File details

Details for the file srslyrun-0.1.0.tar.gz.

File metadata

  • Download URL: srslyrun-0.1.0.tar.gz
  • Upload date:
  • Size: 12.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.10

File hashes

Hashes for srslyrun-0.1.0.tar.gz
Algorithm Hash digest
SHA256 812d5f826cc9fa7322f11cf70102111b9440b2e1a3548c156beb8484ea6c4f8f
MD5 d602236e21746cd8360fe9de992328f4
BLAKE2b-256 d37cb704fd3c449c221bb7cb4c37895ada13feb2ae9fa96c51d9139d455a9209

See more details on using hashes here.

File details

Details for the file srslyrun-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: srslyrun-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 12.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.10

File hashes

Hashes for srslyrun-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e80222c9ca2f73ac76d1f1e669d4ca4fea630d438af35bd2bc0dac6d8b97ed90
MD5 56ab5dba9eb01c490a622f1740e35df4
BLAKE2b-256 6a1d4239f0e647b14a342c0b8eb27248e019bcb7ccfef30fe73a475e0fea1e11

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page