Skip to main content

A python command line interface to demultiplex illumina reads.

Project description

Demultiplexer2

logo

Introduction

Demultiplexer2 is a Python package designed to efficiently demultiplex paired-end Illumina sequencing reads by identifying and sorting inline tags, enabling streamlined downstream analysis.

Installation

You can install demultiplexer2 using pip:

pip install demultiplexer2

When updates are released, users will be notified and can upgrade to the latest version with:

pip install --upgrade demultiplexer2

Usage

A primerset is a configuration file that stores details about the primers and tags used for a specific dataset. It is automatically saved in the /demultiplexer2/data directory for future use.

Step 1: Create a Primer Set

To create a new primerset, use the following command:

demultiplexer2 create_primerset --name NameOfPrimerset --n_primers NumberOfPrimers
  • --name: Specifies the name of the primerset (e.g., fwh2F2-fwhR2n).
  • --n_primers: Indicates the number of primers in your dataset.

The primerset is an Excel file that contains critical information organized into three sheets:

  1. General Information: Stores details about the primers used for amplification.
  2. Forward Tags: Contains the names and sequences of the tags associated with the forward primers.
  3. Reverse Tags: Contains the names and sequences of the tags associated with the reverse primers.

Fill out the primerset file before continuing with the next step.

Step 2: Create a Tagging scheme

To create a new tagging scheme, use the following command:

demultiplexer2 create_tagging_scheme --name NameOfTaggingScheme --data_dir InputDirectory --primerset_path PathToPrimerset
  • --name: Specifies the name of the tagging scheme (e.g., MyFirstStudy).
  • --data_dir: Specifies the directory with all files you want to demultiplex (gzipped fastq files).
  • --primerset_path: Specifies the path to the primerset you want to use to demultiplex this dataset.

The tagging scheme is an Excel file that links your input files to sample names after demultiplexing. It will be save to the current working directory. The sample names have to be added in the tagging scheme. Fill out the tagging scheme before continuing with the next step.

Step 3: Demultiplex

To run the demultiplexing algorithm use this command:

demultiplexer2 demultiplex --primerset_path PathToPrimerset --tagging_scheme_path PathToTaggingScheme --output_dir OutputDirectory
  • --primerset_path: Specifies the path to the primerset you want to use to demultiplex this dataset.
  • --tagging_scheme_path: Specifies the path to the tagging scheme you want to use to demultiplex this dataset.
  • --output_dir: Specifies the output directory to write to.

Given this information, demultiplexer2 will demultiplex your input to your output directory and give some statistics about how many reads could be assigned to tags. Unmatched reads will be directly discarded. The output will be gzipped fastq files.

08:58:58: TEST_001_r1.fastq.gz - TEST_001_r2.fastq.gz: 16865 of 100000 sequences matched the provided tag sequences (16.86 %)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

demultiplexer2-1.1.5.tar.gz (23.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

demultiplexer2-1.1.5-py3-none-any.whl (22.9 kB view details)

Uploaded Python 3

File details

Details for the file demultiplexer2-1.1.5.tar.gz.

File metadata

  • Download URL: demultiplexer2-1.1.5.tar.gz
  • Upload date:
  • Size: 23.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for demultiplexer2-1.1.5.tar.gz
Algorithm Hash digest
SHA256 af0a0300fd5962b3625c888d5586a0d8a6c786fa169c76c385a14d393314b26b
MD5 5e7c70c5a3a9c0b5be755eef241e84e9
BLAKE2b-256 32893b0b8a7e64ddea88eccfeb5c17c259ef842068ab8e208b32708c7ec4aa2b

See more details on using hashes here.

File details

Details for the file demultiplexer2-1.1.5-py3-none-any.whl.

File metadata

  • Download URL: demultiplexer2-1.1.5-py3-none-any.whl
  • Upload date:
  • Size: 22.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for demultiplexer2-1.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 05945d70989a6a9d4474971b86316b28142eedf9ff88d38b173ab0a7c0818f82
MD5 7acbf00d54665fe9fbc8330cb86aa98e
BLAKE2b-256 3a11d0f03b749f5e9318f2d925ee04d700005ab634900bdee593c38b86c6f4b0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page