Skip to main content

A utility to split multiple sequence files using multiple sets of barcodes

Project description

A utility to split multiple sequence files using multiple sets of barcodes.

Copyright 2017 Lance Parsons <lparsons@princeton.edu>, Robert Leach <rleach@princeton.edu>

BSD 2-Clause License - See LICENSE.txt

Installation

  1. Install Barcode Splitter:

    pip install barcode_splitter

barcode_splitter

Split multiple fastq files by matching barcodes in one or more of the sequence files. Barcodes in the tab-delimited barcodes.txt file are matched against the beginning (or end) of the specified index read(s). By default, barcodes must match exactly, but --mistmatches can be set higher if desired. Compressed input is read (from all files) if the first input file name ends in .gz. Reading of compressed input can be forced with the gzipin option.

Examples

Split an Illumina paired-end run where the index read are in the second read file (read 2), the forward read is the first read file (read 1), and the reverse reads are in the third read file (read 3):

barcode_splitter --bcfile barcodes.txt read1.fastq read2_index.fastq read3.fastq --idxread 2 --suffix .fastq

UTF-8

Sample names containing UTF-8 characters are allowed, however, outputting those characters to STDOUT and piping to a file can be problematic. Ensure python uses the proper encoding for STDOUT by setting PYTHONIOENCODING='utf-8'.:

PYTHONIOENCODING='utf-8' barcode_splitter --bcfile barcodes_utf8.txt read1.fastq read2_index.fastq read3.fastq --idxread 2 --suffix .fastq

Citation

Please use the following BibTeX entry:

@misc{leach_bcs_2019,
    address = {Princeton, {NJ}, {USA}},
    title = {Barcode Splitter, version 0.18.5 [Software]},
    url = {https://bitbucket.org/princeton_genomics/barcode_splitter},
    author = {Leach, Robert and
              Parsons, Lance},
    year = {2019}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

barcode_splitter-0.18.6-py2.py3-none-any.whl (15.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file barcode_splitter-0.18.6-py2.py3-none-any.whl.

File metadata

  • Download URL: barcode_splitter-0.18.6-py2.py3-none-any.whl
  • Upload date:
  • Size: 15.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.19.1 setuptools/40.0.0 requests-toolbelt/0.8.0 tqdm/4.31.1 CPython/2.7.15

File hashes

Hashes for barcode_splitter-0.18.6-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 4d573cf5bcf0c9c5d46b6ba1704ad0d5408b33adeeacddc42d7a7b928610911d
MD5 e0c69c65bdc7905e09d8ebf4a2fc0bbd
BLAKE2b-256 88eb393381d43fe2a47b6b2d798f553754b96071988accbdebb4c622fd369ee3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page