Skip to main content

Analyze haplotypes from Illumina paired-end amplicon sequencing

Project description

CloneArmy

CloneArmy is a modern Python package for analyzing haplotypes from Illumina paired-end amplicon sequencing data. It provides a streamlined workflow for processing FASTQ files, aligning reads, and identifying sequence variants.

Features

  • Fast paired-end read processing using BWA-MEM
  • Quality-based filtering of bases and alignments
  • Haplotype identification and frequency analysis
  • Rich command-line interface with progress tracking
  • Comprehensive output reports
  • Multi-threading support

Installation

pip install cloneArmy

Requirements

  • Python ≥ 3.8
  • BWA (must be installed and available in PATH)
  • Samtools (must be installed and available in PATH)

Usage

Command Line Interface

# Basic usage
cloneArmy /path/to/fastq/directory reference.fasta

# With all options
cloneArmy /path/to/fastq/directory reference.fasta \
    --threads 8 \
    --output results \
    --min-base-quality 20 \
    --min-mapping-quality 30

Python API

from pathlib import Path
from clone_army.processor import AmpliconProcessor

# Initialize processor
processor = AmpliconProcessor(
    reference_path="reference.fasta",
    min_base_quality=20,
    min_mapping_quality=30
)

# Process a single sample
results = processor.process_sample(
    fastq_r1="sample_R1.fastq.gz",
    fastq_r2="sample_R2.fastq.gz",
    output_dir="results",
    threads=4
)

# Results are returned as a pandas DataFrame
print(results)

Output

For each sample, CloneArmy generates:

  • A sorted BAM file with alignments
  • A CSV file containing haplotype information:
    • Sequence
    • Read count
    • Frequency
    • Number of mutations
  • Console output

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clonearmy-0.1.2.tar.gz (14.7 kB view details)

Uploaded Source

File details

Details for the file clonearmy-0.1.2.tar.gz.

File metadata

  • Download URL: clonearmy-0.1.2.tar.gz
  • Upload date:
  • Size: 14.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.2

File hashes

Hashes for clonearmy-0.1.2.tar.gz
Algorithm Hash digest
SHA256 e1d82ae2f072c5fcdd2e9532a9541b9b154e0e11db4166501f01304d4f7d1928
MD5 c84fb2dae60b3ce20872a3b1bb136715
BLAKE2b-256 72ed93703e857dfd3a4914adba244d34e12fd012477258e8d4f8d4d5536df6fe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page