Skip to main content

Automated RNA-seq analysis pipeline from SRA to differential expression and interactive reporting.

Project description

🧬 Pranomics

Pranomics is a fully automated RNA-seq analysis pipeline that takes raw sequencing data (SRA/FASTQ) all the way to differential expression analysis and publication-ready reports — in a single command.

It integrates widely used bioinformatics tools like FastQC, Bowtie2, Samtools, StringTie, and edgeR, wrapped in an easy-to-use Python CLI.


🚀 Features

  • 🔄 One-command RNA-seq pipeline
  • 📥 Automatic SRA → FASTQ conversion
  • 🔬 Quality control (FastQC)
  • ✂️ Read trimming
  • 🧬 Alignment using Bowtie2
  • 📊 BAM processing & sorting
  • 📈 Gene/transcript quantification (StringTie)
  • 📉 Differential expression (edgeR)
  • 📊 PCA, Heatmap & Volcano plots
  • 🌐 Interactive HTML report
  • ♻️ Resume support using checkpoints
  • ⚙️ Auto tool installation (conda/apt/R)

⚡ Pipeline Overview

SRA → FASTQ → QC → Trimming → Alignment → BAM Processing → Counting → DEG → Visualization → Report


📦 Installation

🥇 Recommended (Conda)

git clone https://github.com/yourname/pranomics.git
cd pranomics

conda env create -f environment.yml
conda activate pranomics

pip install .
🧪 Developer Mode
pip install -e .
🐳 Docker (Optional)
docker build -t pranomics .
docker run -v $PWD:/data pranomics run
▶️ Usage
🧬 Initialize project
pranomics init

Creates required folders:

data/
metadata/
reference/
counts/
DEG/
report/
🚀 Run full pipeline (auto mode)
pranomics run

Automatically:

Detects samples
Validates metadata
Runs full pipeline
Generates report
⚙️ Manual pipeline
pranomics pipeline \
  --samples data \
  --metadata metadata/metadata.csv \
  --reference-dir reference \
  --threads 4
📊 Differential Expression only
pranomics deg \
  --counts counts/gene_count_matrix.csv \
  --metadata metadata/metadata.csv \
  --control Control \
  --treatment Treatment
📊 Outputs

After execution:

🧬 DEG results
DEG/
 ├── DE_results.csv
 ├── upregulated.csv
 ├── downregulated.csv
 ├── summary.txt
📈 Visualizations
PCA plot
Heatmap
Volcano plot (interactive + static)
🌐 Report
report/pranomics_Report.html
🧠 Requirements
System tools (auto-installed if missing)
FastQC
Bowtie2
Samtools
StringTie
SRA-tools
Java
R + edgeR
Python
pandas
numpy
plotly
rich
🧬 Example Metadata
sample,condition
SRR5967160,Control
SRR5967162,Control
SRR5967168,Treatment
📌 Notes
Designed for Linux (Ubuntu/WSL recommended)
Works best with conda environment
Supports resume via checkpoint system
Large datasets recommended on SSD
⚠️ Disclaimer

This tool wraps multiple third-party bioinformatics tools. Ensure proper installation and citation of:

FastQC
Bowtie2
Samtools
StringTie
edgeR (Bioconductor)
📜 License

MIT License © 2026 Gourav Bairagi

See LICENSE file for details.

👨‍💻 Author

Gourav Bairagi
RNA-seq Pipeline Developer

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pranomics-0.1.0.tar.gz (27.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pranomics-0.1.0-py3-none-any.whl (35.3 kB view details)

Uploaded Python 3

File details

Details for the file pranomics-0.1.0.tar.gz.

File metadata

  • Download URL: pranomics-0.1.0.tar.gz
  • Upload date:
  • Size: 27.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pranomics-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ea79d058cc3eca190a56a8a63240fffe6c9a8cd6e05f7bf0eb230b04196f7267
MD5 19d9b3aa1e39bd71285cde6150d692a6
BLAKE2b-256 6487aadaadcd22ab858e13f96ff017393fa5dff1981f25a2f6f47584cc2f8c3b

See more details on using hashes here.

File details

Details for the file pranomics-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: pranomics-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 35.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pranomics-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1d3402a685bcd4fa2f230d20783cc84a0618177ec847040707f9f3324deac0d6
MD5 56e732a37310cbc9e7250d8689379647
BLAKE2b-256 12624793449aaf38bbad23caf0938bfede046bba73773159e36e708095445f40

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page