Skip to main content

Automated RNA-seq analysis pipeline from SRA to differential expression and interactive reporting.

Project description

🧬 Pranomics

Pranomics is a fully automated RNA-seq analysis pipeline that takes raw sequencing data (SRA/FASTQ) all the way to differential expression analysis and publication-ready reports — in a single command.

It integrates widely used bioinformatics tools like FastQC, Bowtie2, Samtools, StringTie, and edgeR, wrapped in an easy-to-use Python CLI.


🚀 Features

  • 🔄 One-command RNA-seq pipeline
  • 📥 Automatic SRA → FASTQ conversion
  • 🔬 Quality control (FastQC)
  • ✂️ Read trimming
  • 🧬 Alignment using Bowtie2
  • 📊 BAM processing & sorting
  • 📈 Gene/transcript quantification (StringTie)
  • 📉 Differential expression (edgeR)
  • 📊 PCA, Heatmap & Volcano plots
  • 🌐 Interactive HTML report
  • ♻️ Resume support using checkpoints
  • ⚙️ Auto tool installation (conda/apt/R)

⚡ Pipeline Overview

SRA → FASTQ → QC → Trimming → Alignment → BAM Processing → Counting → DEG → Visualization → Report


📦 Installation

🥇 Recommended (Conda)

git clone https://github.com/yourname/pranomics.git
cd pranomics

conda env create -f environment.yml
conda activate pranomics

pip install .
🧪 Developer Mode
pip install -e .
🐳 Docker (Optional)
docker build -t pranomics .
docker run -v $PWD:/data pranomics run
▶️ Usage
🧬 Initialize project
pranomics init

Creates required folders:

data/
metadata/
reference/
counts/
DEG/
report/
🚀 Run full pipeline (auto mode)
pranomics run

Automatically:

Detects samples
Validates metadata
Runs full pipeline
Generates report
⚙️ Manual pipeline
pranomics pipeline \
  --samples data \
  --metadata metadata/metadata.csv \
  --reference-dir reference \
  --threads 4
📊 Differential Expression only
pranomics deg \
  --counts counts/gene_count_matrix.csv \
  --metadata metadata/metadata.csv \
  --control Control \
  --treatment Treatment
📊 Outputs

After execution:

🧬 DEG results
DEG/
 ├── DE_results.csv
 ├── upregulated.csv
 ├── downregulated.csv
 ├── summary.txt
📈 Visualizations
PCA plot
Heatmap
Volcano plot (interactive + static)
🌐 Report
report/pranomics_Report.html
🧠 Requirements
System tools (auto-installed if missing)
FastQC
Bowtie2
Samtools
StringTie
SRA-tools
Java
R + edgeR
Python
pandas
numpy
plotly
rich
🧬 Example Metadata
sample,condition
SRR5967160,Control
SRR5967162,Control
SRR5967168,Treatment
📌 Notes
Designed for Linux (Ubuntu/WSL recommended)
Works best with conda environment
Supports resume via checkpoint system
Large datasets recommended on SSD
⚠️ Disclaimer

This tool wraps multiple third-party bioinformatics tools. Ensure proper installation and citation of:

FastQC
Bowtie2
Samtools
StringTie
edgeR (Bioconductor)
📜 License

MIT License © 2026 Gourav Bairagi

See LICENSE file for details.

👨‍💻 Author

Gourav Bairagi
RNA-seq Pipeline Developer

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pranomics-0.1.4.tar.gz (32.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pranomics-0.1.4-py3-none-any.whl (41.9 kB view details)

Uploaded Python 3

File details

Details for the file pranomics-0.1.4.tar.gz.

File metadata

  • Download URL: pranomics-0.1.4.tar.gz
  • Upload date:
  • Size: 32.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pranomics-0.1.4.tar.gz
Algorithm Hash digest
SHA256 0001adfed0157d3088e14a8366c0defe0292896cf3137fb59ce625329190deb6
MD5 09fcbcfa51974304d0b8a1bc6f1a9eed
BLAKE2b-256 36337bbb79e6986e45fc2496b0275971e8b6ac45c35b803047e62159ecfb7eac

See more details on using hashes here.

File details

Details for the file pranomics-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: pranomics-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 41.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pranomics-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 b7c26817a6097937aa728b06600e87c34fdd7d83509f1905b28593aca344633c
MD5 e8ff55251e618f2c5161d10a968cc536
BLAKE2b-256 dbc4239e75420552c707ad35849683adc782c511e3b291dd5ecb152d9afa5723

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page