Automated RNA-seq analysis pipeline from SRA to differential expression and interactive reporting.

🧬 Pranomics

Pranomics is a fully automated RNA-seq analysis pipeline that takes raw sequencing data (SRA/FASTQ) through to differential expression analysis and publication-ready reports in a single command.

It integrates widely used bioinformatics tools such as FastQC, Bowtie2, Samtools, StringTie, and edgeR, wrapped in an easy-to-use Python CLI.


🚀 Features

  • 🔄 One-command RNA-seq pipeline
  • 📥 Automatic SRA → FASTQ conversion
  • 🔬 Quality control (FastQC)
  • ✂️ Read trimming
  • 🧬 Alignment using Bowtie2
  • 📊 BAM processing & sorting
  • 📈 Gene/transcript quantification (StringTie)
  • 📉 Differential expression (edgeR)
  • 📊 PCA, Heatmap & Volcano plots
  • 🌐 Interactive HTML report
  • ♻️ Resume support using checkpoints
  • ⚙️ Auto tool installation (conda/apt/R)

⚡ Pipeline Overview

SRA → FASTQ → QC → Trimming → Alignment → BAM Processing → Counting → DEG → Visualization → Report
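
The stage order above, combined with checkpoint-based resume, can be sketched roughly as follows. This is a minimal illustration, not Pranomics' actual implementation: the stage names, the `run_with_checkpoints` helper, and the JSON checkpoint file are all hypothetical.

```python
import json
from pathlib import Path
from typing import Callable, Dict, List

# Stage names mirror the overview above; the identifiers are hypothetical.
STAGES: List[str] = [
    "fastq", "qc", "trimming", "alignment", "bam_processing",
    "counting", "deg", "visualization", "report",
]


def run_with_checkpoints(
    steps: Dict[str, Callable[[], None]],
    checkpoint_file: Path,
) -> List[str]:
    """Run stages in order, skipping those already recorded as done.

    Returns the names of the stages executed during this call.
    """
    done = set()
    if checkpoint_file.exists():
        done = set(json.loads(checkpoint_file.read_text()))

    executed: List[str] = []
    for name in STAGES:
        if name in done or name not in steps:
            continue
        steps[name]()  # an exception here leaves earlier checkpoints intact
        done.add(name)
        # Persist after every stage so an interrupted run can resume.
        checkpoint_file.write_text(json.dumps(sorted(done)))
        executed.append(name)
    return executed
```

Writing the checkpoint after each stage, rather than once at the end, is what lets a second call pick up at the first unfinished stage.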


📦 Installation

🥇 Recommended (Conda)

git clone https://github.com/yourname/pranomics.git
cd pranomics

conda env create -f environment.yml
conda activate pranomics

pip install .

🧪 Developer Mode

pip install -e .

🐳 Docker (Optional)

docker build -t pranomics .
docker run -v "$PWD":/data pranomics run

▶️ Usage

🧬 Initialize project

pranomics init

Creates required folders:

data/
metadata/
reference/
counts/
DEG/
report/
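
For readers who want to see what this step amounts to, the folder layout above can be reproduced with a few lines of Python. This is only a sketch of the listed layout: `init_project` is a hypothetical helper, and the real `pranomics init` may do more (for example, writing template files).

```python
from pathlib import Path
from typing import List

# Folder names are taken from the list above.
PROJECT_DIRS = ["data", "metadata", "reference", "counts", "DEG", "report"]


def init_project(root: Path) -> List[Path]:
    """Create the project folders under `root`; existing folders are left alone."""
    created = []
    for name in PROJECT_DIRS:
        path = root / name
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Because `exist_ok=True` is set, calling the helper twice on the same directory is harmless.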

🚀 Run full pipeline (auto mode)

pranomics run

Automatically:

  • Detects samples
  • Validates metadata
  • Runs full pipeline
  • Generates report

⚙️ Manual pipeline

pranomics pipeline \
  --samples data \
  --metadata metadata/metadata.csv \
  --reference-dir reference \
  --threads 4
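
When several datasets need the same settings, the command above can be driven from Python. The sketch below only assembles the flags shown in this section; `pipeline_command` and `run_pipeline` are hypothetical helper names, and it assumes `pranomics` is on the `PATH`.

```python
import subprocess
from typing import List


def pipeline_command(
    samples: str, metadata: str, reference_dir: str, threads: int
) -> List[str]:
    """Build the argument list for `pranomics pipeline` using the flags shown above."""
    return [
        "pranomics", "pipeline",
        "--samples", samples,
        "--metadata", metadata,
        "--reference-dir", reference_dir,
        "--threads", str(threads),
    ]


def run_pipeline(samples: str, metadata: str, reference_dir: str, threads: int = 4) -> int:
    """Run the pipeline and return its exit code; raises CalledProcessError on failure."""
    cmd = pipeline_command(samples, metadata, reference_dir, threads)
    return subprocess.run(cmd, check=True).returncode
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.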

📊 Differential Expression only

pranomics deg \
  --counts counts/gene_count_matrix.csv \
  --metadata metadata/metadata.csv \
  --control Control \
  --treatment Treatment

📊 Outputs

After execution:

🧬 DEG results

DEG/
 ├── DE_results.csv
 ├── upregulated.csv
 ├── downregulated.csv
 └── summary.txt
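
The split into up- and downregulated files can be reproduced or re-thresholded from the full results table. The sketch below is an assumption-heavy illustration: it assumes `DE_results.csv` has `logFC` and `FDR` columns (the names edgeR's `topTags` uses), and the cutoffs of |logFC| ≥ 1 and FDR < 0.05 are common defaults, not values confirmed for Pranomics. `split_deg` is a hypothetical helper.

```python
import csv
import io
from typing import Dict, List, Tuple

Row = Dict[str, str]


def split_deg(
    csv_text: str, lfc_cutoff: float = 1.0, fdr_cutoff: float = 0.05
) -> Tuple[List[Row], List[Row]]:
    """Split DE results into (upregulated, downregulated) rows.

    Assumes 'logFC' and 'FDR' columns; both cutoffs are illustrative defaults.
    """
    up: List[Row] = []
    down: List[Row] = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        if float(row["FDR"]) >= fdr_cutoff:
            continue  # not significant at the chosen FDR
        lfc = float(row["logFC"])
        if lfc >= lfc_cutoff:
            up.append(row)
        elif lfc <= -lfc_cutoff:
            down.append(row)
    return up, down
```

Genes that pass the FDR cutoff but have a small fold change end up in neither list.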

📈 Visualizations

  • PCA plot
  • Heatmap
  • Volcano plot (interactive + static)

🌐 Report

report/pranomics_Report.html

🧠 Requirements

System tools (auto-installed if missing)

  • FastQC
  • Bowtie2
  • Samtools
  • StringTie
  • SRA-tools
  • Java
  • R + edgeR

Python packages

  • pandas
  • numpy
  • plotly
  • rich

🧬 Example Metadata

sample,condition
SRR5967160,Control
SRR5967162,Control
SRR5967168,Treatment
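
A metadata file in this layout can be sanity-checked before a run. The sketch below is not Pranomics' own validation logic: `validate_metadata` and `unreplicated` are hypothetical helpers that check for the two columns shown above, reject duplicate sample IDs, and flag conditions with a single sample. edgeR generally needs replicates to estimate dispersion, so a group like `Treatment` in the example above is worth a warning.

```python
import csv
import io
from typing import Dict, List


def validate_metadata(csv_text: str, control: str, treatment: str) -> Dict[str, List[str]]:
    """Group sample IDs by condition and check the basic layout."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames is None or not {"sample", "condition"} <= set(reader.fieldnames):
        raise ValueError("metadata needs 'sample' and 'condition' columns")

    groups: Dict[str, List[str]] = {}
    seen = set()
    for row in reader:
        sample = row["sample"].strip()
        condition = row["condition"].strip()
        if sample in seen:
            raise ValueError(f"duplicate sample: {sample}")
        seen.add(sample)
        groups.setdefault(condition, []).append(sample)

    # Both labels passed to the DEG step must appear in the file.
    for label in (control, treatment):
        if label not in groups:
            raise ValueError(f"no samples with condition {label!r}")
    return groups


def unreplicated(groups: Dict[str, List[str]]) -> List[str]:
    """Return conditions that have fewer than two samples."""
    return sorted(c for c, samples in groups.items() if len(samples) < 2)
```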

📌 Notes

  • Designed for Linux (Ubuntu/WSL recommended)
  • Works best in a conda environment
  • Supports resuming interrupted runs via a checkpoint system
  • For large datasets, running on an SSD is recommended

⚠️ Disclaimer

This tool wraps multiple third-party bioinformatics tools. Ensure proper installation and citation of:

  • FastQC
  • Bowtie2
  • Samtools
  • StringTie
  • edgeR (Bioconductor)

📜 License

MIT License © 2026 Gourav Bairagi

See LICENSE file for details.

👨‍💻 Author

Gourav Bairagi
RNA-seq Pipeline Developer
