Skip to main content

A lightweight tool to inspect GEO series matrices online before downloading.

Project description

OmicsCheck: Intelligent Quality Assessment for Gene Expression Data

OmicsCheck is a cutting-edge, automated tool designed to evaluate the quality of gene expression datasets from GEO (Gene Expression Omnibus). It intelligently analyzes raw expression matrices and generates professional PDF reports, visual summaries, and interpretable quality metrics – enabling researchers to screen studies before conducting downstream analysis.


Features

  • Download and parse GEO Series Matrix files automatically.
  • Auto-orientation of matrices (genes × samples vs. samples × genes).
  • Log2 transformation suggestions based on data distribution.
  • Data filtering for the most variable genes (top-N).
  • QA Score Evaluation using multi-criteria assessment.
  • PCA and Heatmap visualization of gene-level variability.
  • PDF report generation with visual plots and summary stats.
  • Exports analysis files in .json and .csv formats.

Installation

pip install omicscheck

Requires Python 3.8+


Usage

omicscheck run GSE12345

This will:

  1. Download the matrix file for GSE12345
  2. Parse and analyze the data
  3. Evaluate quality and suggest improvements
  4. Generate visual plots
  5. Create a full PDF report at: ~/Desktop/OmicsCheck/GSE12345/report.pdf

Quality Evaluation Logic

OmicsCheck uses a composite QA Score based on:

  • Matrix completeness
  • Distribution characteristics
  • Gene variance
  • Sample PCA spread

Each dataset receives a final Rating: Excellent, Good, Moderate, or Poor


Output Files

  • report.pdf: Professional scientific report
  • analysis.json: Full results in machine-readable format
  • analysis.csv: Simplified metrics table
  • boxplot.png, heatmap.png, pca_plot.png: Generated plots

Example Output

Example report: docs/report_example.pdf


OmicsCheck

A smart pre-download screening tool for high-quality omics datasets.

PyPI version License DOI


Citation

Ahmed Yassin. OmicsCheck Enables Improved Selection of High-Quality Gene Expression Datasets: A Pre-download Screening Tool with Biological Impact Assessment.
Research Square (2025).
https://doi.org/10.21203/rs.3.rs-7297713/v1


Contributing

We welcome contributions! Fork the repo and submit a pull request.


Contact

Lead Developer: [AHMED YASSIN || Computational Biologist]
Email: [Ahmedyassin300@outlook.com]
Project Page: [https://github.com/AHMEDY3DGENOME/OmicsCheck]


© 2025 OmicsCheck Team. All rights reserved.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omicscheck-1.0.2.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

omicscheck-1.0.2-py3-none-any.whl (12.2 kB view details)

Uploaded Python 3

File details

Details for the file omicscheck-1.0.2.tar.gz.

File metadata

  • Download URL: omicscheck-1.0.2.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for omicscheck-1.0.2.tar.gz
Algorithm Hash digest
SHA256 27c503929950e84881b31c4aecf26219d210fad02f4d2bfda4d2324600fcf82c
MD5 cacab07649486f6df5bd293fdfc75fb1
BLAKE2b-256 3394c672ae1a949e2b70f3be704147f5555d6268f12f7313a03dec0d0918eb47

See more details on using hashes here.

Provenance

The following attestation bundles were made for omicscheck-1.0.2.tar.gz:

Publisher: publish.yml on AHMEDY3DGENOME/OmicsCheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file omicscheck-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: omicscheck-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 12.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for omicscheck-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 7f3f7f265d4bcfb4e06e8bbc6b779beedfc33c5c752860ebff7ad5ddd780eb67
MD5 45ececc80934da942381178b33a6ad40
BLAKE2b-256 cd4ca3d8fe0d1e1c0dfe1ac4555bc1ebd51d524bde6794c13a4a03f5237a16ba

See more details on using hashes here.

Provenance

The following attestation bundles were made for omicscheck-1.0.2-py3-none-any.whl:

Publisher: publish.yml on AHMEDY3DGENOME/OmicsCheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page