Skip to main content

A lightweight tool to inspect GEO series matrices online before downloading.

Project description

OmicsCheck: Intelligent Quality Assessment for Gene Expression Data

OmicsCheck is a cutting-edge, automated tool designed to evaluate the quality of gene expression datasets from GEO (Gene Expression Omnibus). It intelligently analyzes raw expression matrices and generates professional PDF reports, visual summaries, and interpretable quality metrics – enabling researchers to screen studies before conducting downstream analysis.


Features

  • Download and parse GEO Series Matrix files automatically.
  • Auto-orientation of matrices (genes × samples vs. samples × genes).
  • Log2 transformation suggestions based on data distribution.
  • Data filtering for the most variable genes (top-N).
  • QA Score Evaluation using multi-criteria assessment.
  • PCA and Heatmap visualization of gene-level variability.
  • PDF report generation with visual plots and summary stats.
  • Exports analysis files in .json and .csv formats.

Installation

pip install omicscheck

Requires Python 3.8+


Usage

omicscheck run GSE12345

This will:

  1. Download the matrix file for GSE12345
  2. Parse and analyze the data
  3. Evaluate quality and suggest improvements
  4. Generate visual plots
  5. Create a full PDF report at: ~/Desktop/OmicsCheck/GSE12345/report.pdf

Quality Evaluation Logic

OmicsCheck uses a composite QA Score based on:

  • Matrix completeness
  • Distribution characteristics
  • Gene variance
  • Sample PCA spread

Each dataset receives a final Rating: Excellent, Good, Moderate, or Poor


Output Files

  • report.pdf: Professional scientific report
  • analysis.json: Full results in machine-readable format
  • analysis.csv: Simplified metrics table
  • boxplot.png, heatmap.png, pca_plot.png: Generated plots

Example Output

Example report: docs/report_example.pdf


Citation

If you use OmicsCheck in your research, please cite:

OmicsCheck: Intelligent Pre-Analysis Quality Evaluation for Gene Expression Datasets. (2025). Bioinformatics Tool Suite. DOI:10.xxxx/omicscheck

Contributing

We welcome contributions! Fork the repo and submit a pull request.


Contact

Lead Developer: [AHMED YASSIN || Computational Biologist]
Email: [your.email@domain.com]
Project Page: [https://github.com/AHMEDY3DGENOME/OmicsCheck]


© 2025 OmicsCheck Team. All rights reserved.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omicscheck-1.0.1.tar.gz (10.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

omicscheck-1.0.1-py3-none-any.whl (12.0 kB view details)

Uploaded Python 3

File details

Details for the file omicscheck-1.0.1.tar.gz.

File metadata

  • Download URL: omicscheck-1.0.1.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for omicscheck-1.0.1.tar.gz
Algorithm Hash digest
SHA256 8ee1d292370a8f183b4e22723e02669a629c6956fa86c4fadf513609eee9441e
MD5 55714fe6857bc1933b09952b80c8f0cd
BLAKE2b-256 30398d3bb6eb4cd03070d356e8b0ac9ad265218f99dac7d6db62b7acc7544f7a

See more details on using hashes here.

Provenance

The following attestation bundles were made for omicscheck-1.0.1.tar.gz:

Publisher: publish.yml on AHMEDY3DGENOME/OmicsCheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file omicscheck-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: omicscheck-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 12.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for omicscheck-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3f8436d7aeb754c2e4d631c807963e8f3624db486b8d1387f523c9e16fb1d63a
MD5 d40b6edb8087adf9c683c1ecfc1346ec
BLAKE2b-256 3b7d0d0b879a38b1c1ee1f723e3b247449d4d85af6c7f38bbef0a6d328c0882f

See more details on using hashes here.

Provenance

The following attestation bundles were made for omicscheck-1.0.1-py3-none-any.whl:

Publisher: publish.yml on AHMEDY3DGENOME/OmicsCheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page