GWASStudio: A Tool for Genomic Data Management

Overview

GWASStudio is a command-line tool for storing, retrieving, and querying genomic summary statistics. It provides high-performance infrastructure for handling and analyzing large-scale GWAS and QTL datasets, and it supports exploration across datasets.

Core Purpose

GWASStudio provides a unified interface across the CDH infrastructure for ingesting, storing, querying, and exporting genomic data.

Key Functionalities

GWASStudio's functionality falls into three areas:

1. Data Ingestion

  • Data Ingestion: Imports summary statistics and their associated metadata.
  • Support for Multiple Storage Options: Works with both local filesystems and cloud storage (S3).
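As a rough illustration of what ingestion involves conceptually, the sketch below parses a small tab-separated summary statistics table and bundles it with a metadata record. It is not GWASStudio's implementation: the column names (CHR, POS, EA, NEA, BETA, SE), the metadata keys, and the functions parse_sumstats and build_record are all hypothetical, and the sketch keeps everything in memory rather than writing to TileDB or MongoDB.

```python
import csv
import io

# Hypothetical summary statistics; column names are illustrative assumptions.
SUMSTATS_TSV = (
    "CHR\tPOS\tEA\tNEA\tBETA\tSE\n"
    "1\t10177\tA\tAC\t0.012\t0.004\n"
    "1\t10352\tT\tTA\t-0.008\t0.005\n"
)

# Hypothetical metadata describing the dataset the rows belong to.
METADATA = {"project": "demo", "study": "example_gwas", "trait": "height"}


def parse_sumstats(text):
    """Parse tab-separated rows into dicts, converting numeric fields."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    rows = []
    for row in reader:
        rows.append({
            "CHR": row["CHR"],
            "POS": int(row["POS"]),
            "EA": row["EA"],
            "NEA": row["NEA"],
            "BETA": float(row["BETA"]),
            "SE": float(row["SE"]),
        })
    return rows


def build_record(text, metadata):
    """Bundle parsed rows with a copy of their metadata."""
    rows = parse_sumstats(text)
    return {"metadata": dict(metadata), "n_variants": len(rows), "rows": rows}


record = build_record(SUMSTATS_TSV, METADATA)
print(record["n_variants"], record["metadata"]["trait"])
```

Keeping the metadata alongside the parsed rows mirrors the pairing described above; in the real tool the two parts are stored in separate backends.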

2. Data Querying

  • Flexible Search: Enables searching metadata using template files.
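The idea of a template-driven metadata search can be sketched as follows. The actual template file format is defined in the GWASStudio documentation; here a plain Python dict stands in for it, and the record fields (data_id, project, category, trait) and the functions matches and search are hypothetical names chosen for illustration.

```python
# Hypothetical metadata records; field names are illustrative assumptions.
RECORDS = [
    {"data_id": "a1", "project": "demo", "category": "GWAS", "trait": "height"},
    {"data_id": "b2", "project": "demo", "category": "pQTL", "trait": "IL6"},
    {"data_id": "c3", "project": "other", "category": "GWAS", "trait": "bmi"},
]

# Stand-in for a search template: each key must match one of the listed values.
TEMPLATE = {"project": "demo", "category": ["GWAS", "pQTL"]}


def matches(record, template):
    """Return True if the record satisfies every key in the template."""
    for key, wanted in template.items():
        allowed = wanted if isinstance(wanted, list) else [wanted]
        if record.get(key) not in allowed:
            return False
    return True


def search(records, template):
    """Return the records that match the template, in their original order."""
    return [r for r in records if matches(r, template)]


hits = search(RECORDS, TEMPLATE)
print([r["data_id"] for r in hits])
```

A template-based approach lets the same query be saved, shared, and rerun without retyping filter options on the command line.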

3. Data Export

  • Selective Export: Extracts subsets of data and their associated metadata by genomic region or SNP, or exports the entire dataset.
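Selecting by genomic region or by SNP can be pictured with the minimal sketch below. It assumes a "chrom:start-end" region string with inclusive bounds and an in-memory list of variants; the Variant type, the region syntax, and the function names are illustrative assumptions, not GWASStudio's API, which queries its array storage instead.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int
    snp_id: str
    beta: float


# Hypothetical variants used only for this illustration.
VARIANTS = [
    Variant("1", 10177, "rs1", 0.012),
    Variant("1", 250000, "rs2", -0.008),
    Variant("2", 5000, "rs3", 0.030),
]


def parse_region(region):
    """Parse 'chrom:start-end' into (chrom, start, end)."""
    chrom, span = region.split(":")
    start, end = span.split("-")
    return chrom, int(start), int(end)


def select_by_region(variants, region):
    """Keep variants on the given chromosome within the inclusive range."""
    chrom, start, end = parse_region(region)
    return [v for v in variants if v.chrom == chrom and start <= v.pos <= end]


def select_by_snps(variants, snp_ids):
    """Keep variants whose identifier is in the requested set."""
    wanted = set(snp_ids)
    return [v for v in variants if v.snp_id in wanted]


in_region = select_by_region(VARIANTS, "1:1-100000")
by_snp = select_by_snps(VARIANTS, ["rs2", "rs3"])
print([v.snp_id for v in in_region], [v.snp_id for v in by_snp])
```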

Technical Architecture

GWASStudio builds on the following technologies:

  1. TileDB Embedded: A high-performance array storage engine that enables efficient storage and retrieval of genomic data.
  2. MongoDB: A flexible, scalable NoSQL database used for storing and querying metadata associated with genomic datasets.
  3. Dask: Provides distributed computing capabilities for processing large datasets.
  4. Python Ecosystem: Built on Python with libraries like Click/Cloup for the command-line interface, Pandas for data manipulation, and various genomics-specific tools.

Installation

For detailed installation instructions, please refer to the documentation at https://ht-diva.github.io/gwasstudio/

Usage

For detailed usage instructions, please refer to the documentation, and see the cli_test script for practical examples.

Citation

Example files are derived from:

The variant call format provides efficient and robust storage of GWAS summary statistics. Matthew Lyon, Shea J Andrews, Ben Elsworth, Tom R Gaunt, Gibran Hemani, Edoardo Marcora. bioRxiv 2020.05.29.115824; doi: https://doi.org/10.1101/2020.05.29.115824
