Skip to main content

No project description provided

Project description


IGEM - Integrative Genome-Exposome Method
=========================================

An Architecture for Efficient Bioinformatics Analysis
-----------------------------------------------------


Abstract:
IGEM software is a robust and scalable architecture designed for bioinformatics analysis. IGEM incorporates various modules that seamlessly work together to enable efficient data processing, analysis, and visualization. This paper explores the architecture of IGEM, including its core components, the two versions available (Server and Client), the ETL (Extraction, Transformation, and Loading) process, term replacement techniques, and the utilization of master data. Additionally, it highlights the powerful analysis functions offered by IGEM, such as dataset loading, quality control functionalities, and association and interaction analyses. The flexibility and capabilities of IGEM make it a valuable tool for researchers and practitioners in the field of omics research.

1. Introduction
The IGEM software provides a comprehensive suite of tools for bioinformatics analysis. Its architecture is built upon a scalable and efficient framework that supports the integration and analysis of diverse omics datasets. In this paper, we delve into the various aspects of the IGEM architecture, highlighting its key components, functionalities, and advantages.

2. IGEM Architecture
The architecture of IGEM revolves around its core modules, which enable seamless data processing, analysis, and visualization. At the heart of IGEM lies the GE-db, a multi-database that serves as the foundation of the knowledge base. This knowledge base is vital for conducting meaningful analyses and extracting valuable insights from external sources.

3. IGEM Versions: Server and Client
To cater to different user needs, IGEM is available in two distinct versions: the IGEM Server and the IGEM Client. The IGEM Server version provides a comprehensive suite of tools for handling large-scale omics data and performing advanced analytics. On the other hand, the IGEM Client version offers a streamlined and lightweight experience, suitable for individual researchers or smaller teams focusing on specific analyses.

4. ETL Process: Collect, Prepare, Map, Reduce
The ETL (Extraction, Transformation, and Loading) process is a crucial component of IGEM, ensuring the acquisition and preparation of data for analysis. The ETL process consists of four steps: collect, prepare, map, and reduce. In the collect step, active datasets are selected and the latest data is extracted and stored. The prepare step transforms the data into a well-structured format, while the map step establishes relationships between terms. Finally, the reduce step identifies and records terms per line, ensuring accurate and up-to-date information is stored.

5. Replacing Terms: Pre-computed Mapping and IGEM Search Engine
To ensure consistency and accuracy in the data, IGEM employs a pre-computed term mapping approach combined with a powerful search engine. Prior to the ETL process, a mapping table is created, associating different variations and synonyms of terms with their standardized counterparts. During the term replacement step, IGEM's search engine matches terms in the data with their standardized form, ensuring coherence and alignment within the dataset.

6. IGEM Master Data
IGEM utilizes master data entries to effectively configure and manage the integration of external datasets. These entries provide essential information about each dataset, including unique identifiers, database details, field-level parameters, and hierarchical relationships among terms. Configuring field-level parameters ensures accurate interpretation of data, while establishing term hierarchies enhances organization and accessibility.

7. Analysis Functions: Server and Client Versions
Both the IGEM Server and Client versions offer a range of analysis functions to enhance the software's capabilities. Users can load datasets, apply quality control processes, and perform association and interaction analyses. Association analysis allows users to explore relationships between variables, while interaction analysis focuses on ExE and GxE interactions. Pairwise analysis further refines the investigation of specific pairs exhibiting
significant interactions.

8. Conclusion
The IGEM software provides a robust and scalable architecture for efficient bioinformatics analysis. Its modular design, flexible functionality, and powerful analysis capabilities make it a valuable tool for researchers and practitioners in the field. By leveraging the IGEM architecture, users can seamlessly integrate omics datasets, perform comprehensive analyses, and gain valuable insights into biological systems. Further advancements and enhancements to the IGEM software will continue to propel bioinformatics research forward, driving discoveries and breakthroughs in the field of omics research.


Questions
---------

feel free to open an `Issue <https://github.com/HallLab/igem/issues>`_.

Citing IGEM
--------------


https://igem.readthedocs.io/en/latest/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

igem-0.1.4.tar.gz (3.3 MB view details)

Uploaded Source

Built Distribution

igem-0.1.4-py3-none-any.whl (3.6 MB view details)

Uploaded Python 3

File details

Details for the file igem-0.1.4.tar.gz.

File metadata

  • Download URL: igem-0.1.4.tar.gz
  • Upload date:
  • Size: 3.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.2.2 CPython/3.7.12 Darwin/21.3.0

File hashes

Hashes for igem-0.1.4.tar.gz
Algorithm Hash digest
SHA256 009d5e3cd16046ee21824857e2c17ff27f50725c15ca6facab6ceba43ca8934d
MD5 b4d69c5528b1b2e5291f5729d8ac001b
BLAKE2b-256 50ed14672ebeb84ca6c9e021c5779dd5cb78663141c86556b3c475e81a04be25

See more details on using hashes here.

File details

Details for the file igem-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: igem-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 3.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.2.2 CPython/3.7.12 Darwin/21.3.0

File hashes

Hashes for igem-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 7e42c8092324d6184e41990ad64651495793ac8aa166961dbbf61a6504027f3b
MD5 1c4ddfd897c03921c23532cd86931a1a
BLAKE2b-256 cff6a08f98617ac1eb54a109d1ae7c5d9915fed9d77d67600ccb38e9e0c2fe7c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page