Skip to main content

OpenOmics provides a bioinformatics API and web-app platform integrate and visualize the multiomics and clinical data.

Project description

PyPI version Documentation Status pyOpenSci status DOI OpenOmics codecov

OpenOmics is currently under active development and we may break API compatibility in the future.

This Python package provide a series of tools to integrate and explore the genomics, transcriptomics, proteomics, and clinical data (aka multi-omics data). With interfaces to popular annotation databases and scalable data-frame manipulation tools, OpenOmics facilitates the common data wrangling tasks when preparing data for RNA-seq bioinformatics analysis.

Documentation (Latest | Stable) | OpenOmics at a glance

Features

OpenOmics assist in integration of heterogeneous multi-omics bioinformatics data. The library provides a Python API as well as an interactive Dash web interface. It features support for:

  • Genomics, Transcriptomics, Proteomics, and Clinical data.
  • Harmonization with 20+ popular annotation, interaction, disease-association databases.

OpenOmics also has an efficient data pipeline that bridges the popular data manipulation Pandas library and Dask distributed processing to address the following use cases:

  • Providing a standard pipeline for dataset indexing, table joining and querying, which are transparent and customizable for end-users.
  • Providing Efficient disk storage for large multi-omics dataset with Parquet data structures.
  • Integrating various data types including interactions and sequence data, then exporting to NetworkX graphs or data generators for down-stream machine learning.
  • Accessible by both developers and scientists with a Python API that works seamlessly with an external Galaxy tool interface or the built-in Dash web interface (WIP).

Installation via pip:

$ pip install openomics

Citations

The journal paper for this scientific package was reviewed by JOSS at https://joss.theoj.org/papers/10.21105/joss.03249#, and can be cited with:

# BibTeX
@article{Tran2021,
  doi = {10.21105/joss.03249},
  url = {https://doi.org/10.21105/joss.03249},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {61},
  pages = {3249},
  author = {Nhat C. Tran and Jean X. Gao},
  title = {OpenOmics: A bioinformatics API to integrate multi-omics datasets and interface with public databases.},
  journal = {Journal of Open Source Software}
}

Credits

Thank you for extremely helpful feedback and guidance from the pyOpenSci reviewers. This package was created with the pyOpenSci/cookiecutter-pyopensci project template, based off audreyr/cookiecutter-pypackage.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openomics-0.8.9.tar.gz (70.5 MB view hashes)

Uploaded source

Built Distribution

openomics-0.8.9-py2.py3-none-any.whl (20.9 kB view hashes)

Uploaded py2 py3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page