Download, retrieve and process omics data for further bioinformatics

Project description

# bio_omics

Download, retrieve and process omics data or bioinformatics data from public databases.

Comprehensive Databases

Specific Databases

See the help documentation and example code at https://www.fbridges.com/pipeline/bio_omics.

https://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip

Data model

ETL data processing consists of several steps: download, retrieval, organization, combination, integration, enrichment, and formatting. This package focuses on the download, retrieval, and combination of omics data. A consistent data model is recommended. Data are organized by entity, for example protein or antigen. An example record is shown below. The 'key' field holds the unique identifier of the entity, and "ID" is created automatically. Data retrieved from each source are pushed as one key-value pair. Note:

  • Redundant data may occur and are allowed.
  • Each key-value pair is defined by its corresponding source database.
  • The model is intended for integration rather than enrichment, so combining or aggregating data across sources is not recommended.
  • Data from different sources may differ or be invalid. They are validated in a later step, not in this one.
{
    "ID": "79541",
    "key": "H0YED9",
    "UniProt_SwissProt": {
        ....
    },
    "NCBI": {
        ....
    },
    "PDB": {
        ....
    },
    ....    
}
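
The record layout above can be sketched in plain Python. This is only an illustration of the data model, not the bio_omics API: the helper names `new_entity` and `push_source`, the counter-based "ID", and the placeholder field values are all assumptions made for this example.

```python
import json
from itertools import count

# Hypothetical ID generator; the real package may assign "ID" differently.
_id_counter = count(1)


def new_entity(key):
    """Create an entity record with an automatic "ID" and a unique "key"."""
    return {"ID": str(next(_id_counter)), "key": key}


def push_source(entity, source, data):
    """Store data from one source under its own key, without merging
    it into the data of other sources."""
    entity[source] = data
    return entity


entity = new_entity("H0YED9")
# Placeholder payloads; real content is defined by each source database.
push_source(entity, "UniProt_SwissProt", {"example_field": "placeholder"})
push_source(entity, "NCBI", {"example_field": "placeholder"})

print(json.dumps(entity, indent=4))
```

Keeping each source under its own top-level key means that redundant or conflicting values from different databases sit side by side and can be validated in a later step.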

Download files

Source Distribution: bioomics-0.2.8.tar.gz (29.5 kB)

Built Distribution: bioomics-0.2.8-py3-none-any.whl (33.7 kB, Python 3)
