Download, retrieve and process omics data for further bioinformatics
Project description
\n# bio_omics Download, retrieve and process omics data, or biological informatics data from public database
Comprehensive Databases
- NCBI: genome database,
- UniProt: protein database, https://www.expasy.org/resources/uniprotkb-swiss-prot
Sepecific Databases
- miRBase: mircoRNA database, https://www.mirbase.org/
- RNACentral: non-coding RNA squence database, https://rnacentral.org/
- IEDB: immune epitope database, https://www.iedb.org/
See the help documents of example coding at https://www.fbridges.com/pipeline/bio_omics.
https://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip
data model
ETL data processing is composed of some steps including downloads, retrieval, organization, combination, integration, enrichment, formation. This packages focus on downloads, retrieval, and combination of omics data. It is suggested that the data model would be consistent. Data are organized by entity namely protein, or antigen. An example of data is showed as the below. Here the pair 'key' defines unique identifier of this entity. "ID" is automatically created. Retrieved data are pushed as one key-value. Note:
- Abundant data are possible and to be allowed.
- The key-value is defined by this corresponding database source.
- Used for Integration rather than enrichment. Therefore, data combination or aggregation is not recommended.
- Data from various source could be different or invalid. Those would be validated in the afterwards step rather than this step.
{
"ID": "79541",
"key": "H0YED9",
"UniProt_SwissProt": {
....
},
"NCBI": {
....
},
"PDB": {
....
},
....
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file bioomics-0.2.9.tar.gz
.
File metadata
- Download URL: bioomics-0.2.9.tar.gz
- Upload date:
- Size: 31.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 55b55d955738d46b1c7aa44764a45c0422d629c325236896d6ae31aff86706ca |
|
MD5 | 979003f6b5d9121923fad82ea061db12 |
|
BLAKE2b-256 | e5758f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf |
File details
Details for the file bioomics-0.2.9-py3-none-any.whl
.
File metadata
- Download URL: bioomics-0.2.9-py3-none-any.whl
- Upload date:
- Size: 35.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 333c09172d756d630a575fe8577668519a7152e641e6b4e1ab69fbcbab7d79cc |
|
MD5 | d86bdfbbcc1d796742933203a4b9bf26 |
|
BLAKE2b-256 | 29d8bab7009df5b5b08242683307b2855568e366aa500c4b0212f9762b32a29b |