Skip to main content

UKB API

Project description

UKB_API

This library has been designed to provide convenient access to UKB dataset to users of PSC and Openmind7 clusters by leveraging Datalad package. Datalad allows its users to store large datasets in a distributed manner and faciliate easier collaboration. Currently the repo has modules dedicated to three types of data stored within the UKB dataset namely Scalar, Genetic and Bulk data types. The library also serves as a directory for all the unique categories and field ids which are part of UKB dataset.

Datalad is designed as a tool to version control large datasets in a manner quite similar to Git but limited to either a single network, HPC cluster. This library has been designed to help the end user make use the principles of datalad to access data stored on a different cluster and even with limited knowledge of how datalad works.

Additionally this repository also houses the notebook files related to this project showcasing exploratory data analysis done on multiple categories.The modules are currently designed to give access to the necessary data through the use of maximum of three-four lines of code.

Documentation

[Documentation for this repo can be found here]

Documentation

Installation

pip install ukb-api==0.370

  1. Import relevant modules and initilize objects
from UKBRepo.UKBRepo import module_scalar_data_handler as scalar_module

scalar_handler_object=scalar_module.scalar_data_handler()
  1. Pick the main category to which your datatype belongs (T1_Images/Freesurfer/Diet/Smoking)
scalar_handler_object.display_all_ukb_categories()
  1. Fetch relevant field for that Category
bulk_handler_object.get_field_ids_for_category(Category_Name)
  1. Retrive the list of subjects who have those particular field ids
bulk_handler_object.get_subject_list_field_ids(Field_Id_List)
  1. Retrive relevant data for the subjects
bulk_handler_object.get_data_bulk(Field_Id,subject_id)

After executing the funtion of data retrival,the output user receives depends upon the type of data bieng requested. In case of scalar data the user will recive the actual data in the output whereas in case of bulk data it will be the path where the fetched bulk data files have been stored.

Requirements

  1. Datalad
  2. Git-annex
  3. Pandas
  4. Numpy

Refrences

http://handbook.datalad.org/en/latest/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ukb_api-0.409.tar.gz (9.6 kB view details)

Uploaded Source

Built Distribution

ukb_api-0.409-py3-none-any.whl (10.6 kB view details)

Uploaded Python 3

File details

Details for the file ukb_api-0.409.tar.gz.

File metadata

  • Download URL: ukb_api-0.409.tar.gz
  • Upload date:
  • Size: 9.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for ukb_api-0.409.tar.gz
Algorithm Hash digest
SHA256 1f916a89673e3e62659e158c87b7c6c7b16b88510361f3ab060e4d09d3e80317
MD5 5f69cd1fdcdd4247f6b6aa70eec9b50b
BLAKE2b-256 e929d36da32c8e4a4dff8b2516512840568151a4fe416dd99ed493c41f0cbe69

See more details on using hashes here.

File details

Details for the file ukb_api-0.409-py3-none-any.whl.

File metadata

  • Download URL: ukb_api-0.409-py3-none-any.whl
  • Upload date:
  • Size: 10.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for ukb_api-0.409-py3-none-any.whl
Algorithm Hash digest
SHA256 ce9a3f85434367640b5d095e06f349a0e72239013d136bbb843afccf2409a9c6
MD5 df01ed0b5ab211a76f828adc89697610
BLAKE2b-256 71270662c1f9ae3833e75201974699683a90118cd1367d645687c210c9466e74

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page