Skip to main content

UKB Access Repo

Project description

UKB_Wrapper_Repo

This library has been designed to provide convenient access to UKB dataset to users of PSC and Openmind7 clusters by leveraging Datalad package. Datalad allows its users to store large datasets in a distributed manner and faciliate easier collaboration. Currently the repo has modules dedicated to three types of data stored within the UKB dataset namely Scalar, Genetic and Bulk data types. The library also serves as a directory for all the unique categories and field ids which are part of UKB dataset.

Datalad is designed as a tool to version control large datasets in a manner quite similar to Git but limited to either a single network, HPC cluster. This library has been designed to help the end user make use the principles of datalad to access data stored on a different cluster and even with limited knowledge of how datalad works.

Additionally this repository also houses the notebook files related to this project showcasing exploratory data analysis done on multiple categories.The modules are currently designed to give access to the necessary data through the use of maximum of three-four lines of code.

  1. Import relevant modules and initilize objects
from UKBRepo.UKBRepo import module_scalar_data_handler as scalar_module

scalar_handler_object=scalar_module.scalar_data_handler()
  1. Pick the main category to which your datatype belongs (T1_Images/Freesurfer/Diet/Smoking)
scalar_handler_object.display_all_ukb_categories()
  1. Fetch relevant field for that Category
bulk_handler_object.get_field_ids_for_category(Category_Name)
  1. Retrive the list of subjects who have those particular field ids
bulk_handler_object.get_subject_list_field_ids(Field_Id_List)
  1. Retrive relevant data for the subjects
bulk_handler_object.get_data_bulk(Field_Id,subject_id)

After executing the funtion of data retrival,the output user receives depends upon the type of data bieng requested. In case of scalar data the user will recive the actual data in the output whereas in case of bulk data it will be the path where the fetched bulk data files have been stored.

Requirements

  1. Datalad
  2. Git-annex
  3. Pandas
  4. Numpy

Refrences

http://handbook.datalad.org/en/latest/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ukbaccessrepo-0.370.tar.gz (9.6 kB view details)

Uploaded Source

Built Distributions

ukbaccessrepo-0.370-py3.9.egg (19.1 kB view details)

Uploaded Source

ukbaccessrepo-0.370-py3-none-any.whl (10.9 kB view details)

Uploaded Python 3

File details

Details for the file ukbaccessrepo-0.370.tar.gz.

File metadata

  • Download URL: ukbaccessrepo-0.370.tar.gz
  • Upload date:
  • Size: 9.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for ukbaccessrepo-0.370.tar.gz
Algorithm Hash digest
SHA256 b1dda9cae6ea4e174f5c21c1abf303610d96353a274159e679c7ff0a3d78cdec
MD5 3146f08d9c9f44d06783df1a010ece99
BLAKE2b-256 7c1bb5b9fe0dae643a3cee1f0ce882b4e51b7d08c3450272b1be9430b70b2dd6

See more details on using hashes here.

File details

Details for the file ukbaccessrepo-0.370-py3.9.egg.

File metadata

  • Download URL: ukbaccessrepo-0.370-py3.9.egg
  • Upload date:
  • Size: 19.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for ukbaccessrepo-0.370-py3.9.egg
Algorithm Hash digest
SHA256 3e62bffd655b489c71524fd7911ef691f3de2beae32548b91d9f351aeb75ab9e
MD5 7bf7b90646a58277525e90e46811c244
BLAKE2b-256 34db443f87ea991746cdffbec0b5abfc85046c57e889b1af7468dddcc8902e4b

See more details on using hashes here.

File details

Details for the file ukbaccessrepo-0.370-py3-none-any.whl.

File metadata

  • Download URL: ukbaccessrepo-0.370-py3-none-any.whl
  • Upload date:
  • Size: 10.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for ukbaccessrepo-0.370-py3-none-any.whl
Algorithm Hash digest
SHA256 2a923ab3f2c8acd6104cedb2f4287e326b18bb3785b90344086278340c567207
MD5 af8193f5edde859836710cd2161b27a1
BLAKE2b-256 957a12ce68951b075bc9b20a97e8e4831cd414ab4b0f348ffdc29bd5ce13edd5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page