Skip to main content

This package provides access to the e-Science Central data warehouse that can be used to store, access and analyse data collected in scientific studies, including for healthcare applications

Project description

Data Warehouse Client

This package provides access to the e-Science Central data warehouse that can be used to store, access and analyse data collected in scientific studies, including for healthcare applications. The primary aim of the warehouse was to create a general system that enables users to explore data collected in a variety of forms. This might include data collected through questionnaires, data collected from sensors, and features extracted from the analysis of sensor data (e.g. activity levels derived from raw accelerometer data). Researchers might wish to slice, dice, visualise, analyse and explore this data in different ways, e.g. all results for one participant, all results for one type of measure in a study, changes in measurements over time. Others may wish to build models that can then be used in applications that make predictions about future values.

Traditionally, data collected in studies has been stored in a collection of files, often with metadata encoded in the filenames. This makes it difficult, and time consuming, for researchers to explore, interpret and analyse the data. The data warehouse exploits modern database technology to vastly simplify this effort. In doing this we have drawn heavily on the best practice for data warehouse design. However, there is more variety in the types of healthcare data to be stored than there is in a typical warehouse, and so we have been forced to deviate from a conventional data warehouse in some aspect of the design.
There are three guiding principles behind the design:

  1. The data warehouse must be able to store any type of data collected in a study without modifying the schema. This means that when new types of data are collected in studies (e.g. from a new questionnaire, a new data analysis program, or a new sensor) they can be stored in the warehouse without any changes to its design. This has 3 main advantages: firstly, it enables us to fix and optimise the schema for the tables in which the data is stored; secondly it means that applications and tools (e.g. for analysis and visualisation) built on the warehouse do not have to be updated when new types of data are added; thirdly, a single, multi-tenant database server can support many studies. This reduces the overall costs, the start-up time for a new study, and the overheads of managing the warehouse.
  2. Descriptive information about the types of measurement is stored in the warehouse so that tools or humans can interpret the data stored there.
  3. The design is optimised for query performance. In several cases, this has led to denormalization (duplication of data) to reduce the need for expensive joins.
  4. It must support a security regime to restrict each user’s access to the data collected in studies.

Running Instructions

To install from PyPi, run:

pip install data-warehouse-client

In directory in which your executable is run, create a db-credentials.json file containing database credentials (substituting all <VARS>):

{"user": "<USER>", "pass": "<PASSWORD>", "IP": "<IP>", "port": <PORT>}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data-warehouse-client-0.1.0.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data_warehouse_client-0.1.0-py3-none-any.whl (37.8 kB view details)

Uploaded Python 3

File details

Details for the file data-warehouse-client-0.1.0.tar.gz.

File metadata

  • Download URL: data-warehouse-client-0.1.0.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.6

File hashes

Hashes for data-warehouse-client-0.1.0.tar.gz
Algorithm Hash digest
SHA256 cdd92c79e701c820ddee889d38bad49cad667d6849409a4ccdc7427f287f37d0
MD5 cd4a1b2816e3c301c3aeb7f13c3ca1c6
BLAKE2b-256 39295c12696bfe7892bad5e076075fb77488293bb98ffc72628474301333376b

See more details on using hashes here.

File details

Details for the file data_warehouse_client-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: data_warehouse_client-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 37.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.6

File hashes

Hashes for data_warehouse_client-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1e2e687dd934099d1f5e73674085c82f61be074b2ddde55a7e5c62b8e78feb68
MD5 9c7c5e70c8b9df129187cdda5a016439
BLAKE2b-256 875c6f078677042e82000bd5fedcbc78d087b295cc57dc12d27cad4e3a2c5b62

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page