Skip to main content

Example ClaML to TranSMART loader

Project description

Build status codecov PyPI PyPI - Downloads

This package contains a mapper that reads ontologies in Classification Markup Language (ClaML) format and translates them to the data model of the TranSMART platform, an open source data sharing and analytics platform for translational biomedical research.

It also provides a utility that applies the mapper and writes the translated data, using transmart-loader, to tab-separated files that can be loaded into a TranSMART database using the transmart-copy tool.

The packages uses the python-claml package for reading ClaML files.

⚠️ Note: this is a very preliminary version, still under development. Issues can be reported at https://github.com/thehyve/python_claml2transmart/issues.

Installation

The package requires Python 3.6.

To install claml2transmart, do:

pip install claml2transmart

Or from source:

git clone https://github.com/thehyve/python_claml2transmart.git
cd python_claml2transmart
pip install .

Run tests (including coverage) with:

python setup.py test

Usage

Read ontology from an XML file in ClaML format and write the output in transmart-copy format to /path/to/output. The output directory should be empty of not existing (then it will be created).

claml2transmart [--code-prefix] <system> <input.xml> /path/to/output

With --code_prefix, the concept code is used as prefix for the concept label.

Example: the ICD-10-GM (German modification of ICD-10) is available at icd10gm2019syst-claml.zip.

# Unzip and navigate to the classification directory
mkdir icd10gm2019syst-claml
cd icd10gm2019sys-claml
unzip ../icd10gm2019syst-claml.zip
# create an output directory
mkdir output
# apply the mapping
claml2transmart http://dimdi.de/icd10gm2019 Klassifikationsdateien/icd10gm2019syst_claml_20180921.xml output

This generates the directories i2b2metadata and i2b2demodata in the output directory. The generated data can be loaded using transmart-copy:

# Download transmart-copy:
curl -f -L https://repo.thehyve.nl/service/local/repositories/releases/content/org/transmartproject/transmart-copy/17.1-HYVE-5.9-RC3/transmart-copy-17.1-HYVE-5.9-RC3.jar -o transmart-copy.jar
# Load data
PGUSER=tm_cz PGPASSWORD=tm_cz java -jar transmart-copy.jar -d output

Acknowledgement

This project was funded by the German Ministry of Education and Research (BMBF) as part of the project DIFUTURE - Data Integration for Future Medicine within the German Medical Informatics Initiative (grant no. 01ZZ1804D).

License

Copyright (c) 2019 The Hyve B.V.

The ClaML ontology to TranSMART loader is licensed under the MIT License. See the file LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

claml2transmart-0.1.5.tar.gz (5.5 kB view details)

Uploaded Source

Built Distribution

claml2transmart-0.1.5-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file claml2transmart-0.1.5.tar.gz.

File metadata

  • Download URL: claml2transmart-0.1.5.tar.gz
  • Upload date:
  • Size: 5.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.7.1

File hashes

Hashes for claml2transmart-0.1.5.tar.gz
Algorithm Hash digest
SHA256 9d4842bf8d0dddb0872126a6f799d6e9638d1bb2a717ce8a43d48c3ef2b1bb75
MD5 f8445164269d08d5fed82b9884e6c6f7
BLAKE2b-256 1b01e393eb3c81f22febe21cc8f4d8976111a07be21206c3c301a353c8252f28

See more details on using hashes here.

File details

Details for the file claml2transmart-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: claml2transmart-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.7.1

File hashes

Hashes for claml2transmart-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 ecb7b3a0acecb669e0c5972e9fb326c1f2a524d30201b9b8a8958d5451e92be1
MD5 5842cca287afc6fd2514634865e75156
BLAKE2b-256 b76c0d726b8801d1786dc564ecd256ced0573aa796a410e19ac906c896801a0d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page