
A Unified Medical Language System® Metathesaurus study for the Cumulus project


Cumulus Library UMLS

An installation of the Unified Medical Language System® Metathesaurus®. Part of the SMART on FHIR Cumulus Project.

For more information, browse the documentation.

Usage

To use the Metathesaurus, you'll need an API key from the National Library of Medicine, which you can sign up for here.

You can then install this module by running pip install cumulus-library-umls.

This will add a umls target to cumulus-library. You'll need to pass your API key via the --umls-key CLI flag, or set the UMLS_API_KEY environment variable to the key you received from NIH.

This is a fairly intensive operation: we download a large file, extract it, create parquet files from Athena, and then upload them. It usually takes about half an hour to run. We preserve some of the intermediate artifacts along the way to make rebuilds faster. If you need to force recreation from scratch, use the --force-upload CLI flag.
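The invocation described above can be sketched as a small helper that assembles the command line. This is a minimal sketch, not the project's own tooling: only the umls target name, the --umls-key and --force-upload flags, and the UMLS_API_KEY variable come from this README. The `build` subcommand and `--target` option are assumptions about how cumulus-library is typically invoked, the function name is hypothetical, and any database connection arguments your setup needs are omitted.

```python
import os
import shlex


def umls_build_command(api_key=None, force_upload=False):
    """Return an argv list for building the umls target.

    Assumption: `build` and `--target` are how cumulus-library selects a
    study; check `cumulus-library --help` for your installed version.
    """
    argv = ["cumulus-library", "build", "--target", "umls"]
    if api_key is not None:
        # Pass the key explicitly on the command line.
        argv += ["--umls-key", api_key]
    elif "UMLS_API_KEY" not in os.environ:
        # No flag and no environment variable: the download cannot authenticate.
        raise RuntimeError("Set UMLS_API_KEY or pass api_key")
    if force_upload:
        # Skip the preserved artifacts and recreate everything from scratch.
        argv.append("--force-upload")
    return argv


if __name__ == "__main__":
    # Print the command instead of running it, so nothing is downloaded here.
    print(shlex.join(umls_build_command(api_key="YOUR-KEY", force_upload=True)))
```

Printing the command rather than executing it keeps the sketch free of side effects; pass the list to `subprocess.run` once the arguments match your environment.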

Note: This study is explicitly namespaced in its own schema, umls. Make sure your database is not using this schema for any other purpose, and do not create tables inside this schema by any other means.

Additional custom tables

The following tables are derived from the primary tables, and are included here as a convenience to avoid having to compute them repeatedly.

  • mrrel_drug_is_a: a subset of the relationships in mrrel, including only those that define concept A as a member of concept B (i.e. A is a child, or is explicitly marked as a tradename/member belonging to the parent concept), for drugs and drug-related topics.
  • mrconso_drugs: a subset of the entity list in mrconso, limited to vocabularies specifically dealing with drug identifiers (e.g. SNOMED, RxNorm, etc.)
  • mrconso_icd10cm/mrrel__icd10cm: slices of the respective main tables, containing only records from the ICD10 coding system
  • icd10_(type): slices of the coding system at the relevant level of the ICD10 hierarchy (category, block, chapter, subcategory[1-3], extension)
  • icd10_tree: provides a relation-navigable code hierarchy of the individual levels in the ICD10 hierarchy
  • icd10_hierarchy: provides an extracted tabular representation of the full ICD10 code system
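To illustrate what "relation-navigable" means for a table like icd10_tree, here is a minimal sketch that walks child-to-parent links up to the chapter level. The mapping is a hand-written stand-in used only for illustration: the real table's column names and contents are not described in this README, and the function name is hypothetical.

```python
# Hypothetical child -> parent links, standing in for rows of a tree table.
# The actual icd10_tree schema may differ; these codes are illustrative only.
PARENT = {
    "J45.909": "J45.90",   # extension-level code -> subcategory
    "J45.90": "J45.9",     # subcategory -> subcategory
    "J45.9": "J45",        # subcategory -> category
    "J45": "J40-J47",      # category -> block
    "J40-J47": "J00-J99",  # block -> chapter
}


def ancestors(code, parent=PARENT):
    """Return the ancestors of `code`, nearest parent first."""
    chain = []
    seen = {code}
    while code in parent:
        code = parent[code]
        if code in seen:
            # Guard against malformed data that loops back on itself.
            raise ValueError(f"cycle detected at {code}")
        seen.add(code)
        chain.append(code)
    return chain


if __name__ == "__main__":
    print(ancestors("J45.909"))
```

A flattened table such as icd10_hierarchy would presumably store the result of this walk as columns in a single row, so no traversal is needed at query time.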

Licensing details

The cumulus-library-umls study is provided as a convenience to install the UMLS Metathesaurus, but is not shipped with the Metathesaurus dataset. It will require an API key to download the data from NIH directly.

As a reminder, the License Agreement for Use of the UMLS® Metathesaurus® places several restrictions on the use of this data (including distributing the dataset). When you sign up for a UMLS key, you are assuming responsibility for complying with these terms, or with an alternate licensing agreement with the owner of the Metathesaurus data if you are provided with one.

Citations

Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061. PubMed PMID: 14681409; PubMed Central PMCID: PMC308795.

