Skip to main content

icd embedding for machine learning

Project description

icdcodex

https://img.shields.io/pypi/v/icdcodex.svg Documentation Status

ICD embedding for machine learning, created for MedHacks2020 ❤️.

What is Medhacks?

MedHacks hosted by Johns Hopkins University aims to unite talented and diverse minds from all backgrounds in order to foster a collaborative environment that aims to solve the world’s medical obstacles and issues.

The Problem

ICD coding is a laborous, but difficult to automate by machine learning because the output space if intractably large. (ICD-10CM has over 70,000 codes.) icdcodex creates a vector embedding for this input space, making it simpler for machine learning practioners to efficiently adapt algorithms for ICD coding.

Our Solution

We rely on the word2vec model to generate this embedding. In this set up, each ICD code represents a “word,” whereas a path sampled from breadth-first or depth-first search represents the “sentence.”

The Team

  • Jeremy Adams Fisher

  • Alhusain Abdalla

  • Natasha Nehra

  • Tejas Patel

  • Hamrish Saravanakumar

Features

  • Curated networkX graphs representing ICD9 and ICD10 hierarchies

  • A simple API to generate continuous embeddings for these hierarchies

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2020-09-04)

  • First release on PyPI.

0.3.0 (2020-09-05)

  • Finesse API, now consistent between documentation and implementation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

icdcodex-0.3.0.tar.gz (31.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

icdcodex-0.3.0-py2.py3-none-any.whl (7.0 kB view details)

Uploaded Python 2Python 3

File details

Details for the file icdcodex-0.3.0.tar.gz.

File metadata

  • Download URL: icdcodex-0.3.0.tar.gz
  • Upload date:
  • Size: 31.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for icdcodex-0.3.0.tar.gz
Algorithm Hash digest
SHA256 674d2f817c276addbdf0054d234bdc4693b39f7d7105d9dc70d18a663d075d5d
MD5 e514b65d160574ce2e997efb06776cde
BLAKE2b-256 8179c4777174c954f9818135325488900c2566dd7a195edc6205d72a306d1426

See more details on using hashes here.

File details

Details for the file icdcodex-0.3.0-py2.py3-none-any.whl.

File metadata

  • Download URL: icdcodex-0.3.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for icdcodex-0.3.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 84b98d5c882730042564bf36d15b183f008272313122f70138505e351b12e4cc
MD5 7f3c8c56606d1f7ba719141f8d69dcfa
BLAKE2b-256 9bb5a598b90676364f11f96fcddf602c586bd4f4c4597cc332fff35bb5c44e06

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page