icd embedding for machine learning
Project description
icdcodex
ICD embedding for machine learning, created for MedHacks2020 ❤️.
Free software: MIT license
Documentation: https://icdcodex.readthedocs.io.
What is Medhacks?
MedHacks hosted by Johns Hopkins University aims to unite talented and diverse minds from all backgrounds in order to foster a collaborative environment that aims to solve the world’s medical obstacles and issues.
The Problem
ICD coding is a laborous, but difficult to automate by machine learning because the output space if intractably large. (ICD-10CM has over 70,000 codes.) icdcodex creates a vector embedding for this input space, making it simpler for machine learning practioners to efficiently adapt algorithms for ICD coding.
Our Solution
We rely on the word2vec model to generate this embedding. In this set up, each ICD code represents a “word,” whereas a path sampled from breadth-first or depth-first search represents the “sentence.”
The Team
Jeremy Adams Fisher
Alhusain Abdalla
Natasha Nehra
Tejas Patel
Hamrish Saravanakumar
Features
Curated networkX graphs representing ICD hierarchies
A simple API to generate
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
History
0.1.0 (2020-09-04)
First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for icdcodex-0.1.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e134272485f9c6ea5d69814c582de52655a284dece0092152202bb4be248043d |
|
MD5 | ae26c7461672898d54a0c72cfb813058 |
|
BLAKE2b-256 | c87f7031a70373b14150a603d94a972a88794aaae7cd8ad6b0b6dfbaf9d06d83 |