Skip to main content

De-identification toolkit for clinical text in Hebrew

Project description

HebSafeHarbor

A de-identification toolkit for clinical text in Hebrew.

MIT license Release PyPI version Pypi Downloads

Installation

To install the package, clone the repo and install all dependencies, preferably in a virtual environment:

# Create conda env (optional)
conda create --name hebsafeharbor python=3.8
conda activate hebsafeharbor

# Install dependencies
pip install -r requirements.txt

# (Optional) Install package locally
pip install -e .

# Download the he_ner_news_trf model used by hebsafeharbor
pip install https://github.com/8400TheHealthNetwork/HebSpacy/releases/download/he_ner_news_trf-3.2.1/he_ner_news_trf-3.2.1-py3-none-any.whl

Getting started

Python

from hebsafeharbor import HebSafeHarbor

hsh = HebSafeHarbor()

text = """שרון לוי התאשפזה ב02.02.2012 וגרה בארלוזרוב 16 רמת גן"""
doc = {"text": text}

output = hsh([doc])

print(output)

Docker

  1. Run the following command to install and run the server and demo services:
docker-compose up -d --build
  1. Navigate in the browser http://localhost:8080/docs to test the Server swagger
  2. Navigate in the browser http://localhost:8501 to test the Demo app

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

hebsafeharbor-0.0.2-py3-none-any.whl (58.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page