helix.personmatching

Entrypoint

helix_personmatching/matchers/matcher.py

Inputs

This code takes in:

  1. A source Patient/Person resource or a Bundle of Patient/Person resources
  2. A target Patient/Person resource or a Bundle of Patient/Person resources
  3. A set of rule options/weights
  4. Whether to return only successful matches or all match results
  5. (Optional) A matching threshold (between 0 and 1). It can also be set through the PERSON_MATCH_THRESHOLD environment variable.
  6. (Optional) An average score boost (between 0 and 1). It can also be set through the PERSON_MATCH_AVERAGE_SCORE_BOOST environment variable.
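
As an illustration of how the two optional settings might be resolved, here is a minimal sketch. The helper name resolve_unit_setting is hypothetical and not part of the package, and the precedence shown (an explicit argument wins over the environment variable) is an assumption; the description above only says that both routes exist and that the value lies between 0 and 1.

```python
import os
from typing import Mapping, Optional


def resolve_unit_setting(
    explicit: Optional[float],
    env_name: str,
    env: Optional[Mapping[str, str]] = None,
) -> Optional[float]:
    """Return a setting in [0, 1] from an argument or an environment variable.

    Hypothetical helper. Assumption: an explicit argument takes precedence
    over the environment variable; the package may resolve this differently.
    """
    env = os.environ if env is None else env
    if explicit is not None:
        value = float(explicit)
    elif env.get(env_name, "").strip():
        value = float(env[env_name])
    else:
        # Neither source supplied a value; the caller chooses a default.
        return None
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"{env_name} must be between 0 and 1, got {value}")
    return value
```

Calling resolve_unit_setting(None, "PERSON_MATCH_THRESHOLD") would then read the threshold from the environment, and the same helper works for PERSON_MATCH_AVERAGE_SCORE_BOOST.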

Outputs

  1. A list of match results, where id_source is the id of the source record and id_target is the id of the matched record.

Logic

  1. The code runs through each rule and, if the rule matches, calculates a uniqueness probability (between 0 and 1).
  2. The code then picks the rule with the highest uniqueness probability and uses that probability as the base score.
  3. The code then calculates the average of all the other rule probabilities and boosts the base score by it.
  4. Finally, the code checks whether any other boosting rules were requested and boosts the score by those as well.
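
Steps 2 and 3 can be sketched roughly as follows. This is not the package's implementation: the function name combine_rule_scores is hypothetical, and the exact way the average "boosts" the top score is not stated above. The sketch assumes the average of the remaining probabilities is scaled by the average score boost, added to the highest probability, and capped at 1. Step 4 is left out because the text does not say how boosting rules are combined.

```python
from statistics import mean
from typing import Sequence


def combine_rule_scores(
    rule_probabilities: Sequence[float],
    average_score_boost: float = 0.0,
) -> float:
    """Combine per-rule uniqueness probabilities into a single score.

    Assumed formula (not confirmed by the source):
        min(1, best + average_score_boost * mean(other probabilities))
    """
    if not rule_probabilities:
        return 0.0
    ordered = sorted(rule_probabilities, reverse=True)
    best, others = ordered[0], ordered[1:]
    # With a single rule there is nothing to average, so no boost applies.
    average_of_others = mean(others) if others else 0.0
    return min(1.0, best + average_score_boost * average_of_others)
```

Under these assumptions a record pair counts as a match when the combined score reaches the matching threshold.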

Weights

When a rule compares a field across two records, one of five cases applies, and each case has its own weight:

  1. Field present on both sides and the values match -> exact_match
  2. Field present on both sides and the values partially match -> partial_match
  3. Field missing on both sides -> missing
  4. Field missing on one side -> missing
  5. Field present on both sides but the values do not match -> 0.0
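
The five cases above can be expressed as a small lookup. The sketch below is illustrative only: FieldWeights and field_weight are hypothetical names, what counts as a partial match is supplied by the caller because the text does not define it, and treating an empty string as missing is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class FieldWeights:
    """Weights for one field of a rule (names taken from the list above)."""
    exact_match: float
    partial_match: float
    missing: float


def field_weight(
    source: Optional[str],
    target: Optional[str],
    weights: FieldWeights,
    is_partial_match: Callable[[str, str], bool],
) -> float:
    """Pick the weight for one field comparison."""
    # Cases 3 and 4: missing on both sides or on one side.
    # Assumption: None and the empty string both count as missing.
    if not source or not target:
        return weights.missing
    # Case 1: present on both sides and equal.
    if source == target:
        return weights.exact_match
    # Case 2: present on both sides and partially matching.
    if is_partial_match(source, target):
        return weights.partial_match
    # Case 5: present on both sides but different.
    return 0.0
```

For example, with a caller-supplied partial matcher that compares the first three characters, "Smith" against "Smithe" would receive the partial_match weight, while "Smith" against "Jones" would receive 0.0.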

For boosting rules, there is an additional boost weight.
