Skip to main content

helix.personmatching

Project description

helix.personmatching

Continuous Integration GitHub license

Entrypoint

helix_personmatching/matchers/matcher.py

Inputs

This code takes in:

  1. A source Patient/Person resource or a Bundle of Patient/Person resources
  2. A target Patient/Person resource or a Bundle of Patient/Person resources
  3. A set of rule options/weights
  4. Whether to return only successful matches or all match results
  5. (Optional) Matching threshold (between 0 and 1). Can also be set as an environment variable: PERSON_MATCH_THRESHOLD
  6. (Optional) Average score boost (between 0 and 1). Can also be set as an environment variable: PERSON_MATCH_AVERAGE_SCORE_BOOST

Outputs

  1. A list of match results where id_source is id of source record and id_target is id of matched record.

Logic

  1. The code runs through each rule and calculates a uniqueness probability (between 0 and 1) if the rule matches.
  2. The code then picks the rule with the highest uniqueness probability and uses that probability
  3. The code then calculates the average of all the other rule probabilities and boosts the uniqueness probability by it.
  4. The code then sees if any other boosting rules were requested and boosts the probability by those

Weights

There are five cases when matching fields of two records in a rule and the weights that are applied for each case:

  1. Field present in both sides and match -> exact_match
  2. Field present in both sides and partial match -> partial_match
  3. Field missing in both sides -> missing
  4. Field missing in one side -> missing
  5. Field present in both sides but do not match -> 0.0

For boosting rules, there is an additional boost weight.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

helix_personmatching-1.0.50.tar.gz (46.1 kB view details)

Uploaded Source

Built Distribution

helix.personmatching-1.0.50-py3-none-any.whl (81.5 kB view details)

Uploaded Python 3

File details

Details for the file helix_personmatching-1.0.50.tar.gz.

File metadata

  • Download URL: helix_personmatching-1.0.50.tar.gz
  • Upload date:
  • Size: 46.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for helix_personmatching-1.0.50.tar.gz
Algorithm Hash digest
SHA256 d2496ee4393b60b182a1bcd053f99c1847fb22a6a9812f6861c04fdd4b02372d
MD5 ef7b9d325d98fb636d2b306a8a78df51
BLAKE2b-256 5bc62ea0169ba819875e910d64ec120605a63d694c04623bf017de39e2c55dc5

See more details on using hashes here.

File details

Details for the file helix.personmatching-1.0.50-py3-none-any.whl.

File metadata

File hashes

Hashes for helix.personmatching-1.0.50-py3-none-any.whl
Algorithm Hash digest
SHA256 3e76a2df28c4560d4946f989355cf25328038a21cdfbc0ef8da4075ca7e0372b
MD5 d8be7692c5298bcd86f54188dc762188
BLAKE2b-256 4d313098dae7c36f536e20789d2bedd93fc0edad4ed6c7f45a89a56ca7807688

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page