helix.personmatching
Project description
helix.personmatching
Entrypoint
helix_personmatching/matchers/matcher.py
Inputs
This code takes in:
- A source Patient/Person resource or a Bundle of Patient/Person resources
- A target Patient/Person resource or a Bundle of Patient/Person resources
- A set of rule options/weights
- Whether to return only successful matches or all match results
- (Optional) Matching threshold (between 0 and 1). Can also be set as an environment variable:
PERSON_MATCH_THRESHOLD
- (Optional) Average score boost (between 0 and 1). Can also be set as an environment variable:
PERSON_MATCH_AVERAGE_SCORE_BOOST
Outputs
- A list of match results where id_source is id of source record and id_target is id of matched record.
Logic
- The code runs through each rule and calculates a uniqueness probability (between 0 and 1) if the rule matches.
- The code then picks the rule with the highest uniqueness probability and uses that probability
- The code then calculates the average of all the other rule probabilities and boosts the uniqueness probability by it.
- The code then sees if any other boosting rules were requested and boosts the probability by those
Weights
There are five cases when matching fields of two records in a rule and the weights that are applied for each case:
- Field present in both sides and match ->
exact_match
- Field present in both sides and partial match ->
partial_match
- Field missing in both sides ->
missing
- Field missing in one side ->
missing
- Field present in both sides but do not match -> 0.0
For boosting rules, there is an additional boost
weight.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for helix.personmatching-1.0.34.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | be4e6050675a831678e9e33cb94c9db2d5015f2936ec09b771d9f4a13c9e6b62 |
|
MD5 | b0b732b15ee909c915aaaa047fd32c43 |
|
BLAKE2b-256 | c14ad22387cb029d506afe248e55a19f5d0f73b3d7785283a10fb1ca0e27b525 |
Close
Hashes for helix.personmatching-1.0.34-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f2130fc0f4f94b9f6f5b7c0295e87e811264dedd3a21cdc97f02d495a5399575 |
|
MD5 | 07957b48b8423d54a79c767e117c3550 |
|
BLAKE2b-256 | 3f46fc11a1638c814f4077bcaee8a86b2d4db42d60510bd77b8e52eefb9a4d1c |