Skip to main content

Add a metadata layer to data entry

Project description

pypi actions codecov downloads

edc_metadata

edc-metadata puts a “metadata” layer on top of your data collection forms, namely CRFs and Requisitions. The “metadata” is used on the data entry dashboard (see also edc_dashboard). The metadata may be queried directly by a data manager to review the completion status of CRF and Requisition forms.

  • Metadata is stored in two models, CrfMetaData and RequisitionMetaData. One metadata record is created per form per visit. Metadata is only created for the data collection forms of a visit as defined in the visit schedule.

  • Metadata model instances are created for each visit when the visit model is saved. edc_metadata reads from the visit_schedule to decide which data collection form metadata model instances to create for a visit. (Note: See edc_visit_schedule)

  • Metadata is guaranteed to exist for every form defined in a visit after the visit form has been submitted.

metadata model instances

Each metadata model instance, CrfMetadata or RequisitionMetadata, is managed by an actual CRF or REQUISITION model listed in the visit_schedule. CrfMetadata` model instances are created for each CRF listed in the visit schedule. That is, if the visit schedule schedules a CRF for 5 different visits, 5 ``CrfMetadata` model instances will eventually be created. Metadata model instances are created when the ``visit model for a timepoint is saved. When you save a CRF within a visit, the entry_status of the the metadata instance`s it manages is updated from REQUIRED to KEYED.

The same applies to RequisitionMetadata for REQUISITIONS.

Entry status

By default the entry_status field attribute is set to REQUIRED. You can change the default of each CRF to NOT_REQUIRED in your declaration in the visit schedule. See visit_schedule.crf.

The same applies to REQUISITIONS.

metadata_rules manipulate metadata model instances

metadata_rules are declared to manipulate metadata model instances. The rules change the entry_status field attribute from REQUIRED to NOT_REQUIRED or visa-versa. If the manager of the metadata instance, the CRF or REQUISITION model instance, exists, the entry status is updated to KEYED``and the ``metadata_rules targeting the metadata instance are ignored. metadata rules are run on each save of the visit and managing model instances. If a value on some other form implies that your form should not be completed, your form`s metadata “entry_status” will change from REQUIRED to NOT REQUIRED upon save of the other form. Metadata is updated through a post_save signal that re-runs the metadata rules.

See also edc_metadata_rules

Getting started

Models: Visit, Crfs and Requisitions

Let`s prepare the models that will be used in the scheduled data collection. These models are your visit models, crf models and requisition models.

Your application also has one or more Visit models. Each visit model is declared with the CreatesMetadataModelMixin:

class SubjectVisit(CreatesMetadataModelMixin, PreviousVisitMixin, VisitModelMixin,
                   RequiresConsentModelMixin, BaseUuidModel):

    appointment = models.OneToOneField(Appointment)

    class Meta(RequiresConsentModelMixin.Meta):
        app_label = 'example'

Your Crf models are declared with the CrfModelMixin:

class CrfOne(CrfModelMixin, BaseUuidModel):

    subject_visit = models.ForeignKey(SubjectVisit)

    f1 = models.CharField(max_length=10, default='erik')

    class Meta:
        app_label = 'example'

Your Requisition models are declared with the RequisitionModelMixin:

class SubjectRequisition(RequisitionModelMixin, BaseUuidModel):

    subject_visit = models.ForeignKey(SubjectVisit)

    f1 = models.CharField(max_length=10, default='erik')

    class Meta:
        app_label = 'example'

metadata_rules

As described above, metadata_rules manipulate the entry_status of CRF and Requisition metadata. metadata_rules are registered to site_metadata_rules in module metadata_rules.py. Place this file in the root of your app. Each app can have one metadata_rules.py.

See also edc_metadata_rules

autodiscovering metadata_rules

AppConfig will autodiscover the rule files and print to the console whatever it finds:

  • checking for metadata_rules …

  • registered metadata_rules from application ‘edc_example’

Inspect metadata_rules

Inspect metadata_rules from the site registry:

>>> from edc_metadata.rules.site_metadata_rules import site_metadata_rules

>>> for rule_groups in site_metadata_rules.registry.values():
>>>    for rule_group in rule_groups:
>>>        print(rule_group._meta.rules)

(<edc_example.rule_groups.ExampleRuleGroup: crfs_male>, <edc_example.rule_groups.ExampleRuleGroup: crfs_female>)
(<edc_example.rule_groups.ExampleRuleGroup2: bicycle>, <edc_example.rule_groups.ExampleRuleGroup2: car>)

Writing metadata_rules

metadata_rules are declared in a RuleGroup. The syntax is similar to the django model class.

Let`s start with an example from the perspective of the person entering subject data. On a dashboard there are 4 forms (models) to complete. The “rule” is that if the subject is male, only the first two forms should be complete. If the subject is female, only the last two forms should be complete. So the metadata should show:

Subject is Male:

  • crf_one - REQUIRED, link to entry screen available

  • crf_two - REQUIRED, link to entry screen available

  • crf_three - NOT REQUIRED, link to entry screen not available

  • crf_four - NOT REQUIRED, link to entry screen not available

Subject is Female:

  • crf_one - NOT REQUIRED

  • crf_two - NOT REQUIRED

  • crf_three - REQUIRED

  • crf_four - REQUIRED

A Rule that changes the metadata if the subject is male would look like this:

crfs_male = CrfRule(
    predicate=P('gender', 'eq', 'MALE'),
    consequence=REQUIRED,
    alternative=NOT_REQUIRED,
    target_models=['crfone', 'crftwo'])

The rule above has a predicate that evaluates to True or not. If gender is equal to MALE the consequence is REQUIRED, else NOT_REQUIRED. For this rule, for a MALE, the metadata entry_status for crf_one and crf_two will be updated to REQUIRED. For a FEMALE both will be set to NOT_REQUIRED.

Rules are declared as attributes of a RuleGroup much like fields in a django model:

@register()
class ExampleRuleGroup(CrfRuleGroup):

    crfs_male = CrfRule(
        predicate=P('gender', 'eq', 'MALE'),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfone', 'crftwo'])

    crfs_female = CrfRule(
        predicate=P('gender', 'eq', FEMALE),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfthree', 'crffour'])

    class Meta:
        app_label = 'edc_example'

RuleGroup class declarations are placed in file metadata_rules.py in the root of your application. They are registered in the order in which they appear in the file. All rule groups are available from the site_metadata_rules global.

IMPORTANT If the related visit model (e.g. SubjectVisit) has a different app_label than Meta.app_label, a RuleGroupError will be raised because the RuleGroup assumes the app_labels are the same. To avoid this, specify the related visit model label_lower on Meta.

For example:

@register()
class ExampleRuleGroup(CrfRuleGroup):

    crfs_male = CrfRule(
        predicate=P('gender', 'eq', 'MALE'),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfone', 'crftwo'])

    class Meta:
        app_label = 'edc_example'
        related_visit_model = "edc_visit_tracking.subjectvisit"

Inheritance

When using single inheritance, set Meta class abstract on the base class:

class ExampleRuleGroup(CrfRuleGroup):

    crfs_male = CrfRule(
        predicate=P('gender', 'eq', 'MALE'),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfone', 'crftwo'])

    class Meta:
        abstract = True


class MyRuleGroup(ExampleRuleGroup):
    class Meta:
        app_label = 'edc_example'
        related_visit_model = "edc_visit_tracking.subjectvisit"

More on Rules

The rule consequence and alternative accept these values:

from edc_metadata.constants import REQUIRED, NOT_REQUIRED
from edc_metadata.rules.constants import DO_NOTHING
  • REQUIRED

  • NOT_REQUIRED

  • DO_NOTHING

It is recommended to write the logic so that the consequence is REQUIRED if the predicate evaluates to True.

In the examples above, the rule predicate can only access values that can be found on the subjects`s current visit instance or registered_subject instance. If the value you need for the rule predicate is not on either of those instances, you can pass a source_model. With the source_model declared you would have these data available:

  • current visit model instance

  • registered subject (see edc_registration)

  • source model instance for the current visit

Let`s say the rules changes and instead of refering to gender (male/female) you wish to refer to the value field of favorite_transport on model CrfTransport. favorite_transport can be “car” or “bicycle”. You want the first rule predicate to read as:

  • If favorite_transport is equal to bicycle then set the metadata entry_status for crf_one and crf_two to REQUIRED, if not, set both to NOT_REQUIRED

and the second to read as:

  • If favorite_transport is equal to car then set the metadata entry_status for crf_three and crf_four to REQUIRED, if not, set both to NOT_REQUIRED.

The field for car/bicycle, favorite_transport is on model CrfTransport. The RuleGroup might look like this:

@register()
class ExampleRuleGroup(RuleGroup):

    bicycle = CrfRule(
        predicate=P('favorite_transport', 'eq', 'bicycle'),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfone', 'crftwo'])

    car = CrfRule(
        predicate=P('favorite_transport', 'eq', car),
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfthree', 'crffour'])

    class Meta:
        app_label = 'edc_example'
        source_model = 'CrfTransport'

Note that CrfTransport is a crf model in the Edc. That is, it has a foreign key to the visit model. Internally the query will be constructed like this:

# source model instance for the current visit
visit_attr = 'subject_visit'
source_obj = CrfTansport.objects.get(**{visit_attr: visit})

# queryset of source model for the current subject_identifier
visit_attr = 'subject_visit'
source_qs = CrfTansport.objects.filter(**{'{}__subject_identifier'.format(visit_attr): subject_identifier})
  • If the source model instance does not exist, the rules in the rule group will not run.

  • If the target model instance exists, no rule can change it`s metadata from KEYED.

More Complex Rule Predicates

There are two provided classes for the rule predicate, P and PF. With P you can make simple rule predicates like those used in the examples above. All standard opertors can be used.

For example:

predicate = P('gender', 'eq', 'MALE')
predicate = P('referral_datetime', 'is not', None)
predicate = P('age', '<=', 64)

If the logic needs to a bit more complicated, the PF class allows you to pass a lambda function directly:

predicate = PF('age', func=lambda x: True if x >= 18 and x <= 64 else False)

predicate = PF('age', 'gender', func=lambda x, y: True if x >= 18 and x <= 64 and y == MALE else False)

Rule predicates as functions

If the logic needs to be more complicated than is recommended for a simple lambda, you can just pass a function. When writing your function just remember that the rule predicate must always evaluate to True or False.

The function will be called with:

  • visit: the related_visit model instance

  • registered_subject: the instance for the current subject

  • source_obj: the model instance who triggered the post_save signal

  • source_qs

def my_func(visit, registered_subject, source_obj, source_qs) -> bool:
    if registered_subject.age_in_years >= 18 and registered_subject.gender == FEMALE:
        return True
    return False

The function is then called on the RuleGroup like this:

@register()
class ExampleRuleGroup(RuleGroup):

    some_rule = CrfRule(
        predicate=my_func,
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['crfone', 'crftwo'])

    class Meta:
        app_label = 'edc_example'
        source_model = 'CrfTransport'

Grouping rule predicate functions with PredicateCollection

If you have many RuleGroups and predicate functions, it is useful to collect your predicate functions into a class:

class Predicates:
    household_head_model = "edc_he.healtheconomicshouseholdhead"
    patient_model = "edc_he.healtheconomicspatient"

    @property
    def hoh_model_cls(self):
        return django_apps.get_model(self.household_head_model)

    @property
    def patient_model_cls(self):
        return django_apps.get_model(self.patient_model)

    def patient_required(self, visit, **kwargs) -> bool:
        required = False
        if (
            self.hoh_model_cls.objects.filter(
                subject_visit__subject_identifier=visit.subject_identifier
            ).exists()
            and not self.patient_model_cls.objects.filter(
                subject_visit__subject_identifier=visit.subject_identifier
            ).exists()
        ):
            required = hoh_obj.hoh == YES
        return required

then you might do something like this in your metadata_rules module:

pc = Predicates()

@register()
class ExampleRuleGroup(RuleGroup):

    some_rule = CrfRule(
        predicate=pc.household_head_required,
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['healtheconomicshouseholdhead'])

    some_other_rule = CrfRule(
        predicate=pc.patient_required,
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['healtheconomicspatient'])

    class Meta:
        app_label = 'edc_he'
        source_model = "edc_he.healtheconomics"
        related_visit_model = "edc_visit_tracking.subjectvisit"

Setting a custom PredicateCollection for a RuleGroup using Meta

If a RuleGroup has its own Predicate class you can declare it on the Meta class. Set the predicate attribute to the name of the function to call.

@register()
class ExampleRuleGroup(RuleGroup):

    some_rule = CrfRule(
        predicate="household_head_required",
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['healtheconomicshouseholdhead'])

    some_other_rule = CrfRule(
        predicate="patient_required",
        consequence=REQUIRED,
        alternative=NOT_REQUIRED,
        target_models=['healtheconomicspatient'])

    class Meta:
        app_label = 'edc_he'
        source_model = "edc_he.healtheconomics"
        related_visit_model = "edc_visit_tracking.subjectvisit"
        predicates = Predicates()

Rule Group Order

IMPORTANT: RuleGroups are evaluated in the order they are registered and the rules within each rule group are evaluated in the order they are declared on the RuleGroup.

Updating metadata

It is a good idea to updata metadata after code changes and data migrations. To do so just run the management command:

python manage.py update_metadata

Testing

Since the order in which rules run matters, it is essential to test the rules together. See tests for some examples. When writing tests it may be helpful to know the following:

  • the standard Edc model configuration assumes you have consent->enrollment->appointments->visit->crfs and requisitions.

  • rules can be instected after boot up in the global registry site_metadata_rules.

  • all rules are run when the visit is saved.

More examples

See edc_example for working RuleGroups and how models are configured with the edc_metadata mixins. The tests in edc_metadata.rules use the rule group and model classes in edc_example.

Notes on Edc

The standard Edc model configuration assumes you have a data entry flow like this::

consent->enrollment->appointment->visit (1000)->crfs and requisitions
                     appointment->visit (2000)->crfs and requisitions
                     appointment->visit (3000)->crfs and requisitions
                     appointment->visit (4000)->crfs and requisitions

You should also see the other dependencies, edc_consent, edc_visit_schedule, edc_appointment, edc_visit_tracking, edc_metadata, etc.

Signals

In the signals file:

visit model ``post_save``:

  • Metadata is created for a particular visit and visit code, e.g. 1000, when the visit model is saved for a subject and visit code using the default entry_status configured in the visit_schedule.

  • Immediately after creating metadata, all rules for the app_label are run in order. The app_label is the app_label of the visit model.

crf or requisition model ``post_save``:

  • the metadata instance for the crf/requisition is updated and then all rules are run.

crf or requisition model ``post_delete``:

  • the metadata instance for the crf/requisition is reset to the default entry_status and then all rules are run.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edc_metadata-0.3.97.tar.gz (92.5 kB view details)

Uploaded Source

Built Distribution

edc_metadata-0.3.97-py3-none-any.whl (137.2 kB view details)

Uploaded Python 3

File details

Details for the file edc_metadata-0.3.97.tar.gz.

File metadata

  • Download URL: edc_metadata-0.3.97.tar.gz
  • Upload date:
  • Size: 92.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for edc_metadata-0.3.97.tar.gz
Algorithm Hash digest
SHA256 969aad476ecacaa93285df34bbc6320291db2feda3ee06fda431544f620929f8
MD5 1fecbf3b58553bb0dfdf15de331d3157
BLAKE2b-256 223dfcbbccac7d7dca4afda0611423f93a410a629b625422583c75ebd90ee6cb

See more details on using hashes here.

File details

Details for the file edc_metadata-0.3.97-py3-none-any.whl.

File metadata

File hashes

Hashes for edc_metadata-0.3.97-py3-none-any.whl
Algorithm Hash digest
SHA256 ae8bb05960143ec59af742441a1d6772d149a348ca7d7eac11321d3ff68d6e41
MD5 96fa734dc8072b562956e85808fd277f
BLAKE2b-256 e5d8823d902207a91aaf1f9e81689d403865d118b8ffed5530fefa2fcfea3d26

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page