Flywheel metadata extraction.

These details have not been verified by PyPI

Project links

Project description

fw-meta

Extract Flywheel upload metadata from fw_file File objects or any mapping that has a dict-like interface.

The most common use case is scraping Flywheel group and project information from DICOM tags where it was entered by a researcher at scan time through a scanner's UI.

The group and project is required for placing (aka. routing) uploaded files correctly within the Flywheel hierarchy.

Installation

Add as a poetry dependency to your project:

poetry add fw-meta

Usage

Given

DICOM context
PatientID being an available and unused field on the scanner's UI
"neuro/Amnesia" being entered in PatientID
using the recommended extraction pattern "[fw://]{group}[/{project}[/{subject}[/{session}[/{acquisition}]]]]"

The extracted metadata should be {"group._id": "neuro", "project.label": "Amnesia"}:

from fw_meta import extract_meta

pattern = "[fw://]{group}[/{project}[/{subject}[/{session}[/{acquisition}]]]]"
data = dict(PatientID="neuro/Amnesia")
meta = extract_meta(data, mappings={"PatientID": pattern})
meta == {"group._id": "neuro", "project.label": "Amnesia"}

Source fields

Metadata can be extracted from any source field such as the tag values in the case of DICOMs. Selecting an appropriate DICOM tag comes down to ones that are:

available fields on the scanner UI
allow entering the routing string (ie. long / versatile enough)
not currently used by researchers (or repurposable)

Some recommended tags that worked well previously:

PatientID
PatientComments
StudyComments
ReferringPhysicianName

Extraction pattern mappings

Extraction patterns are simplified python regexes tailored for scraping Flywheel metadata fields like group._id and project.label from a string using capture groups.

The pattern syntax is shown through a series of examples below. All cases assume the following context:

from fw_meta import extract_meta
data = dict(PatientID="neuro_amnesia")

Extracting a whole string as-is is the simplest use case. For example, get "neuro_amnesia" - the value of PatientID into a single Flywheel field like group._id - here the pattern simply becomes the target field, group._id:

meta = extract_meta(data, mappings={"PatientID": "group._id"})
meta == {"group._id": "neuro_amnesia"}

The simplified capture group notation using {curly braces} gives more flexibility to the patterns, allowing substrings to be ignored for example:

meta = extract_meta(data, mappings={"PatientID": "{group}_*"})
meta == {"group._id": "neuro"}  # "_amnesia" was not captured in the group

Note how the pattern group resulted in the extraction of group._id. This is because Flywheel groups are most commonly routed to by their _id field, and two aliases, group and group.id are configured to allow for simpler and more legible capture patterns.

The simplified optional notation using [square brackets] allows patterns to match with or without an optional part:

# the PatientID doesn't contain 2 underscores - the pattern matches w/o subject
pattern = "{group}_{project}[_{subject}]"
meta = extract_meta(data, mappings={"PatientID": pattern})
meta == {"group._id": "neuro", "project.label": "amnesia"}

# the PatientID contains the optional part thus the subject also gets extracted
data = dict(PatientID="neuro_amnesia_subject")
meta = extract_meta(data, mappings={"PatientID": pattern})
meta == {"group._id": "neuro", "project.label": "amnesia", "subject.label": "subject"}

The recommended extraction pattern has both capture curlies and optional brackets: "[fw://]{group}[/{project}[/{subject}[/{session}[/{acquisition}]]]]" This pattern is:

prefix-consistent with the fw://group/Project as displayed on the UI
usable as an opt-in filter only including data if the value starts with fw://
flexible enough to route to the correct group without the project
flexible enough to specify custom subject/session/acquisition labels

Extracting multiple meta fields from a single value can be done by adding multiple groups with curly braces in the pattern. The following example captures the group and the project separated by an underscore:

meta = extract_meta(data, mappings={"PatientID": "{group}_{project}"})
meta == {"group._id": "neuro", "project.label": "amnesia"}

Extracting a single meta field from multiple values is also possible by treating the left-hand-side as an f-string template to be formatted. This example extracts acquisition.label as the concatenation of SeriesNumber and SeriesDescription:

data = dict(SeriesNumber="3", SeriesDescription="foo")
meta = extract_meta(data, mappings={"{SeriesNumber} - {SeriesDescription}": "acquisition"})
meta == {"acquisition.label": "3 - foo"}

Note that if any of the values appearing in the template are missing, then the whole pattern is considered non-matching and will be skipped.

The same capture group may appear in multiple patterns providing a fallback mechanism where the first non-empty match wins. For example to extract session.label from StudyComments when it's available, but fall back to using StudyDate if it isn't:

data = dict(StudyDate="20001231", StudyComments="foo")
meta = extract_meta(data, mappings=[("StudyComments", "session"), ("StudyDate", "session")])
meta == {"session.label": "foo"}

data = dict(StudyDate="20001231")  # no StudyComments
meta = extract_meta(data, mappings=[("StudyComments", "session"), ("StudyDate", "session")])
meta == {"session.label": "20001231"}  # fall back to StudyDate

Capture groups may have a regex defining what substrings the group should match on:

# match whole string into subject IF it starts with an "s" and is digits after
pattern = "{subject:s\d+}"
data = dict(PatientID="s123")  # should match
meta = extract_meta(data, mappings={"PatientID": pattern})
meta == {"subject.label": "s123"}

data = dict(PatientID="foobar")  # should not match
meta = extract_meta(data, mappings={"PatientID": pattern})
meta == {}

Timestamps are parsed with dateutil.parser. This allows extracting the session.timestamp and acquisition.timestamp metadata fields with minimal configuration:

data = dict(path="/data/20001231133742/file.txt")
pattern = "/data/{acquisition.timestamp}/*"
meta = extract_meta(data, mappings={"path": pattern})
meta == {
    "acquisition.timestamp": "2000-12-31T13:37:42+01:00",
    "acquisition.timezone": "Europe/Budapest",
}

Note that the timezone was auto-populated and the timestamp got localized - see the config section below for more details and options.

Timestamps may be parsed using an strptime pattern to enable loading any formats that might not be handled via dateutil.parser:

data = dict(path="/data/20001231_133742_12345/file.txt")
pattern = "/data/{acquisition.timestamp:%Y%m%d_%H%M%S_%f}/*"
meta = extract_meta(data, mappings={"path": pattern})
meta == {
    "acquisition.timestamp": "2000-12-31T13:37:42.123450+01:00",
    "acquisition.timezone": "Europe/Budapest",
}

Defaults

Some scenarios benefit from setting a default metadata value as a fallback even if one could not be extracted via a pattern. An example is routing any DICOM from scanner "A" that doesn't have a routing string to a group/project pre-created and designated for the data instead of the Unknown group and/or Unsorted project.

meta = extract_meta({}, mappings={"PatientID": "group"})
meta == {}  # PatientID is empty - no group._id extracted

meta = extract_meta({}, mappings={"PatientID": "group"}, defaults={"group": "default"})
meta == {"group._id": "default"}  # group._id defaulted

Configuration

Timestamp metadata fields session.timestamp and acquisition.timestamp are always accompanied by a timezone (session.timezone / acquisition.timezone).

When dealing with zone-naive timestamps, fw-meta assumes they belong to the the currently configured local timezone which is common practice with DICOMs and other medical data. The local timezone is retrieved using tzlocal and defaults to UTC if it's not available.

Setting the environment variable TZ to a timezone name from the tz database can be used to explicitly override the timezone used to localize any tz-naive timestamps with.

Development

Install the package and it's dependencies using poetry and enable pre-commit:

poetry install
pre-commit install

License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

4.12.0

Jun 30, 2026

4.11.0

Jun 10, 2026

4.10.0

Mar 13, 2026

4.9.8

Feb 12, 2026

4.9.7

Feb 9, 2026

4.9.6

Feb 2, 2026

4.9.5

Jan 30, 2026

4.9.4

Jan 14, 2026

4.9.3

Jan 10, 2026

4.9.2

Jan 10, 2026

4.9.1

Jan 10, 2026

4.9.0

Jan 9, 2026

4.8.3

Dec 23, 2025

4.8.2

Nov 19, 2025

4.8.1

Nov 14, 2025

4.8.0

Nov 6, 2025

4.7.2

Nov 3, 2025

4.7.1

Sep 26, 2025

4.7.0

Sep 23, 2025

4.6.0

Jul 8, 2025

4.5.1

Mar 25, 2025

4.5.0

Feb 28, 2025

4.4.0

Jan 31, 2025

4.3.0

Jan 28, 2025

4.2.2

Aug 29, 2024

4.2.1

Aug 27, 2024

4.2.0

Aug 26, 2024

4.1.2

Aug 23, 2024

4.1.1

Oct 30, 2023

4.1.0

Sep 25, 2023

4.0.1

Jul 31, 2023

4.0.0

Jul 21, 2023

3.3.2

Jun 30, 2023

3.3.1

May 17, 2023

3.3.0

Apr 27, 2023

3.2.0

Apr 6, 2023

3.1.3

Feb 20, 2023

3.1.2

Jan 18, 2023

3.1.1

Jan 18, 2023

3.1.0

Jan 13, 2023

3.0.1

Dec 13, 2022

3.0.0

Dec 12, 2022

2.1.1

Nov 21, 2022

2.1.0

Oct 5, 2022

2.0.8

Jun 28, 2022

2.0.7

Jun 16, 2022

2.0.6

Apr 20, 2022

2.0.5

Feb 17, 2022

2.0.4

Nov 26, 2021

2.0.3

Nov 25, 2021

2.0.2

Nov 25, 2021

2.0.1

Nov 18, 2021

2.0.0

Nov 18, 2021

1.0.3

Nov 4, 2021

1.0.2

Oct 22, 2021

1.0.1

Oct 8, 2021

1.0.0

Aug 10, 2021

0.3.2

May 12, 2021

0.3.1

Mar 22, 2021

0.3.0

Mar 17, 2021

0.2.1

Mar 4, 2021

0.2.0

Feb 22, 2021

0.1.0

Feb 17, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fw_meta-4.12.0-py3-none-any.whl (19.8 kB view details)

Uploaded Jun 30, 2026 Python 3

File details

Details for the file fw_meta-4.12.0-py3-none-any.whl.

File metadata

Download URL: fw_meta-4.12.0-py3-none-any.whl
Upload date: Jun 30, 2026
Size: 19.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.4.1 CPython/3.13.14 Linux/5.15.154+

File hashes

Hashes for fw_meta-4.12.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f1a37264fd9ed483018eae485409056cbbf459675ccf8eca54529b320810d94c`
MD5	`595b8079857c2c3299d78d601746e070`
BLAKE2b-256	`3832ec4215c157d32a7f65309724eb1dd426a5864cdd738296f202b53c77540a`

See more details on using hashes here.

fw-meta 4.12.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

fw-meta

Installation

Usage

Source fields

Extraction pattern mappings

Defaults

Configuration

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes