Skip to main content

Base module for parsing DICOM files in the pf* family.

Project description

Quick Overview

  • pfdicom is primarily a base module for other, more specialized classes. It typically is never used on its own.


pfdicom in and of itself provides minimal end value. This module/class is intended to be a building block for deeper functionality. Its purpose is to probe a given directory filesystem for DICOM files and construct a tree representation (using pftree), and then, for each directory provide the means to read in a given DICOM file (using pydicom) to provide some minimal tag extraction and output file templating.

Most importantly, derived classes of this parent class can provide detailed and powerful methods to process the directories containing DICOM files, saving results to an output file tree. For example, various modules are built off this chassis, including pfdicom_tagExtract and pfdicom_tagSub.



The following dependencies are installed on your host system/python3 virtual env (they will also be automatically installed if pulled from pypi):

  • pfmisc (various misc modules and classes for the pf* family of objects)
  • pftree (create a dictionary representation of a filesystem hierarchy)

Using PyPI

The best method of installing this script and all of its dependencies is by fetching it from PyPI

pip3 install pfdicom

Command line arguments

-I|--inputDir <inputDir>
Input DICOM directory to examine. By default, the first file in this
directory is examined for its tag information. There is an implicit
assumption that each <inputDir> contains a single DICOM series.

-i|--inputFile <inputFile>
An optional <inputFile> specified relative to the <inputDir>. If
specified, then do not perform a directory walk, but convert only
this file.

[-O|--outputDir <outputDir>]
The directory to contain all output files.

[-O|--outputDir <outputDir>]
The directory to contain all output files.

[--outputLeafDir <outputLeafDirFormat>]
If specified, will apply the <outputLeafDirFormat> to the output
directories containing data. This is useful to blanket describe
final output directories with some descriptive text, such as
'anon' or 'preview'.

This is a formatting spec, so

    --outputLeafDir 'preview-%s'

where %s is the original leaf directory node, will prefix each
final directory containing output with the text 'preview-' which
can be useful in describing some features of the output set.

-e|--extension <DICOMextension>
An optional extension to filter the DICOM files of interest from the

[-t|--threads <numThreads>]
If specified, break the innermost analysis loop into <numThreads>
threads. Please note the following caveats:

    * Only thread if you have a high CPU analysis loop. Since
      most of the operations of this module will entail reading
      and writing DICOM files, and since these operations are
      the bulk of the execution time, adding threading will not
      really help.

    * Threading will change the nature of the innermost looping
      across the problem domain, with the result that *all* of the
      problem data will be read into memory! That means all of
      DICOMs across all of the subdirs! In non-threading mode,
      only DICOMs from a single directory at a time are read
      and then discarded.

This flag is less applicable to this base class. It is here
to provide fall-through compatibility with derived classes.

Show full help.

Show brief help.

If true, dump the final return as JSON formatted string.

If specified, follow symbolic links.

If specified, print a version string.

-v|--verbosity <level>
Set the app verbosity level.

    0: No internal output;
    1: Most important internal output -- none for 'pfdicom';
    2: As with level '1' but with simpleProgress bar in 'pftree';
    3: As with level '2' but with list of input dirs/files in 'pftree';

String processing on tag values

pfidcom offers some functions on tag values – these are typically string based. The syntax is:


For example,


An md5 hash of the PatientID is determined. Of the resultant string, the first 7 chars are used. This is returned as the value for the PatientID tag.


The PatientBirthDate value is masked such that the first six chars are conserved, but the final two are replaced by ‘01’. This has the effect of setting the PatientBirthDate to the first day of the birth month.


The ProtocolName is processed to remove all white space, and using a ‘-‘ character instead of any whitespace components.


Run on a target tree, creating internal representations of specific file and directory strucutres.

pfdicom         -I /var/www/html                \
                -O /tmp                         \
                -o %PatientID-%PatientAge       \
                -e .dcm                         \
                -v 0 --json

which will output only at script conclusion and will log a JSON formatted string.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
pfdicom-1.4.14.tar.gz (10.7 kB) Copy SHA256 hash SHA256 Source None Aug 3, 2018

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page