Utilities for parsing files in a directory based on a file name pattern.

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Filepattern Utility

PyPI - Downloads Bower

The filepattern Python utility is designed to information stored in file names. A filepattern is essentially a simplified regular expression with named groups, and regular expressions are valid filepattern expressions provided they do not use groups.

The utility was born from the need to manipulate and organize image data from a variety of microscopes, all of which have a systematic but different file naming conventions. This made abstracting things like image stitching algorithms easier to apply to files with disparate naming conventions by simply changing the filepattern rather than generating new code to parse each new naming convention. Although filepattern was born to wield against image data, it is not limited to image data, and can handle filenames with any extension.

Summary

Read the docs!
Install
Versioning
Authors
License
Acknowledgments

Install

This utility is built in pure Python with no dependencies.

pip install filepattern

Getting Started

What does a filepattern look like? It is probably easiest to show by example. Say there is a folder with the following files:

my_data_folder/x000_y000_z001.tif
my_data_folder/x000_y000_z002.tif
my_data_folder/x000_y000_z003.tif

The filepattern for the above files would be x000_y000_z00{z}.ome.tif. The curly brackets indicate a file name variable, and {z} indicates that the number will be parsed and stored as a z value. If a similar regular expression were to be written, then it would look like x000_y000_z00([0-9]).ome.tif, which is not only longer but would require more code to parse the regular expression.

To easily loop over the values, a FilePattern object can be created and used to iterate over the files in order.

import filepattern, pathlib

pattern = 'x000_y000_z00{z}.ome.tif'
path_to_files = pathlib.Path('/path/to/files')

fp = filepattern.FilePattern(path_to_files,pattern)

# Loop over all files that match the pattern
for files in fp():

    # Files contains a list of all files with identical z-value
    # In this case, there should only be one so select the first item
    file = files[0]

    # Each value in files is a dictionary containing the filename under the
    # "file" key, and the z-value extracted from the file name under the "z" key
    print(f"File {file['file']} has z-value {file['z']}")

The output should be as follows:

File my_data_folder/x000_y000_z001.tif has z-value 0
File my_data_folder/x000_y000_z002.tif has z-value 1
File my_data_folder/x000_y000_z003.tif has z-value 2

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

Authors

Nick Schaub (nick.schaub@nih.gov, nick.schaub@labshare.org)

License

This project is licensed under the MIT License Creative Commons License - see the LICENSE file for details

Acknowledgments

This utility was inspired by the notation found in the MIST algorithm developed at NIST.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.0.5

Feb 7, 2024

2.0.4

Nov 8, 2023

2.0.3

Oct 31, 2023

2.0.2

Oct 31, 2023

2.0.1

Aug 9, 2023

2.0.0

May 10, 2023

This version

1.4.7

Aug 5, 2021

1.4.6

Apr 12, 2021

1.4.5

Apr 6, 2021

1.4.4

Feb 2, 2021

1.4.3

Jan 26, 2021

1.4.2

Jan 22, 2021

1.4.1

Jan 22, 2021

1.4.0

Jan 22, 2021

1.3.2

Jan 18, 2021

1.3.1

Jan 18, 2021

1.3.0

Jan 16, 2021

1.2.4

Jul 22, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

filepattern-1.4.7.tar.gz (16.1 kB view hashes)

Uploaded Aug 5, 2021 Source

Built Distribution

filepattern-1.4.7-py3-none-any.whl (15.9 kB view hashes)

Uploaded Aug 5, 2021 Python 3

Hashes for filepattern-1.4.7.tar.gz

Hashes for filepattern-1.4.7.tar.gz
Algorithm	Hash digest
SHA256	`2ab0a01d6e0a9a936cb0edf49945593a318f56adcb295bceee8d59b1047128d7`
MD5	`fab71985d696838c866c8ddd559ddabb`
BLAKE2b-256	`3ee061a5b533f87c3f508dac97ae9434878324f4b14ca88af1acb44741402d78`

Hashes for filepattern-1.4.7-py3-none-any.whl

Hashes for filepattern-1.4.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e4afbc4af689252635c65c4c2d80402c04267ae16020f980b1efb99d5161bd6b`
MD5	`8ec33913785920d443b84968abeacabc`
BLAKE2b-256	`45d00e65905a8a5bc0ed9e6a7c4641c9f8d02201183870539a3b6117bbed12b6`