Managing files and file series

About

filo is a Python 3 package for file management. Its main purpose is to provide a Series class that manages series of files (e.g. series of images or series of spectra), each file being described by a custom File class. In particular, the time of each file is detected automatically and can be accessed in a pandas DataFrame.

The package also provides a ResultsBase base class to store data and metadata and save/load them into/from files.

Some other useful functions for file management are also provided. See summary of functions and classes below, and associated docstrings for details.

Install

pip install filo

`Series` class

Class to manage series of files of the same type (e.g. image series or spectra series from time-lapse experiments), possibly spread out across multiple folders. The main purpose of the class is to be subclassed in other modules specialized for analysis of specific experiment types, but it can be used as is, e.g. to extract timing info of series of files.

The main idea is that files are attributed a unique identifier (num) that starts at 0 in the first folder. Each file is described by an object of the File class that stores file path, identifier, and a time attribute.

Note: the time attribute is extracted automatically from the file's st_mtime, i.e. its last modification time, which is used here as the creation time of the file. It can be overwritten with external information, or defined differently by overriding the _measure_times() method in a subclass.
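The numbering and timing described above can be sketched with the standard library alone. This is a minimal illustration, not filo's implementation: the names NumberedFile and build_series_sketch are hypothetical, and sorting files by name within each folder is an assumption.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List, Optional


@dataclass
class NumberedFile:
    """Hypothetical stand-in for a file object: path, identifier, time."""
    file: Path
    num: int
    time: Optional[float] = None


def build_series_sketch(folders, extension=""):
    """Number files consecutively across folders, then read their times."""
    files: List[NumberedFile] = []
    for folder in folders:
        # Assumption: files are ordered by name within each folder
        paths = sorted(
            p for p in Path(folder).iterdir()
            if p.is_file() and p.name.endswith(extension)
        )
        for path in paths:
            # The identifier keeps counting from one folder to the next
            files.append(NumberedFile(file=path, num=len(files)))
    for f in files:
        # st_mtime is the last modification time, in unix seconds (float)
        f.time = f.file.stat().st_mtime
    return files
```

With two folders containing 2 and 1 matching files, the identifiers would be 0 and 1 in the first folder and 2 in the second.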

The file objects are stored in the list Series.files, which contains all filo.File objects (Series.files[i] is the file object with identifier num=i). The correspondence between identifier, actual files, and file times is summarized in the Series.info attribute, which is a pandas DataFrame tied to Series.files and which can be saved into a csv file. Loading options also exist to update file data using data stored in external files.

`Series` Methods

save_info(): save info of files into csv file,
load_info(): load info of files from csv file (overwrites self.files),
load_time(): keep current file info but only update time from info in csv file.
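The idea behind load_time() (keep the current files, replace only their times) can be illustrated with plain dictionaries and the csv module. This is a sketch under stated assumptions, not filo's code: the function name update_times_sketch, the column names num and time, the tab separator, and the choice to ignore rows for unknown files are all hypothetical.

```python
import csv
import io


def update_times_sketch(times_by_num, info_text, sep="\t"):
    """Return a copy of times_by_num with times replaced by those in info_text.

    Only the time is updated; the set of identifiers is left unchanged.
    """
    updated = dict(times_by_num)
    for row in csv.DictReader(io.StringIO(info_text), delimiter=sep):
        num = int(row["num"])
        if num in updated:  # assumption: rows for unknown files are ignored
            updated[num] = float(row["time"])
    return updated


# Times currently held by the series (identifier -> unix time)
current = {0: 1000.0, 1: 1010.0, 2: 1020.0}

# Hypothetical contents of a previously saved info file
saved = "num\ttime\n0\t500.0\n1\t505.0\n7\t999.0\n"

print(update_times_sketch(current, saved))
```

In this sketch, files 0 and 1 take the saved times, file 2 keeps its current time, and the saved row for identifier 7 has no effect.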

`Series` Attributes and properties

Regular attributes

folders: list of folders (pathlib.Path objects) across which the file series is spread,
files: list of files (filo.File objects, see below); self.files[num] is the file of identifier num,
savepath: directory in which data extracted/analyzed from files is saved, if applicable,
extension: extension of the files (str).

Read-only properties

(derived from regular attributes and methods)

info: pandas DataFrame containing info (number, folder, file, time) of files; re-calculated every time self.info is accessed and thus reflects changes in self.files,
duration: datetime.Timedelta object, time difference between last file and first file in the series.
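As a rough illustration of the duration property, the difference between the last and first unix times can be wrapped in a time-difference object. The sketch below assumes the standard library's datetime.timedelta; the exact type returned by filo and the name duration_sketch are not taken from the package.

```python
from datetime import timedelta


def duration_sketch(times):
    """Time difference between the last and first unix times (in seconds)."""
    return timedelta(seconds=times[-1] - times[0])


# Three hypothetical file times, in unix seconds
print(duration_sketch([100.0, 160.5, 3700.0]))
```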

`File` objects

File objects listed in Series.files have the following attributes:

file: pathlib.Path object of the file,
num: identifier of the file within the series (int). In the series context, num starts at 0 in the first folder,
time: stores unix time (float, in seconds) when Series.set_times() is called,

with the following additional read-only properties, derived from the ones above for convenience:

folder: pathlib.Path object of the parent directory containing the file,
name: filename (str).
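The attributes and derived read-only properties listed above could be laid out as follows. This is a minimal sketch with a hypothetical class name (FileSketch), not the actual filo.File implementation.

```python
from pathlib import Path


class FileSketch:
    """Hypothetical stand-in showing attributes and derived properties."""

    def __init__(self, file, num):
        self.file = Path(file)  # path of the file
        self.num = num          # identifier within the series
        self.time = None        # unix time, filled in later

    @property
    def folder(self):
        """Parent directory of the file (read-only, derived from file)."""
        return self.file.parent

    @property
    def name(self):
        """Filename as a str (read-only, derived from file)."""
        return self.file.name


f = FileSketch("img1/img-0005.png", num=5)
print(f.folder, f.name)
```

Because folder and name are properties without setters, assigning to them raises an AttributeError, which is what makes them read-only.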

Examples

(See ExampleSeries.ipynb for examples with actual data).

from filo import Series

# create series object of .png files located in 2 folders img1 and img2 ------
series = Series(paths=['img1', 'img2'], savepath='analysis', extension='.png')

# Access individual files in the file series ---------------------------------
series.files[0]        # first file in the first folder
series.files[55].num   # should always be equal to 55
series.files[10].time  # unix time of file creation

# Manage the infos DataFrame -------------------------------------------------
series.info         # see all file info in form of a pandas DataFrame
series.save_info()  # save info into 'FileSeries_Info.txt' (filename can be specified)

# Update Series.files objects and Series.info --------------------------------
series.load_info('Other_File_Info.txt')  # update all file data using data from external file
series.load_time('Other_File_Info.txt')   # update time information but keep other info
series.save_info('FileSeries_Info_New.txt')  # save updated info into new txt file

`ResultsBase` class

This is a base class to store analysis results and associated metadata (e.g. from file series such as images or spectra) and save them to files, or load the data/metadata from these files.

To use the class, subclass it and define the following methods:

_load_data(self, file)
_save_data(self, data, file)
_load_metadata(self, file)
_save_metadata(self, metadata, file)

Then it provides the following methods and attributes:

Methods

save(): save analysis data and metadata
load(): load analysis data and metadata from files (stores them in the data and metadata attributes; see below)
load_data(): load data from file and return it
load_metadata(): load metadata from file and return it as a dictionary
save_data(): save only data
save_metadata(): save only metadata

Attributes

data: analysis data
metadata: analysis metadata
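The subclassing pattern described above (public save/load methods that delegate to four private hooks) can be sketched as follows. To stay self-contained, the example uses a hypothetical stand-in base class, ResultsSketchBase, rather than filo's ResultsBase, whose constructor arguments are not described here; the folder argument, the filename attributes, and the JSON storage format are all assumptions.

```python
import json
from pathlib import Path


class ResultsSketchBase:
    """Hypothetical stand-in for the pattern; not filo's ResultsBase."""

    data_filename = "data"
    metadata_filename = "metadata"

    def __init__(self, folder="."):
        self.folder = Path(folder)
        self.data = None
        self.metadata = {}

    def save(self):
        """Save data and metadata through the subclass hooks."""
        self._save_data(self.data, self.folder / self.data_filename)
        self._save_metadata(self.metadata, self.folder / self.metadata_filename)

    def load(self):
        """Load data and metadata into the corresponding attributes."""
        self.data = self._load_data(self.folder / self.data_filename)
        self.metadata = self._load_metadata(self.folder / self.metadata_filename)

    # Hooks to be defined by subclasses
    def _load_data(self, file):
        raise NotImplementedError

    def _save_data(self, data, file):
        raise NotImplementedError

    def _load_metadata(self, file):
        raise NotImplementedError

    def _save_metadata(self, metadata, file):
        raise NotImplementedError


class JsonResults(ResultsSketchBase):
    """Example subclass storing both data and metadata as JSON."""

    data_filename = "results.json"
    metadata_filename = "metadata.json"

    def _load_data(self, file):
        return json.loads(Path(file).read_text())

    def _save_data(self, data, file):
        Path(file).write_text(json.dumps(data))

    def _load_metadata(self, file):
        return json.loads(Path(file).read_text())

    def _save_metadata(self, metadata, file):
        Path(file).write_text(json.dumps(metadata))
```

A subclass only decides how data and metadata are read and written; the save/load logic stays in the base class.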

Misc. Functions

# List files and folders -----------------------------------------------------
list_files(path='.', extension='')  # all files in a folder, sorted by name
list_all(path='.')  # all contents of a folder, sorted by name

# Move files and folders -----------------------------------------------------
move_files(src='.', dst='.', extension='')  # move only files with some suffix
move_all(src='.', dst='.')  # move everything

# Line formatting for csv ----------------------------------------------------
load_csv(file, sep='\t', skiprows=2)  # load csv into list of lists
data_to_line(data, sep='\t')  # iterable data to a line ending with \n, items separated by sep
line_to_data(line, sep='\t', dtype=float)  # inverse of data_to_line(); returns data as a tuple of dtype values

# Misc -----------------------------------------------------------------------
batch_file_rename(name, newname, path='.')  # recursively rename files named name to newname
make_iterable(x)  # transform non-iterables into a tuple, but keep iterables unchanged

Note: extension is optional and restricts the operation to files with a certain extension, e.g. '.txt'. If left blank, all files are considered (excluding directories).
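To make the behavior described above concrete, here is a rough standard-library sketch of extension filtering and of the line formatting round trip. The functions carry a _sketch suffix to stress that they are hypothetical re-implementations written for illustration, not filo's code; details such as return types are assumptions.

```python
from pathlib import Path


def list_files_sketch(path=".", extension=""):
    """Files (not directories) in path with the given extension, sorted by name."""
    return sorted(
        p for p in Path(path).iterdir()
        if p.is_file() and p.name.endswith(extension)  # '' matches every file
    )


def data_to_line_sketch(data, sep="\t"):
    """Join the items of an iterable into one line ending with a newline."""
    return sep.join(str(x) for x in data) + "\n"


def line_to_data_sketch(line, sep="\t", dtype=float):
    """Inverse operation: split a line and convert each item to dtype."""
    return tuple(dtype(x) for x in line.rstrip("\n").split(sep))


line = data_to_line_sketch([1, 2.5, 3])
print(repr(line))
print(line_to_data_sketch(line))
```

Note that the round trip does not preserve the original types: integers written to a line come back as floats when dtype=float.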

Requirements

(installed automatically by pip if necessary)

python >= 3.6
pandas (for managing data in Series class)
importlib-metadata

Author

Olivier Vincent (ovinc.py@gmail.com)

License

3-clause BSD (see LICENSE file)

Release history

2.2.1: Oct 28, 2025
2.2.0: Oct 22, 2025
2.1.1: Apr 15, 2025
2.1.0: Apr 14, 2025
2.0.1: Mar 31, 2025
2.0.0: Mar 29, 2025
1.3.0: Mar 18, 2025
1.2.0: Mar 17, 2025 (this version)
1.1.7: Jan 10, 2025
1.1.6: Jan 10, 2025
1.1.5: Jan 22, 2021
1.1.4: Jan 12, 2021
1.1.3: Jan 11, 2021

Download files

Source Distribution

filo-1.2.0.tar.gz (5.0 MB)

Uploaded Mar 17, 2025 Source

Built Distribution

filo-1.2.0-py3-none-any.whl (5.0 MB)

Uploaded Mar 17, 2025 Python 3

File details

Details for the file filo-1.2.0.tar.gz.

File metadata

Download URL: filo-1.2.0.tar.gz
Upload date: Mar 17, 2025
Size: 5.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for filo-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`8d821212d27f6d1f000e657bd11e2625874361637b3ffc7349b1b40e3bfefe92`
MD5	`c7037b5bcec0469b66d1455cebed261b`
BLAKE2b-256	`b1f248bcdb6657bcc58e8dc103686e297fafdebf0f96adab2e80964049e3079d`

File details

Details for the file filo-1.2.0-py3-none-any.whl.

File metadata

Download URL: filo-1.2.0-py3-none-any.whl
Upload date: Mar 17, 2025
Size: 5.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for filo-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4f19a8aeabf07eb562d4c7e2aaa3d66e1871fd9b9726cbfc4160dc54a16aa5e0`
MD5	`67322adb46248a46cda91fe85de38fec`
BLAKE2b-256	`3a5f89739326a6e79ec62033738e833b4afec811386008f9f67df8663754baa1`
