MegaFlow2D: A Large-Scale Dataset for 2D Flow Simulation

These details have not been verified by PyPI

Project links

Homepage

Project description

MegaFlow2D

Overview

The MegaFlow2D dataset package of parameteric CFD simulation results for machine learning / super-resolution purposes.

The package contains:

A standard structure for transferring simulation results into graph structure.
Common utility functions for visualizing, retrieving and processing simulation results. (Everything that requires the FEniCS or dolfin package can only be run on linux or wsl.)

Installation

The MegaFlow dataset can be installed by pip:

pip install MegaFlow2D

Running pip install would automatically configure package dependencies, however to build graphical models torch-geometric needs to be installed manually.

Dataset structure

The entire dataset is stored inside a single HDF5 file. Although multiple HDF5 files are created during processing depending on the number of processing cores used to avoid data corruption while concurrently writing to a single file. The reading operation, however, can be done concurrently as long as all operations are restricted in r mode. The dataset is stored in a hierarchical structure, and each group is indexed by the geometry type, mesh resolution and time step. The dataset object is stored as a h5py.dataset object under each group. The dataset structure is shown below:

├── MegaFlow2D
│   ├── <geometry_type>_<geometry_index>
│   │   ├── <mesh_resolution>
│   │   │   ├── <time_step>
│   │   │   │   ├── dataset

In theory, searching through the dataset can have a complexity of O(1) due to the B-tree structure of HDF5 to allow for fast data retrieval in training loading process. However, the process might be slowed down by the auto decompression of the dataset. This may be improved by reprocessing the dataset with a different compression setting in utils.py. Please keep in mind that reprocessing the dataset can take several hours depending on the number of cores used.

Using the MegaFlow package

The MegaFlow package provides a simple interface for initializing and loading the dataset.

from megaflow.dataset.MegaFlow2D import MegaFlow2D

if __name__ == '__main__':
    dataset = MegaFlow2D(root='/path/to/your/directory', download=True, transform='normalize', pre_transform=None, split_scheme='mixed', split_ratio=0.8)
    # if the dataset is not processed, the process function will be called automatically. 
    # to facilitate multi-thread processing, be sure to exceute the process function in '__main__'.

    # get one sample
    sample_low, sample_high = dataset.get(0)
    print('Number of nodes: {}, number of edges: {}'.format(sample_low.num_nodes, sample_low.num_edges))

Using the example scripts

We provide an example script for training a super-resolution model on the MegaFlow2D dataset. The script can be found in the examples directory. The script can be run by (one configuration example):

python examples/train.py --root /path/to/your/directory --dataset MegaFlow2D --tranform normalize --model FlowMLError --epochs 100 --batch_size 32

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.6.0

Sep 21, 2023

This version

0.5.3

May 5, 2023

0.5.2

Mar 21, 2023

0.5.1

Mar 21, 2023

0.5.0

Mar 20, 2023

0.4.3

Mar 20, 2023

0.4.2

Mar 19, 2023

0.4.1

Mar 19, 2023

0.4.0

Mar 19, 2023

0.3.9

Mar 19, 2023

0.3.8

Mar 19, 2023

0.3.7

Mar 19, 2023

0.3.6

Mar 19, 2023

0.3.5

Mar 19, 2023

0.3.4

Mar 19, 2023

0.3.3

Mar 13, 2023

0.3.2

Mar 7, 2023

0.3.1

Mar 7, 2023

0.3.0

Mar 7, 2023

0.2.4

Mar 6, 2023

0.2.3

Feb 8, 2023

0.2.2

Feb 1, 2023

0.2.1

Feb 1, 2023

0.2.0

Feb 1, 2023

0.1.0

Feb 1, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

MegaFlow2D-0.5.3.tar.gz (9.4 kB view hashes)

Uploaded May 5, 2023 Source

Hashes for MegaFlow2D-0.5.3.tar.gz

Hashes for MegaFlow2D-0.5.3.tar.gz
Algorithm	Hash digest
SHA256	`a9a447691a466a1b7e56297f1decaa8e46ab32b770945601f0ca7e61b2c0fa90`
MD5	`5cb9fb425784a8e91d350cbac14e7df0`
BLAKE2b-256	`f42fd42bc328880f64fb348898f3283ad7c4026f963beddc330258ba133e1741`