Package for benchmarking deep learning models on AEM problems

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Benchmarking AEM problems with various DL structures

This repository stores implemention of paper Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic Materials

It includes a suit of AEM data set benchmarks along with implementation of various ready-to-use deep learning architectures (MLP, Transformer, MLP-Mixer) for scientific computation problem and a handful of utility functions.

Data Sets

geometry_illustration Schematics of geometry in three physical problems. (a) Infinite array of all-dielectric metasurfaces consists of four elliptical-resonators supercells. (b) A nanophotonic particle consists of four layers. (c) The three-layers color filter design.

Requirements

Package	Version
Python	>=3.7
Pytorch	>= 1.3.1
Numpy	>=1.17.4
Pandas	>=0.25.3
Tensorboard	>=2.0.0
Tqdm	>=4.42.0
Sklearn	>=0.22.1
Matplotlib	>= 3.1.3
einops	>= 0.3.0
seaborn	>= 0.11.2

Environment

The detailed conda environment is packaged in .yml file.
Add the Benchmarking folder as one of the source directory to make utils and Simulated_Dataset folders visible runtime

Features

Access to various ADM data sets
Off-the-shelf implementation of MLP, Transformer and MLP-Mixer with high individuality
Utilities for data preprocessing and preparation for downstream deep learning tasks
Utilities for plotting and easy analysis of results

Usage

Access to Data Sets

ADM Data Set. Please download and unzip from the Repository.
Particle Data Set. Please download and unzip from the Repository.
Color Data Set. Please download and unzip from the Repository.

Download Pre-trained Models

MLP: Please download and unzip from the folder.
Transformer: Please download and unzip from the folder.
MLP-Mixer: Please download and unzip from the folder.

Install Package

pip install AEML

Loading data and Splitting

Loading benchmark datasets described in Section 4.1 of the paper

ADM refers to the All-dielectric metasurface dataset. Particle dataset refers to the Nanophotonic Particle dataset. The Color dataset refers to the Color filter dataset. The specification of each dataset is provided in the table below:

Dataset	D_in	D_out	Sub_area	Simulations	Simulation CPU time
All-dielectric metasurfac	14	2001	Metamaterials	60,000	7 months
Nanophotonic particle	8	201	Nanophotonics	50,000	1.5 hours
Color	3	3	Optical waveguide	100,000	-

Loading your own benchmark dataset into the framework

Although we used AEM dataset for benchmarking, this suite is open and easily adaptable to a wide range of applications in the scientific computing community. To test your own custom dataset, simply normalize (or not, your choice, our loader would not normalize your dataset) and put your dataset into the Custom folder with the format: data_x.csv, data_y.csv where each file contains the input and output of the application. The shape should be [#Simulations, Dim_x] and [#Simulations, Dim_y] and separated by comma. Note that there should not be any header in the csv.

import AEML
from AEML.data import ADM, Particle, Color, load_custom_dataset

# Load our pre-defined dataset
train_loader, test_loader, test_x, test_y =ADM/Particle/Color(normalize=True/False, batch_size=1024)    # Loading the ADM dataset

# Or, load prepare your own dataset here
# train_loader, test_loader, test_x, test_y = load_custom_dataset()

Loading Models with configurable hyper-paramters and making prediction

Architectures of various DL structures implementd

As dscribed in section 5 in the paper, the architectures are modified slightly from the original Mixer and Transformer models to fit our scientific computing background.

Model hyper-parameter adjustment

from AEML.models.Mixer import DukeMIXER
from AEML.models.MLP import DukeMLP
from AEML.models.Transformer import DukeTransformer

# 1. Defining all the models here (We highly recommend training the models one by one due to GPU RAM constraints
#MLP:
model = DukeMLP(dim_g=3, dim_s=3, linear=[500, 500, 500, 500, 500, 500], skip_connection=False, skip_head=0, dropout=0, model_name=None)

#Transformer:
model= DukeTransformer(dim_g, dim_s, feature_channel_num=32, nhead_encoder=8, 
                        dim_fc_encoder=64, num_encoder_layer=6, head_linear=None, 
                        tail_linear=None, sequence_length=8, model_name=None, 
                        ckpt_dir=os.path.join(os.path.abspath(''), 'models','Transformer'))
#Mixer:
model = DukeMIXER(dim_g, dim_s, mlp_dim=500, patch_size=10, mixer_layer_num=6,
                embed_dim=128, token_dim=128, channel_dim=256, 
                mlp_layer_num_front=3, mlp_layer_num_back=3)

# 2. Model training code

#MLP:
model.train_(train_loader, test_loader, epochs=500, optm='Adam', weight_decay=1e-4,
            lr=1e-4, lr_scheduler_name='reduce_plateau', lr_decay_rate=0.2, eval_step=10,
            stop_threshold=1e-7)

#Transformer:
model.train_(train_loader, test_loader, epochs=500, optm='Adam', reg_scale=5e-4, lr=1e-3, 
                        lr_schedueler_name='reduce_plateau',lr_decay_rate=0.3, eval_step=10)

#Mixer:
model.train_(train_loader, test_loader, epochs=500, optm='Adam', weight_decay=1e-4,
            lr=1e-4, lr_scheduler_name='reduce_plateau', lr_decay_rate=0.2, eval_step=10,
            stop_threshold=1e-7)

# Loading the model you just trained or hypersweeped or our provided pretrained model if 
# you don't want to train it or just want to reproduce our result, only choose one between these 2
model.load_model(pre_trained_model='Particle'\'AMD'\'Color'\None, 
                model_directory='YOUR_MODEL_DIRECOTRY')

# Model inference code: Give it X, output Y
pred_Y = model(test_X)

# Model evaluation code: Give it test_X, test_Y, output MSE and generate a plot of MSE histogram in \data
MSE = model.evaluate(test_x, test_y, save_output=False, save_dir='data/')

Performance of various DL structures on benchmark ADM data sets

Relative size of our pre-trained networks

Support

Please file an issue here.

License

The project is licensed under the MIT license.

Please cite this work if some of the code or datasets are helpful in your scientific endeavours. For specific datasets, please also cite the respective original source(s), given in the preprint.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.0.1

Sep 10, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

AEML-0.0.1.tar.gz (65.0 kB view details)

Uploaded Sep 10, 2021 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

AEML-0.0.1-py3-none-any.whl (78.1 kB view details)

Uploaded Sep 10, 2021 Python 3

File details

Details for the file AEML-0.0.1.tar.gz.

File metadata

Download URL: AEML-0.0.1.tar.gz
Upload date: Sep 10, 2021
Size: 65.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.6.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for AEML-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`6ec9e4273819bee8375d15a41a70b654df12362d4e7f6270a5b94aaef18a3d1b`
MD5	`e6f7994d9aaa7900ad93023cb2fbf121`
BLAKE2b-256	`c6848f9f6c4b571b578fc403516bef491c03cdbd56f813cbf9153369c656578d`

See more details on using hashes here.

File details

Details for the file AEML-0.0.1-py3-none-any.whl.

File metadata

Download URL: AEML-0.0.1-py3-none-any.whl
Upload date: Sep 10, 2021
Size: 78.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.6.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for AEML-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0dea8e3463d18c5db7d452b0e3190d2a85a3110e3f1438743096d0587e65f3ee`
MD5	`e9f970b8eb6e1f893d788eeea81153d3`
BLAKE2b-256	`595f84cac6a7ebac6eb5183069d0bd6bfb008a2956971fbbeeaaeadde88837d7`

See more details on using hashes here.

AEML 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Benchmarking AEM problems with various DL structures

Data Sets

Requirements

Environment

Features

Usage

Access to Data Sets

Download Pre-trained Models

Install Package

Loading data and Splitting

Loading benchmark datasets described in Section 4.1 of the paper

Loading your own benchmark dataset into the framework

Loading Models with configurable hyper-paramters and making prediction

Architectures of various DL structures implementd

Model hyper-parameter adjustment

Performance of various DL structures on benchmark ADM data sets

Relative size of our pre-trained networks

Support

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes