Library of modular domain generalization for deep learning


DomainLab: modular Python package for training domain-invariant neural networks


Distribution shifts, domain generalization and DomainLab

Neural networks trained using data from a specific distribution (domain) usually fail to generalize to novel distributions (domains). Domain generalization aims at learning domain-invariant features by utilizing data from multiple domains (data sites, cohorts, batches, vendors) so that the learned features can generalize to new, unseen domains (distributions).

DomainLab is a software platform that implements state-of-the-art domain generalization algorithms. It is designed to decouple its software components as far as possible, which maximizes code reuse.

DomainLab

DomainLab decouples the following concepts or objects:

- task $M$: In DomainLab, a task is a container for datasets from different domains (e.g. from distributions $D_1$ and $D_2$). A task offers a static protocol to evaluate the generalization performance of a neural network: which dataset(s) are used for training and which dataset(s) are used for testing.
- neural network: a map $\phi$ from the input data to the feature space and a map $\varphi$ from the feature space to the output $\hat{y}$ (e.g. decision variable).
- model: structural risk in the form of $\ell() + \mu R()$, where
  - $\ell(Y, \hat{y}=\varphi(\phi(X)))$ is the task-specific empirical loss (e.g. cross entropy for a classification task).
  - $R(\phi(X))$ is the penalty loss that encourages domain-invariant feature extraction by $\phi$.
  - $\mu$ is the multiplier that weights the corresponding penalty term.
- trainer: an object that guides the data flow to the model and appends further domain-invariant losses, such as inter-domain feature alignment.
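The structural risk $\ell() + \mu R()$ can be made concrete with a small pure-Python sketch. Everything in it is an assumption chosen for illustration: the toy maps `phi` and `varphi`, the two-domain data, and the penalty `mean_feature_gap` (squared distance between per-domain feature means) are hypothetical stand-ins, not DomainLab's networks or penalties.

```python
import math

def phi(x):
    """Hypothetical feature extractor: input -> 2-dimensional feature vector."""
    return [x[0] + x[1], x[0] - x[1]]

def varphi(h):
    """Hypothetical classifier head: feature vector -> class probabilities (softmax)."""
    m = max(h)
    exps = [math.exp(v - m) for v in h]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(y, probs):
    """Task loss ell(Y, y_hat) for one sample with integer label y."""
    return -math.log(probs[y])

def mean_feature_gap(feats_a, feats_b):
    """Illustrative penalty R: squared distance between the two domains' feature means."""
    dim = len(feats_a[0])
    mean_a = [sum(f[i] for f in feats_a) / len(feats_a) for i in range(dim)]
    mean_b = [sum(f[i] for f in feats_b) / len(feats_b) for i in range(dim)]
    return sum((a - b) ** 2 for a, b in zip(mean_a, mean_b))

def structural_risk(domain_1, domain_2, mu):
    """Return ell + mu * R for two domains given as lists of (x, y) pairs."""
    samples = domain_1 + domain_2
    ell = sum(cross_entropy(y, varphi(phi(x))) for x, y in samples) / len(samples)
    penalty = mean_feature_gap(
        [phi(x) for x, _ in domain_1],
        [phi(x) for x, _ in domain_2],
    )
    return ell + mu * penalty

# Toy data standing in for samples from D_1 and D_2 (made up for illustration).
domain_1 = [((1.0, 0.0), 0), ((2.0, 1.0), 0)]
domain_2 = [((0.0, 1.0), 1), ((1.0, 3.0), 0)]

print(structural_risk(domain_1, domain_2, mu=0.0))  # task loss only
print(structural_risk(domain_1, domain_2, mu=0.5))  # task loss plus weighted penalty
```

Setting `mu=0.0` recovers plain empirical risk minimization, while larger values of `mu` put more weight on aligning the features of the two domains.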

Detailed documentation on how these models and trainers work is available at https://marrlab.github.io/DomainLab/

DomainLab makes it possible to combine models with models, trainers with models, and trainers with trainers in a decorator-pattern-like line of code, Trainer A(Trainer B(Model C(Model D(network E), network E, network F))), which corresponds to $\ell() + \mu_a R_a() + \mu_b R_b() + \mu_c R_c() + \mu_d R_d()$. Here Model C and Model D share neural network E, but Model C has an extra neural network F. All models share the same neural network for feature extraction, but can have different auxiliary networks for $R()$.
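How nested decoration adds up to a sum of weighted penalties can be sketched in a few lines of Python. The `Component` class, its method names, and the constant penalty values below are hypothetical and are not DomainLab's API. The sketch only shows the bookkeeping of $\ell() + \sum_i \mu_i R_i()$ and leaves out the sharing of neural networks.

```python
class Component:
    """Hypothetical stand-in for a model or trainer that contributes one penalty term."""

    def __init__(self, name, mu, penalty_fn, inner=None):
        self.name = name              # label of the penalty, e.g. "R_c"
        self.mu = mu                  # multiplier for this penalty
        self.penalty_fn = penalty_fn  # callable: batch -> penalty value
        self.inner = inner            # the decorated (wrapped) component, if any

    def weighted_penalties(self, batch):
        """Collect mu_i * R_i from the innermost component outwards."""
        terms = self.inner.weighted_penalties(batch) if self.inner is not None else {}
        terms[self.name] = self.mu * self.penalty_fn(batch)
        return terms


def total_loss(outer, task_loss_fn, batch):
    """ell() plus the weighted penalties of every component in the chain."""
    return task_loss_fn(batch) + sum(outer.weighted_penalties(batch).values())


# Constant penalties are placeholders so the arithmetic is easy to follow.
model_d = Component("R_d", mu=0.1, penalty_fn=lambda batch: 4.0)
model_c = Component("R_c", mu=0.5, penalty_fn=lambda batch: 2.0, inner=model_d)
trainer_b = Component("R_b", mu=1.0, penalty_fn=lambda batch: 1.0, inner=model_c)
trainer_a = Component("R_a", mu=2.0, penalty_fn=lambda batch: 0.5, inner=trainer_b)

loss = total_loss(trainer_a, task_loss_fn=lambda batch: 1.0, batch=None)
print(list(trainer_a.weighted_penalties(None)))  # ['R_d', 'R_c', 'R_b', 'R_a']
print(round(loss, 6))                            # 4.4
```

Each wrapper only knows about the component it decorates, so adding or removing one penalty does not require changes to the others.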

Getting started

Installation

For the development version on GitHub, see Installation and Dependencies handling.

A PyPI release is also available at https://pypi.org/project/domainlab/ and can be installed via pip install domainlab. Creating a virtual environment for it is recommended.
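To check from Python whether the package is available in the active environment, a small sketch using the standard-library `importlib.metadata` module may help. The helper name `installed_version` is hypothetical. Only the distribution name `domainlab` comes from the PyPI link above.

```python
from importlib import metadata


def installed_version(package="domainlab"):
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
if version is None:
    print("domainlab is not installed in this environment; try: pip install domainlab")
else:
    print(f"domainlab {version} is installed")
```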

Task specification

DomainLab offers several ways for the user to specify a scenario that evaluates generalization performance by training on a limited number of datasets. See details in Task Specification.
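The idea of a task as a container of per-domain datasets with a fixed train/test protocol can be sketched as follows. `ToyTask` and its `split` method are hypothetical names for illustration and are not the DomainLab task classes. The domain names and the string placeholders standing in for datasets are likewise made up for the example.

```python
from dataclasses import dataclass


@dataclass
class ToyTask:
    """Hypothetical container that maps domain names to datasets."""

    domains: dict

    def split(self, test_domains):
        """Hold out the named domains for testing and train on all remaining domains."""
        unknown = set(test_domains) - set(self.domains)
        if unknown:
            raise KeyError(f"unknown test domain(s): {sorted(unknown)}")
        train = {k: v for k, v in self.domains.items() if k not in test_domains}
        test = {k: v for k, v in self.domains.items() if k in test_domains}
        return train, test


# Placeholder "datasets": short lists of sample identifiers per domain.
task = ToyTask({"caltech": ["c0", "c1"], "labelme": ["l0"], "sun": ["s0", "s1"]})
train, test = task.split(["caltech"])
print(sorted(train))  # ['labelme', 'sun']
print(sorted(test))   # ['caltech']
```

Because the split depends only on domain names, the same task object can be reused to evaluate any held-out domain.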

Example and usage

Either clone this repo and use the command line:

python main_out.py -c ./examples/conf/vlcs_diva_mldg_dial.yaml

where the configuration file has the following content:

te_d: caltech                       # domain name of test domain
tpath: examples/tasks/task_vlcs.py  # python file path to specify the task
bs: 2                               # batch size
model: dann_diva                    # combine model DANN with DIVA
epos: 1                             # number of epochs
trainer: mldg_dial                  # combine trainer MLDG and DIAL
gamma_y: 700000.0                   # hyperparameter of diva
gamma_d: 100000.0                   # hyperparameter of diva
npath: examples/nets/resnet.py      # neural network for class classification
npath_dom: examples/nets/resnet.py  # neural network for domain classification

See details in Command line usage
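To inspect such a configuration programmatically, the flat `key: value  # comment` layout shown above can be read with a few lines of standard-library Python. This is only a simplified stand-in for a real YAML parser and handles nothing beyond this flat layout. The function name `parse_flat_config` is hypothetical. Splitting the `model` and `trainer` names on underscores mirrors the comments in the file and is an assumption about the naming convention, not a description of how DomainLab parses them.

```python
CONFIG_TEXT = """\
te_d: caltech                       # domain name of test domain
tpath: examples/tasks/task_vlcs.py  # python file path to specify the task
bs: 2                               # batch size
model: dann_diva                    # combine model DANN with DIVA
epos: 1                             # number of epochs
trainer: mldg_dial                  # combine trainer MLDG and DIAL
gamma_y: 700000.0                   # hyperparameter of diva
gamma_d: 100000.0                   # hyperparameter of diva
npath: examples/nets/resnet.py      # neural network for class classification
npath_dom: examples/nets/resnet.py  # neural network for domain classification
"""


def parse_flat_config(text):
    """Parse flat 'key: value  # comment' lines into a dict of strings."""
    config = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comment
        if not line:
            continue
        key, value = line.split(":", 1)
        config[key.strip()] = value.strip()
    return config


config = parse_flat_config(CONFIG_TEXT)
print(config["te_d"])                # caltech
print(config["model"].split("_"))    # ['dann', 'diva']
print(config["trainer"].split("_"))  # ['mldg', 'dial']
```

All values stay strings in this sketch, so numeric entries such as `gamma_y` need an explicit `float(...)` conversion before use.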

or program against the DomainLab API

See example here: Transformer as feature extractor, decorate JIGEN with DANN, training using MLDG decorated by DIAL

Benchmark different methods

DomainLab provides benchmark functionality. To benchmark several algorithms (each a combination of neural networks, models, trainers and associated hyperparameters), a single command along with a benchmark configuration file is sufficient. See details in the benchmarks documentation and tutorial.

One can simply run bash run_benchmark_slurm.sh your_benchmark_configuration.yaml to launch the different experiments with the specified configuration.

For example, the following result (without any augmentation such as flipping) is for the PACS dataset using ResNet.

Benchmark results plot generated by DomainLab: each rectangle represents one model-trainer combination, each bar inside a rectangle represents a unique hyperparameter index associated with that method combination, and each dot represents a random seed.
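The structure of such a benchmark (method combination, hyperparameter index, random seed) can be sketched as a simple grid enumeration. The function `enumerate_experiments`, the method labels, and all hyperparameter values other than the `gamma_y` and `gamma_d` values from the configuration above are hypothetical placeholders. The sketch does not reflect the format of DomainLab's benchmark configuration files and contains no results.

```python
from itertools import product


def enumerate_experiments(methods, seeds):
    """Expand {method: [hyperparameter dicts]} into one experiment per (method, hp_index, seed)."""
    experiments = []
    for method, hyper_list in methods.items():
        for (hp_index, hyper), seed in product(enumerate(hyper_list), seeds):
            experiments.append(
                {"method": method, "hp_index": hp_index, "hyper": hyper, "seed": seed}
            )
    return experiments


# Placeholder grid: two method combinations, with two and one hyperparameter settings.
methods = {
    "dann_diva + mldg_dial": [
        {"gamma_y": 700000.0, "gamma_d": 100000.0},
        {"gamma_y": 1.0, "gamma_d": 1.0},  # made-up alternative setting
    ],
    "diva + mldg": [
        {"gamma_y": 700000.0, "gamma_d": 100000.0},
    ],
}

experiments = enumerate_experiments(methods, seeds=[0, 1, 2])
print(len(experiments))  # 9
for exp in experiments[:3]:
    print(exp["method"], exp["hp_index"], exp["seed"])
```

In the plot described above, each `method` would map to a rectangle, each `hp_index` to a bar, and each `seed` to a dot.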


Release history

Version	Release date
0.6.4	May 12, 2024
0.6.3	May 8, 2024
0.6.2	Mar 28, 2024
0.6.1	Feb 16, 2024
0.6.0 (this version)	Jan 25, 2024
0.5.0	Jan 20, 2024
0.4.3	Jan 11, 2024
0.4.2	Jan 10, 2024
0.4.1	Jan 10, 2024
0.4.0	Jan 10, 2024
0.3.3	Jan 7, 2024
0.3.2	Jan 7, 2024
0.3.1	Jan 6, 2024
0.3.0	Jan 6, 2024
0.2.9	Dec 30, 2023
0.2.8	Dec 21, 2023
0.2.7	Dec 21, 2023
0.2.6	Dec 18, 2023
0.2.5	Dec 18, 2023
0.2.4	Dec 17, 2023
0.2.3	Dec 16, 2023
0.2.2	Dec 16, 2023
0.2.1	Dec 15, 2023
0.2.0	Dec 15, 2023
0.1.9	Dec 13, 2023
0.1.8	Dec 11, 2023
0.1.7	Dec 11, 2023
0.1.6	Dec 11, 2023
0.1.5	Dec 11, 2023
0.1.4	Aug 11, 2023

Download files

Source Distribution

domainlab-0.6.0.tar.gz (3.0 MB), uploaded Jan 25, 2024

Built Distribution

domainlab-0.6.0-py3-none-any.whl (3.2 MB), uploaded Jan 25, 2024, Python 3

Hashes for domainlab-0.6.0.tar.gz

Algorithm	Hash digest
SHA256	`cdaaac5cc34ee5ac982687c58f4e92122a97652bf60e3b53942a2e7991de6ae8`
MD5	`ec60366f456e0e8927f6fab2ddb13353`
BLAKE2b-256	`1906db4dac56c86289636d6e980d70b8ebf5a786e771b9723f28dfb87c2abc5d`

Hashes for domainlab-0.6.0-py3-none-any.whl

Algorithm	Hash digest
SHA256	`18b09845cbb302e1ea2aaec53131027ca65e90ffe836e615a4ee3beba1070758`
MD5	`fb8dceee2fb58677f4b25e167ff4b009`
BLAKE2b-256	`c41f54aafb4584bee90e550bf066cd44ba92f94b427f9a77c882461bdf3c9b1c`