Skip to main content

A PyTorch-based collection of Unsupervised Domain Adaptation methods applied to strong gravitational lenses

Project description

DeepLense Domain Adaptation

A PyTorch-based collection of Unsupervised Domain Adaptation methods applied to strong gravitational lenses.

This project was created for Google Summer of Code 2021 under the Machine Learning for Science (ML4Sci) umbrella organization.

Paper

A paper describing our methodology was published: Domain Adaptation for Simulation-Based Dark Matter Searches Using Strong Gravitational Lensing.

Description

A promising means to identify the nature of dark matter is to study it through dark matter halos, and strong gravitational lenses have seen encouraging results in detecting the existence of dark matter substructure. Unfortunately, there isn't a lot of data of strong gravitational lenses available, which means that, if we want to train a machine learning model to identify the different kinds of dark matter substructure, we'd need to use simulations. The problem though, is that a model trained on simulated data does not generalize well to real data, having a very bad performance. This project aims to fix this problem by using Unsupervised Domain Adaptation (UDA) techniques to adapt a model trained on simulated data to real data!

Blog post

For more about the motivation behind the project and also my Google Summer of Code experience check out this blog post.

Installation

The implementations can be acessed through the deeplense_domain_adaptation package. To install it simply do:

pip install --upgrade deeplense_domain_adaptation

Data

The data loading pipeline implemented here expects the image data to be in the form of a four dimensional numpy array of shape: (number_of_images, 1, height, width). Label data is expected to have a shape: (number_of_images, 1).

The paper's Model A is our source dataset (less complex simulations) and the paper's Model B is our target dataset (more complex simulations). We have three distinct classes: no dark matter substructure, spherical dark matter substructure and vortex dar matter substructure. On our training sets we have 30'000 images for the source and 30'000 images for the target; in both cases there are 10'000 images per class. On our test sets we have 7'500 images for the source and 7'500 images for the target; in both cases there are 2'500 images per class.

How to use deeplense_domain_adaptation

For a tutorial on how to use the deeplense_domain_adaptation package check out tutorial.ipynb. If the file isn't loading properly on GitHub you can also check the Jupyter Notebook on nbviewer here. For more information on specific functions/classes check out the documentation available on the functions/classes definitions.

Before and after UDA

Equivariant Network model

  • Supervised training on source inferring on source: accuracy = 97.09; AUROC = 0.996.

  • Supervised training on source inferring on target: accuracy = 67.53; AUROC = 0.856.

  • Applying UDA and inferring on target:

Algorithm ADDA AdaMatch Self-Ensemble
Accuracy 91.47 85.81 80.09
AUROC 0.980 0.960 0.939

ResNet-18

  • Supervised training on source inferring on source: accuracy = 96.84; AUROC = 0.996.

  • Supervised training on source inferring on target: accuracy = 59.19; AUROC = 0.880.

  • Applying UDA and inferring on target:

Algorithm ADDA AdaMatch Self-Ensemble
Accuracy 85.84 75.55 76.71
AUROC 0.955 0.919 0.917

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplense_domain_adaptation-1.0.0.tar.gz (20.2 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file deeplense_domain_adaptation-1.0.0.tar.gz.

File metadata

  • Download URL: deeplense_domain_adaptation-1.0.0.tar.gz
  • Upload date:
  • Size: 20.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.9.5

File hashes

Hashes for deeplense_domain_adaptation-1.0.0.tar.gz
Algorithm Hash digest
SHA256 e50a6467f947ad7b02cb29264895ddfebf8762bcb47319e928084e3580a8f29a
MD5 c57824114c7f2e0e6ee344b86bc4f0fe
BLAKE2b-256 df96ee0ba3cd88afed446898a464874348d999939cc8e42fbdb3f8900cae6642

See more details on using hashes here.

File details

Details for the file deeplense_domain_adaptation-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: deeplense_domain_adaptation-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 28.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.4 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.9.5

File hashes

Hashes for deeplense_domain_adaptation-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 59d8ee4d32bada4f634ce668892d9a7b9c165bc4e82f22728379c5a0bb4b0c89
MD5 4c70a599f8ee7d98a219fef6cd7f0320
BLAKE2b-256 0a7dc1ab788a9aed31e57a2dded2b29e1c348ffd4150dec5a194324565155f78

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page