Deep learning framework for reconstructing gravitational wave signals and glitches from LIGO detector data
Project description
DeepExtractor
Deep learning framework for reconstructing gravitational-wave signals and glitches from LIGO detector data.
Built for LIGO's O3 observing run (Hanford and Livingston detectors). Described in the paper:
Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning Dooney, Narola, Bromuri, Curier, Van Den Broeck, Caudill, Tan — Phys. Rev. D 112, 044022 (2025) 10.1103/s91m-c2jw
Documentation | GitHub | PyPI
How it works
LIGO strain data contains both astrophysical signals and instrumental glitches — short-duration noise transients that can mimic or obscure gravitational-wave events. DeepExtractor frames glitch reconstruction as a supervised denoising problem:
Input: h(t) = background noise + glitch/signal
Output: n̂(t) = predicted background
Result: ĝ(t) = h(t) − n̂(t) ← reconstructed glitch or signal
The model is a U-Net operating on STFT spectrograms (magnitude + phase). The default model, DeepExtractor_257, uses a 4-level U-Net with feature maps [64, 128, 256, 512].
Installation
pip install deepextractor
Requires Python ≥ 3.10 and PyTorch ≥ 2.1. Pretrained weights are downloaded automatically from Hugging Face Hub on first use — no manual step required.
Install from source:
git clone https://github.com/tomdooney95/deepextractor.git
cd deepextractor
pip install -e ".[dev]"
Quickstart
import numpy as np
import deepextractor
# Load model (bilby noise variant by default)
model = deepextractor.DeepExtractorModel()
# Reconstruct — extract the transient from noisy strain
noisy_strain = np.random.randn(8192) # replace with real data
reconstructed = model.reconstruct(noisy_strain) # extracted signal
background = model.background(noisy_strain) # noise estimate
# One-liner convenience function
reconstructed = deepextractor.reconstruct(noisy_strain)
Two pretrained variants are available:
| Variant | Use case |
|---|---|
bilby_noise (default) |
Simulated LIGO/Virgo noise, injection studies |
real_noise |
Real LIGO O3 detector data |
Bundled dataset
The package ships a sample of the GravitySpy LIGO O3a high-confidence catalogue at assets/data_o3a_sample.csv — 10 H1 examples per glitch class (17 classes, 170 rows total), SNR > 15.
import pandas as pd
import importlib.resources as resources
with resources.path("deepextractor", "assets") as assets:
df = pd.read_csv(assets / "data_o3a_sample.csv")
print(df["label"].value_counts())
CLI tools
# Train a model
deepextractor-train --model DeepExtractor_257 --data-dir data/spectrogram_domain/
# Generate training data
deepextractor-generate --output-dir data/ --num-train 250000
# Convert time-domain data to spectrograms
deepextractor-specgen --input-dir data/time_domain/ --output-dir data/spectrogram_domain/
# Evaluate a trained model
deepextractor-evaluate --model DeepExtractor_257 --checkpoint-dir checkpoints/ --data-dir data/
Citation
@article{s91m-c2jw,
title = {Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning},
author = {Dooney, Tom and Narola, Harsh and Bromuri, Stefano and Curier, R. Lyana and Van Den Broeck, Chris and Caudill, Sarah and Tan, Daniel Stanley},
journal = {Phys. Rev. D},
volume = {112},
issue = {4},
pages = {044022},
numpages = {24},
year = {2025},
month = {Aug},
publisher = {American Physical Society},
doi = {10.1103/s91m-c2jw},
url = {https://link.aps.org/doi/10.1103/s91m-c2jw}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deepextractor-0.1.2.tar.gz.
File metadata
- Download URL: deepextractor-0.1.2.tar.gz
- Upload date:
- Size: 44.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b37fb63fbd53cc400717e4efab8284f2cce362f01de3d037d4c44c198009b3a
|
|
| MD5 |
21b20ffae5dccc9516b0523e1caf087a
|
|
| BLAKE2b-256 |
f7f13631ba5d7f786e57fc821451ab0eb62c93bf274a9386fe4f9240f49864f5
|
Provenance
The following attestation bundles were made for deepextractor-0.1.2.tar.gz:
Publisher:
publish.yml on tomdooney95/deepextractor
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deepextractor-0.1.2.tar.gz -
Subject digest:
5b37fb63fbd53cc400717e4efab8284f2cce362f01de3d037d4c44c198009b3a - Sigstore transparency entry: 1206178616
- Sigstore integration time:
-
Permalink:
tomdooney95/deepextractor@13ad4526a56df5f878adda3be247884291e01e26 -
Branch / Tag:
refs/tags/v0.1.2 - Owner: https://github.com/tomdooney95
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@13ad4526a56df5f878adda3be247884291e01e26 -
Trigger Event:
push
-
Statement type:
File details
Details for the file deepextractor-0.1.2-py3-none-any.whl.
File metadata
- Download URL: deepextractor-0.1.2-py3-none-any.whl
- Upload date:
- Size: 43.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6f916b4b2a65f87dc7c9308662e311edaf5238c7cd26aeb54e68229a86554488
|
|
| MD5 |
5300d6a5895e10d0d280103e5658cc79
|
|
| BLAKE2b-256 |
72ed5f8209dccbafed3828b99e43ed0bdfe8d94296a74a615b0efc6d515748bc
|
Provenance
The following attestation bundles were made for deepextractor-0.1.2-py3-none-any.whl:
Publisher:
publish.yml on tomdooney95/deepextractor
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deepextractor-0.1.2-py3-none-any.whl -
Subject digest:
6f916b4b2a65f87dc7c9308662e311edaf5238c7cd26aeb54e68229a86554488 - Sigstore transparency entry: 1206178623
- Sigstore integration time:
-
Permalink:
tomdooney95/deepextractor@13ad4526a56df5f878adda3be247884291e01e26 -
Branch / Tag:
refs/tags/v0.1.2 - Owner: https://github.com/tomdooney95
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@13ad4526a56df5f878adda3be247884291e01e26 -
Trigger Event:
push
-
Statement type: