Skip to main content

Deep learning framework for reconstructing gravitational wave signals and glitches from LIGO detector data

Project description

DeepExtractor

PyPI Documentation License

Deep learning framework for reconstructing gravitational-wave signals and glitches from LIGO detector data.

Built for LIGO's O3 observing run (Hanford and Livingston detectors). Described in the paper:

Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning Dooney, Narola, Bromuri, Curier, Van Den Broeck, Caudill, Tan — Phys. Rev. D 112, 044022 (2025) 10.1103/s91m-c2jw

Documentation | GitHub | PyPI


How it works

LIGO strain data contains both astrophysical signals and instrumental glitches — short-duration noise transients that can mimic or obscure gravitational-wave events. DeepExtractor frames glitch reconstruction as a supervised denoising problem:

Input:  h(t)  = background noise  +  glitch/signal
Output: n̂(t)  = predicted background
Result: ĝ(t)  = h(t) − n̂(t)   ← reconstructed glitch or signal

The model is a U-Net operating on STFT spectrograms (magnitude + phase). The default model, DeepExtractor_257, uses a 4-level U-Net with feature maps [64, 128, 256, 512].


Installation

pip install deepextractor

Requires Python ≥ 3.10 and PyTorch ≥ 2.1. Pretrained weights are downloaded automatically from Hugging Face Hub on first use — no manual step required.

Install from source:

git clone https://github.com/tomdooney95/deepextractor.git
cd deepextractor
pip install -e ".[dev]"

Quickstart

import numpy as np
import deepextractor

# Load model (bilby noise variant by default)
model = deepextractor.DeepExtractorModel()

# Reconstruct — extract the transient from noisy strain
noisy_strain = np.random.randn(8192)             # replace with real data
reconstructed = model.reconstruct(noisy_strain)  # extracted signal
background    = model.background(noisy_strain)   # noise estimate

# One-liner convenience function
reconstructed = deepextractor.reconstruct(noisy_strain)

Two pretrained variants are available:

Variant Use case
bilby_noise (default) Simulated LIGO/Virgo noise, injection studies
real_noise Real LIGO O3 detector data

Bundled dataset

The package ships a sample of the GravitySpy LIGO O3a high-confidence catalogue at assets/data_o3a_sample.csv — 10 H1 examples per glitch class (17 classes, 170 rows total), SNR > 15.

import pandas as pd
import importlib.resources as resources

with resources.path("deepextractor", "assets") as assets:
    df = pd.read_csv(assets / "data_o3a_sample.csv")

print(df["label"].value_counts())

CLI tools

# Train a model
deepextractor-train --model DeepExtractor_257 --data-dir data/spectrogram_domain/

# Generate training data
deepextractor-generate --output-dir data/ --num-train 250000

# Convert time-domain data to spectrograms
deepextractor-specgen --input-dir data/time_domain/ --output-dir data/spectrogram_domain/

# Evaluate a trained model
deepextractor-evaluate --model DeepExtractor_257 --checkpoint-dir checkpoints/ --data-dir data/

Citation

@article{s91m-c2jw,
  title     = {Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning},
  author    = {Dooney, Tom and Narola, Harsh and Bromuri, Stefano and Curier, R. Lyana and Van Den Broeck, Chris and Caudill, Sarah and Tan, Daniel Stanley},
  journal   = {Phys. Rev. D},
  volume    = {112},
  issue     = {4},
  pages     = {044022},
  numpages  = {24},
  year      = {2025},
  month     = {Aug},
  publisher = {American Physical Society},
  doi       = {10.1103/s91m-c2jw},
  url       = {https://link.aps.org/doi/10.1103/s91m-c2jw}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepextractor-0.1.4.tar.gz (44.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deepextractor-0.1.4-py3-none-any.whl (43.1 MB view details)

Uploaded Python 3

File details

Details for the file deepextractor-0.1.4.tar.gz.

File metadata

  • Download URL: deepextractor-0.1.4.tar.gz
  • Upload date:
  • Size: 44.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.4.tar.gz
Algorithm Hash digest
SHA256 55e057b30299ff21b44440d515c1a435048fa5b31c14c9a5ade6b69389d38875
MD5 82cae86598556f44697c28096cda24ad
BLAKE2b-256 9a4a65d3ab6c65b3dd805d5e9643858c6118d49472acebabeeca8d3ef41c453e

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.4.tar.gz:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file deepextractor-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: deepextractor-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 43.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c439869dddb7f17a92a3e14d8eb4ee7075afa3ff5c3dd2f344ea59b4f0a7ff58
MD5 03474ee575a667242d84d4902aef4ea2
BLAKE2b-256 70c9287a260b1b04a9e6ffee3822f3060a3e2b6fe8cbf29911fd3179164a1948

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.4-py3-none-any.whl:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page