Skip to main content

Deep learning framework for reconstructing gravitational wave signals and glitches from LIGO detector data

Project description

DeepExtractor

PyPI Documentation License

Deep learning framework for reconstructing gravitational-wave signals and glitches from LIGO detector data.

Built for LIGO's O3 observing run (Hanford and Livingston detectors). Described in the paper:

Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning Dooney, Narola, Bromuri, Curier, Van Den Broeck, Caudill, Tan — Phys. Rev. D 112, 044022 (2025) 10.1103/s91m-c2jw

Documentation | GitHub | PyPI


How it works

LIGO strain data contains both astrophysical signals and instrumental glitches — short-duration noise transients that can mimic or obscure gravitational-wave events. DeepExtractor frames glitch reconstruction as a supervised denoising problem:

Input:  h(t)  = background noise  +  glitch/signal
Output: n̂(t)  = predicted background
Result: ĝ(t)  = h(t) − n̂(t)   ← reconstructed glitch or signal

The model is a U-Net operating on STFT spectrograms (magnitude + phase). The default model, DeepExtractor_257, uses a 4-level U-Net with feature maps [64, 128, 256, 512].


Installation

pip install deepextractor

Requires Python ≥ 3.10 and PyTorch ≥ 2.1. Pretrained weights are downloaded automatically from Hugging Face Hub on first use — no manual step required.

Install from source:

git clone https://github.com/tomdooney95/deepextractor.git
cd deepextractor
pip install -e ".[dev]"

Quickstart

import numpy as np
import deepextractor

# Load model (bilby noise variant by default)
model = deepextractor.DeepExtractorModel()

# Reconstruct — extract the transient from noisy strain
noisy_strain = np.random.randn(8192)             # replace with real data
reconstructed = model.reconstruct(noisy_strain)  # extracted signal
background    = model.background(noisy_strain)   # noise estimate

# One-liner convenience function
reconstructed = deepextractor.reconstruct(noisy_strain)

Two pretrained variants are available:

Variant Use case
bilby_noise (default) Simulated LIGO/Virgo noise, injection studies
real_noise Real LIGO O3 detector data

Bundled dataset

The package ships a sample of the GravitySpy LIGO O3a high-confidence catalogue at assets/data_o3a_sample.csv — 10 H1 examples per glitch class (17 classes, 170 rows total), SNR > 15.

import pandas as pd
import importlib.resources as resources

with resources.path("deepextractor", "assets") as assets:
    df = pd.read_csv(assets / "data_o3a_sample.csv")

print(df["label"].value_counts())

CLI tools

# Train a model
deepextractor-train --model DeepExtractor_257 --data-dir data/spectrogram_domain/

# Generate training data
deepextractor-generate --output-dir data/ --num-train 250000

# Convert time-domain data to spectrograms
deepextractor-specgen --input-dir data/time_domain/ --output-dir data/spectrogram_domain/

# Evaluate a trained model
deepextractor-evaluate --model DeepExtractor_257 --checkpoint-dir checkpoints/ --data-dir data/

Citation

@article{s91m-c2jw,
  title     = {Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning},
  author    = {Dooney, Tom and Narola, Harsh and Bromuri, Stefano and Curier, R. Lyana and Van Den Broeck, Chris and Caudill, Sarah and Tan, Daniel Stanley},
  journal   = {Phys. Rev. D},
  volume    = {112},
  issue     = {4},
  pages     = {044022},
  numpages  = {24},
  year      = {2025},
  month     = {Aug},
  publisher = {American Physical Society},
  doi       = {10.1103/s91m-c2jw},
  url       = {https://link.aps.org/doi/10.1103/s91m-c2jw}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepextractor-0.1.1.tar.gz (44.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deepextractor-0.1.1-py3-none-any.whl (43.1 MB view details)

Uploaded Python 3

File details

Details for the file deepextractor-0.1.1.tar.gz.

File metadata

  • Download URL: deepextractor-0.1.1.tar.gz
  • Upload date:
  • Size: 44.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.1.tar.gz
Algorithm Hash digest
SHA256 82568bc93538cb4c6dbad0e0d2e48a3fbc43c4b06a71bba078ffcb28512e1714
MD5 18acdb23a57ddc9e3964387fcb223383
BLAKE2b-256 ce444c6c22b6349f44de2f311cb7d6a0e7e7b75b1955a8ddb36a87ba76db7421

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.1.tar.gz:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file deepextractor-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: deepextractor-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 43.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 70f343e7dc6e8522e3c3e6f26cfad286aed8bebf418829a334d2758f06bbab12
MD5 b0404fe0236a208ea2335baee3a0b004
BLAKE2b-256 0927728a5a55ebf4110e15715aefd7a82d71c2e1bd89cce4ee26a99a728e74b5

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.1-py3-none-any.whl:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page