Skip to main content

Deep learning framework for reconstructing gravitational wave signals and glitches from LIGO detector data

Project description

DeepExtractor

PyPI Documentation License

Deep learning framework for reconstructing gravitational-wave signals and glitches from LIGO detector data.

Built for LIGO's O3 observing run (Hanford and Livingston detectors). Described in the paper:

Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning Dooney, Narola, Bromuri, Curier, Van Den Broeck, Caudill, Tan — Phys. Rev. D 112, 044022 (2025) 10.1103/s91m-c2jw

Documentation | GitHub | PyPI


How it works

LIGO strain data contains both astrophysical signals and instrumental glitches — short-duration noise transients that can mimic or obscure gravitational-wave events. DeepExtractor frames glitch reconstruction as a supervised denoising problem:

Input:  h(t)  = background noise  +  glitch/signal
Output: n̂(t)  = predicted background
Result: ĝ(t)  = h(t) − n̂(t)   ← reconstructed glitch or signal

The model is a U-Net operating on STFT spectrograms (magnitude + phase). The default model, DeepExtractor_257, uses a 4-level U-Net with feature maps [64, 128, 256, 512].


Installation

pip install deepextractor

Requires Python ≥ 3.10 and PyTorch ≥ 2.1. Pretrained weights are downloaded automatically from Hugging Face Hub on first use — no manual step required.

Install from source:

git clone https://github.com/tomdooney95/deepextractor.git
cd deepextractor
pip install -e ".[dev]"

Quickstart

import numpy as np
import deepextractor

# Load model (bilby noise variant by default)
model = deepextractor.DeepExtractorModel()

# Reconstruct — extract the transient from noisy strain
noisy_strain = np.random.randn(8192)             # replace with real data
reconstructed = model.reconstruct(noisy_strain)  # extracted signal
background    = model.background(noisy_strain)   # noise estimate

# One-liner convenience function
reconstructed = deepextractor.reconstruct(noisy_strain)

Two pretrained variants are available:

Variant Use case
bilby_noise (default) Simulated LIGO/Virgo noise, injection studies
real_noise Real LIGO O3 detector data

Bundled dataset

The package ships a sample of the GravitySpy LIGO O3a high-confidence catalogue at assets/data_o3a_sample.csv — 10 H1 examples per glitch class (17 classes, 170 rows total), SNR > 15.

import pandas as pd
import importlib.resources as resources

with resources.path("deepextractor", "assets") as assets:
    df = pd.read_csv(assets / "data_o3a_sample.csv")

print(df["label"].value_counts())

CLI tools

# Train a model
deepextractor-train --model DeepExtractor_257 --data-dir data/spectrogram_domain/

# Generate training data
deepextractor-generate --output-dir data/ --num-train 250000

# Convert time-domain data to spectrograms
deepextractor-specgen --input-dir data/time_domain/ --output-dir data/spectrogram_domain/

# Evaluate a trained model
deepextractor-evaluate --model DeepExtractor_257 --checkpoint-dir checkpoints/ --data-dir data/

Citation

@article{s91m-c2jw,
  title     = {Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning},
  author    = {Dooney, Tom and Narola, Harsh and Bromuri, Stefano and Curier, R. Lyana and Van Den Broeck, Chris and Caudill, Sarah and Tan, Daniel Stanley},
  journal   = {Phys. Rev. D},
  volume    = {112},
  issue     = {4},
  pages     = {044022},
  numpages  = {24},
  year      = {2025},
  month     = {Aug},
  publisher = {American Physical Society},
  doi       = {10.1103/s91m-c2jw},
  url       = {https://link.aps.org/doi/10.1103/s91m-c2jw}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepextractor-0.1.2.tar.gz (44.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deepextractor-0.1.2-py3-none-any.whl (43.1 MB view details)

Uploaded Python 3

File details

Details for the file deepextractor-0.1.2.tar.gz.

File metadata

  • Download URL: deepextractor-0.1.2.tar.gz
  • Upload date:
  • Size: 44.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.2.tar.gz
Algorithm Hash digest
SHA256 5b37fb63fbd53cc400717e4efab8284f2cce362f01de3d037d4c44c198009b3a
MD5 21b20ffae5dccc9516b0523e1caf087a
BLAKE2b-256 f7f13631ba5d7f786e57fc821451ab0eb62c93bf274a9386fe4f9240f49864f5

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.2.tar.gz:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file deepextractor-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: deepextractor-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 43.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepextractor-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6f916b4b2a65f87dc7c9308662e311edaf5238c7cd26aeb54e68229a86554488
MD5 5300d6a5895e10d0d280103e5658cc79
BLAKE2b-256 72ed5f8209dccbafed3828b99e43ed0bdfe8d94296a74a615b0efc6d515748bc

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepextractor-0.1.2-py3-none-any.whl:

Publisher: publish.yml on tomdooney95/deepextractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page