Skip to main content

Onsite analysis pipeline for the CTA LST-1

Project description

lstosa

ci Documentation Status coverage Codacy Badge pypi DOI

Onsite processing pipeline for the Large-Sized Telescope prototype (LST-1) of CTAO (Cherenkov Telescope Array Observatory) based on cta-lstchain running on the LST-1 IT onsite data center at Observatorio Roque de los Muchachos (La Palma, Spain). It automatically carries out the next-day analysis of observed data using cron jobs, parallelizing the processing using the job scheduler SLURM. It provides data quality monitoring and tracking of analysis products' provenance. Moreover, it also massively reprocesses the entire LST-1 dataset with each cta-lstchain major release.

Install

We recommend using an isolated conda environment.

  • Install mamba/miniconda first.

  • Clone the repository, create and activate the conda environment using the environment.yml file:

    git clone https://github.com/cta-observatory/lstosa.git
    cd lstosa
    conda env create -n osa -f environment.yml
    conda activate osa
    

Then install lstosa as a user with: pip install lstosa, or as a developer with: pip install -e .. To install test, docs dependencies use pip install -e .[test], pip install -e .[doc] or simply pip install -e .[all]

In case you want to install the lstchain development version instead of a fixed tag, you can run inside the osa environment:

pip install git+https://github.com/cta-observatory/cta-lstchain

To update the environment (provided dependencies get updated), use:

conda env update -n osa -f environment.yml

Note to developers: to enforce a unique code convention, please install pre-commit (pre-commit install) after cloning the repository and creating the conda environment. This will black the committed files automatically.

Workflow management

lstosa workflow is handled daily by the sequencer script, which identifies which observations are to be processed, generates the analysis workflow, and submits the jobs. A first calibration job produces the daily calibration coefficients. Subsequently, data reconstruction jobs are scheduled on a subrun-wise basis (1 job corresponds to around 10 seconds of observed data, and its processing up to DL2 takes about 30-40 mins).

flowchart LR

    daq --> osa_seq
    osa_seq --> slurm --> osa_closer
    daq[DAQ]


    subgraph osa_seq [sequencer]
        direction TB
    A(Daily observation summary)
    B(Generate workflow)
    C(Submit jobs)
    A --> B --> C
    end

    subgraph slurm [SLURM parallel processing]
        direction TB
    H(Calibration sequence)
    I(Reconstruction sequences)
    H --> I
    end

    subgraph osa_closer [autocloser]
        direction TB
    D(Check job completion)
    E(Move files to final directories)
    F(Merge files)
    G(Parse provenance logs)
    D --> E --> F --> G
    end

Usage

To use lstosa, you will first need to symlink some auxiliary files in a similar directory tree structure to the standard data production and set the paths correctly in your lstosa configuration file. Then to process all the runs from a given date, you can run the following command (use first the --simulate option to dry-run without actually submitting jobs):

sequencer --config your_osa_config.cfg --date YYYY-MM-DD LST1

Once all jobs finish, the autocloser script checks job completion, merges files, moves them to their final directories, and parses provenance logs.

autocloser --config your_osa_config.cfg --date YYYY-MM-DD LST1

Dataflow

graph LR

    subgraph DAQ
    D1[R0]
    D2[DRS4 calib run]
    D3[Pedestal calib run]
    D4[Pointing log]
    end

    D2 --> C1
    D3 --> C2

    subgraph Calibration
    C1[DRS4 baseline correction]
    C2[Calibration charge coeffitiens]
    C1 --> C2
    end

    subgraph lstMCpipe
    M1[gamma DL2 MC]
    M2[RF models]
    end

    subgraph Sky-data reconstruction
    S1[DL1a]
    S2[DL1b]
    S3[muons]
    S4[DL1 check]
    S5[DL2]
    S6[DL3]
    S7[IRF]
    D1 & D4 & C1 & C2 --> S1
    S1 --> S3
    S1 --> S2
    S2 & S3 --> S4
    S2 ---> S5
    S5 --> S6
    S7 --> S6
    M2 --> S5
    M1 --> S7
    end

    subgraph High-level Gammapy
    DL4
    DL5
    S6 --> DL4
    DL4 --> DL5 --> ...
    end

Warning: standard production of DL3 data and higher-level results is still under development.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lstosa-0.10.15.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

lstosa-0.10.15-py3-none-any.whl (126.2 kB view details)

Uploaded Python 3

File details

Details for the file lstosa-0.10.15.tar.gz.

File metadata

  • Download URL: lstosa-0.10.15.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for lstosa-0.10.15.tar.gz
Algorithm Hash digest
SHA256 a80c572adec8f4a714d05c779d357cdc0fa4a7ebf641ff160e771dabcf1dd0e5
MD5 f4f638fc9a172c2d9b4dfb3d878c81a3
BLAKE2b-256 54ef6908136ef7fb2f8ffa9c72aae7d3b7f99c45c0ebd616d35da26d6bd3587b

See more details on using hashes here.

File details

Details for the file lstosa-0.10.15-py3-none-any.whl.

File metadata

  • Download URL: lstosa-0.10.15-py3-none-any.whl
  • Upload date:
  • Size: 126.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for lstosa-0.10.15-py3-none-any.whl
Algorithm Hash digest
SHA256 e492e099ae74b8f8809d2ad49d93472f7a4c18deaf1dd3f8752d8880f082bf49
MD5 ba2c13fc3981ece276a64bac4b7c0460
BLAKE2b-256 56113638883c411034477abc53fd4b3307c8d47ff16cfb22d956887321944bfe

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page