MARS-S2L Plume detection, segmentation and quantification of methane plumes in Sentinel-2 and Landsat images
Project description
MARS-S2L
This package contains the operational UNEP International Methane Emissions Observatory (IMEO) pipeline for methane plume detection, segmentation, and quantification in Sentinel-2 and Landsat satellite imagery. The methods and models implemented in this package are described in:
Allen, Mateo-García, Irakulis-Loitxate et al., Artificial intelligence for methane detection: from continuous monitoring to verified mitigation 10.48550/arXiv.2511.21777
Licence
The MARS-S2L database and all trained models are released under a Creative Commons Non-Commercial Share-Alike licence. The code in this package is published under a GNU Lesser GPL v3 licence.
Install
Create the marss2lpy312 virtual environment with conda:
conda env create -f environment/conda_vm.yaml
Install marss2l package:
pip install marss2l
End-to-end inference 
This tutorial demonstrates an end-to-end example of the UNEP IMEO pipeline for detecting methane plumes in Sentinel‑2 and Landsat imagery. It downloads, compute the multi-band, multi-pass (MBMP, Irakullis‑Loitxate et al., 2022) retrieval, run inference with the MARS-S2L model (Allen et al., 2025), quantify the retrieval ΔXCH₄ and estimate the flux rate of detected plumes (Gorroño et al., 2023).
Important considerations:
- The MARS‑S2L model produce false positives.
- Therefore any positive model output must be validated before its use.
- At UNEP IMEO, all plumes are manually validated by at least two independent remote sensing analysts before their publication.
Dataset 🤗
The MARS-S2L database and all trained models are released under a Creative Commons Non-Commercial Share-Alike licence.
Currently the dataset is publicly available in this Hugging-Face🤗 repo: UNEP-IMEO/MARS-S2L.
To download the entire dataset and trained models (~100GB) run:
hf download --local-dir /path/to/localdir/MARS-S2L \
--repo-type dataset UNEP-IMEO/MARS-S2L
The following notebooks show how to load and plot images using the torch.Dataset directly from the Hugging Face repository: notebooks/examples/plot_images_dataset_train.ipynb and notebooks/examples/plot_plumes_dataset_test.ipynb.
Retrain the MARS-S2L model
We trained the model on a A100 GPU with 80Gb RAM. In order to re-train the proposed model and the baseline run:
wandb init # select the wandb project to display the training metrics
python -m marss2l.reproduce_training --csv_path /path/to/dataset/validated_images_all.csv\
--batch_size 128 --num_workers 16 --num_workers_val 8\
--batch_size_val 256 --n_samples_per_epoch_train 65536\
--basename_dir train_logs # Where to store the training outputs
This code will train 4 models the proposed MARS-S2L model and the CH4Net model both trained on the train split of the dataset. In addition it will evaluate these models in the test split of the dataset. After the training the content of the --basename_dir folder should be:
train_logs/
├── marss2l
│ ├── best_epoch
│ ├── config_experiment.json
│ └── preds_test_2023.csv
├── ch4net
│ ├── best_epoch
│ ├── config_experiment.json
│ └── preds_test_2023.csv
├── ch4net_sim
│ ├── best_epoch
│ ├── config_experiment.json
│ └── preds_test_2023.csv
└── marss2l_nosim
├── best_epoch
├── config_experiment.json
└── preds_test_2023.csv
Where:
best_epochare the weights of the trained modelconfig_experiment.jsonis the metadata with the configuration of the experimentpreds_test_2023.csvare the model predictions for all entries in the test set
Reproduce the figures
In the folder notebooks/figures there is the code to reproduce most of the figures of the paper. These notebooks read the data directly from the Hugging Face repository. See this README.
Load and eval trained models
The MARS-S2L model of the paper is stored in this Hugging-Face🤗 repository in the trained_models folder. The files are stored following the same naming convention as described above.
In order to re-run the evaluation in the test set with the trained model proposed see this README.
Dataset structure
The file validated_images_all.csv contains the metadata of all the items in the dataset. Each item comprises a Sentinel-2 or Landsat image of 200x200 pixels
together with its concatenated background image and the cloud mask. The methane enhancement image (ΔXCH₄) in ppb is also provided for positive samples. For positive samples, a vector multipolygon with the extent of the plume in WKT format is provided. In addition, each item
includes several metadata described in the next section. (Images metadata description).
The file validated_images_plumes.csv contains all validated plumes in the dataset. Notice that there are few images that contain more than one plume.
The content of this file is described in the Plumes metadata description section.
Images metadata description
Column description of file validated_images_all.csv:
| Name | Description | Example |
|---|---|---|
| s2path | Path to S2 or Landsat image concatenated with the background image. S2 L1C image with bands: B02, B03, B04, B08, B11, B12. Landsat TOA reflectance image with bands B02, B03, B04, B05, B06, B07 | data/train_2023/43/b82be820-fe9c-411a-af20-82cf162760a7_ea782bfb-21b6-46c5-a6ed-d3b8ef8815a0_s2.tif |
| cloudmaskpath | Path to cloud mask generated with CloudSEN12 with interpretation {0 : "clear", 1: "Thick cloud", 2: "Thin cloud", 3: "Cloud shadow"} |
data/train_2023/43/b82be820-fe9c-411a-af20-82cf162760a7_ea782bfb-21b6-46c5-a6ed-d3b8ef8815a0_cloudmask.tif |
| ch4path | Path to the ΔXCH₄ enhancement image in ppb. Only provided for positive samples (isplume True). |
data/train_2023/43/b82be820-fe9c-411a-af20-82cf162760a7_ea782bfb-21b6-46c5-a6ed-d3b8ef8815a0_ch4.tif |
| wind_u | U component of the wind at 10m | -1.87 |
| wind_v | V component of the wind at 10m | 3.76 |
| wind_source | Source where the wind data is downloaded from. | ECMWF/ERA5_LAND/HOURLY |
| vza | Viewing zenith angle. | 5.40 |
| sza | Solar zenith angle. | 16.32 |
| percentage_clear | Percentage of pixels clear according to CloudSEN12 mask. | 100.0 |
| tile | Name of the S2 or Landsat product | S2A_MSIL1C_20240613T100031_N0510_R122_T32RNS_20240613T134114 or LC08_L1TP_192040_20190601_20200828_02_T1 |
| isplume | Whether there is a methane plume on the image or not | True |
| ch4_fluxrate | Flux rate in kg/h if there is a plume in the tile. | 7756.50 |
| ch4_fluxrate_std | Standard deviation of the fluxrate. | 2941.19 |
| satellite | Name of the satellite | S2A, S2B, LC08 or LC09 |
| tile_date | Date of acquisition | 2024-06-13 10:00:31+00:00 |
| notified | If the image contains a plume, whether the plume was notified or not | False |
| id_location | UUID of the site | b82be820-fe9c-411a-af20-82cf162760a7 |
| last_update | Last time this registry was modified | 2025-05-20 10:00:31+00:00 |
| location_name | Code of the location. | A_10 |
| country | Country of the location | Algeria |
| lon | Longitude of the center of the location | 9.783133 |
| lat | Latitude of the center of the location | 28.087894 |
| offshore | Whether the location is offshore or onshore. | False |
| sector | Sector of the observation | Oil and Gas, Coal, Waste or Unknown |
| observability | Given observability to the image. Clear and cloudy is computed automatically with the cloudmask (50% threshold). Additionally the analyst can change this category (e.g. if the cloudmask is not correct or if there're artifacts in the scene such as snow, smoke, shadows...). | clear, cloudy, bad_retrieval, smoke, shadow, snow, out_of_swath |
| background_image_tile | Name of the S2 or Landsat product used as background. | S2A_MSIL1C_20240603T100031_N0510_R122_T32RNS_20240603T134002 |
| crs | Coordinate reference system of the image | EPSG:32632 |
| transform_a | Geotransform of the image | 10.0 |
| transform_b | Geotransform of the image | 0.0 |
| transform_c | Geotransform of the image | 575940.0 |
| transform_d | Geotransform of the image | 0.0 |
| transform_e | Geotransform of the image | -10.0 |
| transform_f | Geotransform of the image | 3108180.0 |
| width | Width of the image | 200 |
| height | Height of the image | 200 |
| window_row_off | Top left row of the location of the plume within the image (used for the simulation). | 0.0 |
| window_col_off | Top left column of the location of the plume within the image (used for the simulation). | 62.0 |
| window_height | Height in pixels of the plume within the image (used for the simulation). | 101.0 |
| window_width | Width in pixels of the plume within the image (used for the simulation). | 46.0 |
| id_loc_image | Id of the image | ea782bfb-21b6-46c5-a6ed-d3b8ef8815a0 |
| plume | Vector WKT representation of the plume. | MULTIPOLYGON (((9.781389 28.090856, 9.781646 28.090288, 9.781861 28.089796, 9.781818 28.089134, 9.781989 28.088622, 9.782268 28.088187, 9.782633 28.087903, 9.783105 28.087865, 9.783362 28.088244, 9.783083 28.088774, 9.78304 28.089436, 9.783148 28.090061, 9.783384 28.090705, 9.783663 28.091424, 9.783856 28.092522, 9.783642 28.093299, 9.783234 28.093905, 9.783106 28.094794, 9.783041 28.095571, 9.782312 28.096195, 9.78154 28.096422, 9.780896 28.096934, 9.779888 28.097104, 9.779352 28.096915, 9.779759 28.096063, 9.780596 28.09523, 9.781132 28.094472, 9.780896 28.093867, 9.780896 28.09309, 9.781154 28.092314, 9.781239 28.091784, 9.781154 28.091292, 9.781389 28.090856))) |
| footprint | Vector WKT representation of the footprint of the image | POLYGON ((9.772934575210368 28.09701036650939, 9.7728039936767 28.07877712456095, 9.793361656935707 28.07865982254285, 9.7934957112434 28.09689297500473, 9.772934575210368 28.09701036650939)) |
| split_name | Name of the split the item belongs to. One of 'train_2023', 'test_2023', 'val_2023' or 'Not Used' | 'train_2023' |
| case_study | Case study area of the same. | Turkmenistan |
Plumes metadata description
Column description of file validated_images_plumes.csv.
| Name | Description | Example |
|---|---|---|
| id_plume | UUID of the plume | 652fd72a-4a24-4938-8646-84309f899914 |
| id_loc_image | UUID of its corresponding image | 37ecaefe-619e-4dc7-90f1-d57bd48dbf0f |
| tile | Name of the S2 or Landsat product | S2A_MSIL1C_20240613T100031_N0510_R122_T32RNS_20240613T134114 or LC08_L1TP_192040_20190601_20200828_02_T1 |
| tile_date | Date of acquisition | 2024-06-13 10:00:31+00:00 |
| satellite | Name of the satellite | S2A, S2B, LC08 or LC09 |
| lat | Latitude of the source of the plume | 28.088329877970512 |
| lon | Longitude of the source of the plume | 9.78350881586494 |
| ch4_fluxrate | Estimated flux rate of the plume in kg/h | 9639.58562343052 |
| ch4_fluxrate_std | Estimated std of the flux rate of the plume in kg/h | 3964.424919968837 |
| geometry | Plume shape | MULTIPOLYGON (((9.782459 28.090423, 9.782716 28.089211, 9.782802 28.088378, 9.783746 28.088151, 9.78396 28.089249, 9.784217 28.090953, 9.784475 28.092316, 9.784432 28.093528, 9.783788 28.095156, 9.782802 28.095383, 9.78293 28.09402, 9.782716 28.093339, 9.782201 28.092203, 9.782416 28.09118, 9.782459 28.090423))) |
| last_update | Last time this registry was updated | 2025-05-19 05:30:16.151729+00:00 |
| window_row_off | Pixel upper left row coordinate within the image where the plume is located | 15 |
| window_col_off | Pixel upper left column coordinate within the image where the plume is located | 88 |
| window_height | Number of columns from the upper left coordinate | 82 |
| window_width | Number of rows from the upper left coordinate | 25 |
Licence
The MARS-S2L database, all trained models and tutorials are released under a Creative Commons Non-Commercial Share-Alike licence.
The code in this package is published under a GNU Lesser GPL v3 licence .
Cite
If you find this work useful, please cite:
@article{allen_2025,
title = {Artificial intelligence for methane detection: from continuous monitoring to verified mitigation},
author = {Allen, Anna and Mateo-Garcia, Gonzalo and Irakulis-Loitxate, Itziar and Martin, Manuel Montesino-San and Watine, Marc and Requeima, James and Gorroño, Javier and Randles, Cynthia and Mokalled, Tharwat and Guanter, Luis and Turner, Richard E. and Cifarelli, Claudio and Caltagirone, Manfredi},
url = {http://arxiv.org/abs/2511.21777},
doi = {10.48550/arXiv.2511.21777},
month = nov,
year = {2025}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file marss2l-0.2.1.tar.gz.
File metadata
- Download URL: marss2l-0.2.1.tar.gz
- Upload date:
- Size: 157.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
136140c760585a2a6f505ce73e804121da903cb7cd5fe5f097802229070842a8
|
|
| MD5 |
a1485a508aad6b09f58e3280383df80e
|
|
| BLAKE2b-256 |
86ad990b53d3f25edd1410a0752314df3c4d38d01a989475573d65307370eec3
|
File details
Details for the file marss2l-0.2.1-py3-none-any.whl.
File metadata
- Download URL: marss2l-0.2.1-py3-none-any.whl
- Upload date:
- Size: 157.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a5ec0f9b080fc5812cac31aa7703491060ba93ca22d65f380c3f92aa79b44e49
|
|
| MD5 |
bdbc1636b3d2c10821c41a9408e5e39d
|
|
| BLAKE2b-256 |
397d9fe4c0b7fa9f3b363007192a2f52eb15ecc1def38302d8f005ab8ed8b262
|