Skip to main content

A Python library that allows to extract data from satellites, exploiting machine parallelism and relying on Google Earth Engine APIs.

Project description

logo

License: MIT maintained stars


This library provides an easy-to-use, comprehensive, and flexible way to work with satellites data (for now: S1_GRD, S2_SR and LANDSAT8_T1_TOA). Its key advantages include: a well-documented API, support for the mainly used satellites, open-source code, and regular updates.
In addition to the already mentioned advantages, the implemented code relies on Google Earth Engine (GEE) APIs (used to access satellites data and perform some tasks such as cloud masking, image compositing, and time series selection) and exploits multi-threading (designed to work efficiently, by making a lot of parallel I/O requests to GEE).

These advantages make it an excellent tool for anyone working with satellites data, since it allows to generate datasets that can be both easily used for data analysis and efficiently integrated with well-known ML libraries, to deploy models.

How to install it?

$ pip install ee-satellites

This library is composed of lot of functions, the main ones are here described.



get_features() function

It allows to get from an input pandas DataFrame composed of fields information, an output DataFrame that contains for each time a selected satellite (sentinel-1, sentinel-2 or landsat-8) passed over the specified fields, within a given time period, all the mean values of some of the most used indexes (optical, radar or thermal).

The filters_params parameter is a list (default: None) containing the values of the Earth Engine filters to be used for extracting Image Collections, with the specified sentinel satellites. For Sentinel-2 the first parameter in the list is the value of the CLOUDY_PIXEL_PERCENTAGE filter (LESS OR EQUAL TO - values in range [0, 100]), whereas for Sentinel-1 the first parameter in the list is the value of the orbitProperties_pass filter (ASCENDING or DESCENDING).

The fields_threads parameter (default: 4) is the number of threads to dedicate to parallelization of GEE API requests over the fields level, the remaining ones instead are used to apply parallelization over dates level. The value of this parameter should be high (with respect to the overall number of threads exploitable) if you have a lot of crop fields but a little time-span to consider, whereas if you have fewer fields but a bigger time-span you should decrease this parameter. Finally, if you have a lot of fields with a lot of dates to process it is optimal to consider half of the overall number of threads available on the device used to perform the I/O requests.
A correct choice of this parameter can drastically reduce the features extraction time.

See the code for a better understanding.

Input DataFrame

The input DataFrame, lets suppose named fields_df, should be structured as follows (just columns position matters):

field_name polygon_coordinates
P-BLD [(-4.202723286616649, 43.39683579015289), (-4...]
P-BLLT1 [(-4.085622203603083, 43.429605845026266), (-4...]
P-BLLT2 [(-4.084840437376829, 43.430826294936246), (-4...]
P-CBRCS1 [(-4.200826431306206, 43.39067464298489), (-4....]
... ...
P-VNS [(-4.151167740565273, 43.40535762666503), (-4....]

How to execute it?

First you have to initialize and authorize the Google Earth Engine APIs.

import ee, ee_satellites

ee.Authenticate()
ee.Initialize()

Then, supposing that you have already loaded the fields_df pandas DataFrame, you have just to run the following code. Change the satellite parameter value in case you want sentinel-1, or landasat-8, indexes extracted for the fields specified (within the time period selected - be careful that the given date format is consistent with ISO 8601 notation). See the above general description, instead, to comprehend what filters_params and fields_threads parameters are useful for.

df = ee_satellites.get_features(fields_df, '2022-01-01', '2022-12-31', satellite='sentinel-2', filters_params=['40'], fields_threads=3)

Output DataFrame

The output generated DataFrame, namely df, will be structured as follows:

field_name s2_acquisition_date B1 B2 B3 ... MCARI2 BSI GLI ALTERATION SDI
P-BLD 2022-01-06 2.260204 119.981293 550.044218 ... 0.922907 6282.955014 0.513069 2.131255 3484.508503
P-BLD 2022-01-16 13.785714 111.540816 528.481293 ... 0.921371 6545.200602 0.517949 2.127614 3636.382653
... ... ... ... ... ... ... ... ... ... ...
P-VNS 2022-11-17 1133.057399 1364.994619 1726.755157 .... 0.594553 11195.020251 0.070841 1.441033 1939.695067
P-VNS 2022-12-17 955.040359 1208.792825 1617.324664 ... 0.688394 10547.592756 0.107797 1.505526 2913.218834


What's new?

  • version: 0.0.1:

    • This libary comes from the old one (named sentinel-satellites) where Landsat-8 indexes have been added
  • versions: 0.0.2 to 0.0.6:

    • Improved descriptions
    • Added logo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ee_satellites-0.0.6.tar.gz (15.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ee_satellites-0.0.6-py3-none-any.whl (17.5 kB view details)

Uploaded Python 3

File details

Details for the file ee_satellites-0.0.6.tar.gz.

File metadata

  • Download URL: ee_satellites-0.0.6.tar.gz
  • Upload date:
  • Size: 15.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.2

File hashes

Hashes for ee_satellites-0.0.6.tar.gz
Algorithm Hash digest
SHA256 ff5493a92020c3a0d5f5432b65b4367490b7638eb48e09aed55939c5b21971dd
MD5 abd8e6c7f9471fae802a1a1f2734f23d
BLAKE2b-256 8b3bffb928ffff770941047bb22c2877f03e507aed0e4ce85155a499e763f28b

See more details on using hashes here.

File details

Details for the file ee_satellites-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: ee_satellites-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 17.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.2

File hashes

Hashes for ee_satellites-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 3eab0adecb55f21df9a991f17f15c908fe9ce9deef050408c1ad72efdbbff0b6
MD5 d32fbdc1658fea84f76892059b8bd058
BLAKE2b-256 a370ce260f4e85116db89f8419322e3dfa4aff777cf1d55411c95952120afa41

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page