Skip to main content

Open source remote sensing dataset with benchmarks

Project description

CropHarvest

CropHarvest is an open source remote sensing dataset for agriculture with benchmarks. It collects data from a variety of agricultural land use datasets and remote sensing products.

The dataset consists of 90,480 datapoints, of which 30,899 (34.2%) have multiclass labels. All other datapoints only have binary crop / non-crop labels.

65,690 (73%) of these labels are paired with remote sensing and climatology data, specifically Sentinel-2, Sentinel-1, the SRTM Digital Elevation Model and ERA 5 climatology data.

21 datasets are aggregated into CropHarvest - these are documented here.

More details about CropHarvest and the benchmarks are available in this paper.

Pipeline

The code in this repository

  1. combines the constituent datasets into a single geoJSON file,
  2. exports the associated satellite data from Earth Engine,
  3. combines both datasets to create (X,y) training tuples and
  4. exposes those tuples via a Dataset object.

The pipeline through which this happens is shown below:

All blue boxes are associated with code in this repository. Anything green is data accessible via Zenodo.

There are unique cases where you may need to use the EarthEngineExporter directly, these use cases are demonstrated in the demo_exporting_data.ipynb notebook.

Installation

Linux and MacOS users can install the latest version of CropHarvest with the following command:

pip install cropharvest

Windows users must install the CropHarvest within a conda environment to ensure all dependencies are installed correctly:

conda install 'fiona>=1.5' 'rasterio>=1.2.6'
pip install cropharvest

Getting started

See the demo.ipynb notebook for an example on how to download the data from Zenodo and train a random forest against this data.

For more examples of models trained against this dataset, see the benchmarks.

Contributing

If you would like to contribute a dataset, please see the contributing readme.

License

CropHarvest has a Creative Commons Attribution-ShareAlike 4.0 International license.

Citation

If you use CropHarvest in your research, please use the following citation:

@inproceedings{
    tseng2021cropharvest,
    title={CropHarvest: A global dataset for crop-type classification},
    author={Gabriel Tseng and Ivan Zvonkov and Catherine Lilian Nakalembe and Hannah Kerner},
    booktitle={Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2)},
    year={2021},
    url={https://openreview.net/forum?id=JtjzUXPEaCu}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cropharvest-0.3.0.tar.gz (800.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cropharvest-0.3.0-py3-none-any.whl (806.6 kB view details)

Uploaded Python 3

File details

Details for the file cropharvest-0.3.0.tar.gz.

File metadata

  • Download URL: cropharvest-0.3.0.tar.gz
  • Upload date:
  • Size: 800.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for cropharvest-0.3.0.tar.gz
Algorithm Hash digest
SHA256 aeb94f97c069318963db4a47e81fe200d65f74f737d085508050a498dad80401
MD5 93cb510bf18070854caf64d88b4f5a34
BLAKE2b-256 41082e1526e818104136829e5df69e9c64b444323ae94893e954bd128b43ddf5

See more details on using hashes here.

File details

Details for the file cropharvest-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: cropharvest-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 806.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for cropharvest-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 79a6020bff616f053b5eab34f7b9804342c120399b36f2aaa385a1b929b35f2e
MD5 c7605627915b2dc8a4f60ab2909f4f73
BLAKE2b-256 ae4df6cdb39488e8d770ca83a93cf519c77837fd6b8046b5fb1fbef0af44db84

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page