Skip to main content

Remote Sensing Data-Fetcher and Data-Loader for Joint Classification of Hyperspectral and LiDAR Data

Project description

fs-fusion-datasets

PyPI - Version PyPI - Downloads PyPI - Python Version GitHub Created At GitHub License

fs-fusion-datasets is a remote sensing data-fetcher and data-loader for joint classification of hyperspectral and lidar data. rs-fusion-datasets (formerly known as fetch_houston2013) is a Python package that:

  1. Automatically downloads and load many multimodel remote sensing datasets (houston, muufl, trento, berlin, augsburg, etc.)
  2. Provides ready-to-use torch dataloaders
  3. Provides some utils for visulization, dataset spilit, etc.

screenshot

Supported Datasets

  1. Houston 2013: fetch_houston2013
  2. Trento: fetch_trento
  3. Muufl: fetch_muufl
  4. Houston 2018 (Experimental): fetch_houston2018_ouc
  5. Augsberg (Experimental): fetch_augsberg_ouc
  6. Berlin (Experimental): fetch_berlin_ouc

Quick Start

  1. Install this package
pip install rs-fusion-datasets
  1. import and get the dataset
from rs_fusion_datasets import fetch_houston2013, fetch_muufl, fetch_trento, split_spmatrix
# For Houston 2013
hsi, dsm, train_label, test_label, info = fetch_houston2013()
# For Muufl and Trento
casi, lidar, truth, info = fetch_muufl()
train_label, test_label = split_spmatrix(truth, 20)
# For fetch_houston2018_ouc, fetch_augsberg_ouc, fetch_berlin_ouc
hsi, dsm, train_label, test_label, all_label, info = fetch_houston2018_ouc()
  1. Tips: train_label and test_label are sparse matrix, you can either convert them to np.array easily by
train_label=train_label.todense()
test_label =test_label.todense()

or directly use them for getting the value in a very fast way:

    def __getitem__(self, index):
      i = self.truth.row[index]
      j = self.truth.col[index]
      label = self.truth.data[index].item()
      x_hsi = self.hsi[:, i, j]
      x_dsm = self.dsm[:, i, j]
      return x_hsi, x_dsm, label

torch

Ready-to-use Torch vison datasets.

from rs_fusion_datasets import Houston2013, Trento, Muufl, Houston2018Ouc, BerlinOuc, AugsburgOuc
dataset = Muufl(subset='train', patch_size=11)
x_h, x_l, y, extras = dataset[0]

utils

  1. lbl2rgb: convert the label dataset to rgb image
  2. read_roi: read exported .txt file of ENVI roi to sparse matrix
  3. split_spmatrix: split a sparse to get the train dataset and test dataset

Help

Star History

Star History Chart

Contribution

We welcome all contributions, including issues, pull requests, feature requests and discussions.

Credits

Houston2013 dataset: https://machinelearning.ee.uh.edu/?page_id=459
paperswithcode: https://paperswithcode.com/dataset/houston
Muufl dataset: https://github.com/GatorSense/MUUFLGulfport
Dafault url of Trento dataset is https://github.com/tyust-dayu/Trento/tree/b4afc449ce5d6936ddc04fe267d86f9f35536afd
The 2013_IEEE_GRSS_DF_Contest_Samples_VA.txt in this repo is exported from original 2013_IEEE_GRSS_DF_Contest_Samples_VA.roi.
Note: If this data is used in any publication or presentation the following reference must be cited:
P. Gader, A. Zare, R. Close, J. Aitken, G. Tuell, “MUUFL Gulfport Hyperspectral and LiDAR Airborne Data Set,” University of Florida, Gainesville, FL, Tech. Rep. REP-2013-570, Oct. 2013.
If the scene labels are used in any publication or presentation, the following reference must be cited:
X. Du and A. Zare, “Technical Report: Scene Label Ground Truth Map for MUUFL Gulfport Data Set,” University of Florida, Gainesville, FL, Tech. Rep. 20170417, Apr. 2017. Available: http://ufdc.ufl.edu/IR00009711/00001.
If any of this scoring or detection code is used in any publication or presentation, the following reference must be cited:
T. Glenn, A. Zare, P. Gader, D. Dranishnikov. (2016). Bullwinkle: Scoring Code for Sub-pixel Targets (Version 1.0) [Software]. Available from https://github.com/GatorSense/MUUFLGulfport/.

License

Copyright 2025 songyz2023

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rs_fusion_datasets-0.9.2.dev7.tar.gz (288.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rs_fusion_datasets-0.9.2.dev7-py3-none-any.whl (33.0 kB view details)

Uploaded Python 3

File details

Details for the file rs_fusion_datasets-0.9.2.dev7.tar.gz.

File metadata

File hashes

Hashes for rs_fusion_datasets-0.9.2.dev7.tar.gz
Algorithm Hash digest
SHA256 8e0fdaff0d7e01d2ab1f113cafa6e2097d02e77a701cb23a1fed6e5654320921
MD5 6f7cd35853df8e71aeb242bd4bb9e09c
BLAKE2b-256 ebe4d611ec16f8ccd1d52a31e35a458b6bcb03e21a17c263817d611fb2f70745

See more details on using hashes here.

File details

Details for the file rs_fusion_datasets-0.9.2.dev7-py3-none-any.whl.

File metadata

File hashes

Hashes for rs_fusion_datasets-0.9.2.dev7-py3-none-any.whl
Algorithm Hash digest
SHA256 4207845e5a70c5730f3e766a3e70c57c51ede36424642d6c4c881814ea15b5d9
MD5 61a036057672b5bff053d170d20b2e10
BLAKE2b-256 a0d6149dc8b219e2ee60f9f6aa23ac46cb2b92ca70cf66dcc695cc73127e5456

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page