This package provides easy access to fair-weather photovoltaic module output data.

Data Package for "Surrogates for Fair-Weather Photovoltaic Module Output"

Overview

For a detailed explanation of how the data was obtained and preprocessed, please refer to the paper (citation at the bottom). For a quick overview, see the extended abstract in the data package.

  • Around 40GB of fair-weather photovoltaic module output data from PVGIS
  • Distribution of module parameters from privately owned modules (obtained from MaStR and fused with pvoutput data)
  • Additional data for a fixed location (module parameters are still sampled)
  • Additional data for fixed parameters (the distance from the starting location increases)
  • Easy data access using dataloaders (PACKAGE_LINK)
  • Challenge dataset that includes a downward trend
  • Downloadable from archive.org (ID: pv-surrogate-data_dfalkner)

Usage

Install using your favorite package manager:

pip install pv-surrogate-data

Then you can use the data access layer to load the data:

from pathlib import Path

from pv_surrogate_data.dataset import download_data, PVGISGermanyDataset, PVGISDataPackage

def main():
  # downloads all the data from archive.org, may take a while ~60GB
  # the data is stored in a folder next to this script
  data_path = Path(__file__).parent / '.pv_surrogate_data'
  data_path.mkdir(exist_ok=True, parents=True)
  download_data(data_path)

  # lazily loads the data, for more information see constructor
  # Alternatively supports `FixedLocationPVGISGermanyDataset`, `OutwardPointsPVGISGermanyDataset` and `ChallengePVGISGermanyDataset`
  data = PVGISGermanyDataset(PVGISDataPackage(data_path))

  # access all static data via `data.get_all_static_data()`
  all_static = data.get_all_static_data()

  # iterate over the dataset, which supports the collections.abc.Sequence interface
  # additionally supports retrieving parameters (`get_static_data` and `get_all_static_data`)
  for index, sample in enumerate(data):
    print(sample)
    print(data.get_static_data(index))


if __name__ == '__main__':
  main()
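
The loop above relies on the dataset behaving like a `collections.abc.Sequence`. As a rough illustration of what that interface provides (`len()`, integer indexing, and iteration), here is a minimal sketch using a hypothetical stand-in class. `ToyDataset`, its `loads` counter, and the returned dictionaries are assumptions made for this example only; they are not part of `pv_surrogate_data` and do not reflect how the real datasets are implemented.

```python
from collections.abc import Sequence


class ToyDataset(Sequence):
    """Hypothetical stand-in for illustration; NOT the pv_surrogate_data classes."""

    def __init__(self, sample_ids):
        self._sample_ids = list(sample_ids)
        self.loads = 0  # counts how many samples were "loaded" on access

    def __len__(self):
        return len(self._sample_ids)

    def __getitem__(self, index):
        # Integer indices only in this sketch. Out-of-range indices raise
        # IndexError, which the default Sequence.__iter__ uses to stop.
        sample_id = self._sample_ids[index]
        self.loads += 1  # a real dataset might read a file from disk here
        return {"sample_id": sample_id}

    def get_static_data(self, index):
        # Placeholder for per-sample parameters (assumed shape).
        return {"sample_id": self._sample_ids[index]}


if __name__ == "__main__":
    data = ToyDataset(["a", "b", "c"])
    print(len(data))
    for index, sample in enumerate(data):
        print(sample, data.get_static_data(index))
```

Because only `__len__` and `__getitem__` are defined, iteration and membership tests come from the `Sequence` mixin, and nothing is loaded until a sample is actually accessed.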

Data Package Structure

- <top-level-folder>
  - extended_abstract.pdf
  - figures/
    - # contains additional figures of the data 
  # the following directories relate strictly to the data
  - pvgis/  # metadata: german_enriched_{test,train}.parquet
  - pvgis_fixed_location/  # metadata: german_fixed_location.parquet
  - pvgis_outward/  # metadata: german_outward.parquet
  - system_data/
    # necessary for data access
    - german_outward.parquet
    - german_enriched_train.parquet
    - german_enriched_test.parquet
    - german_fixed_location.parquet

    # intermediate results
    - german_starting_points.parquet  # equally spaced starting points for Germany
    - german_total_system_parameter_distribution.parquet  # full parameter distribution for Germany
    - mastr_filtered.parquet  # prefiltered results from the Marktstammdatenregister (MaStR)
    - pvoutput_austrian_systems.parquet  # Austrian systems from pvoutput
    - pvoutput_austrian_systems_meta.parquet  # metadata for Austrian systems from pvoutput
    - pvoutput_german_systems.parquet  # German systems from pvoutput
    - pvoutput_german_systems_meta.parquet  # metadata for German systems from pvoutput
  • The requests stored in the folders {pvgis, pvgis_fixed_location, pvgis_outward} can contain errors (that is, some of the requests failed). The data access layer filters those out.
  • Samples from a module are always saved as a single parquet file named using the sample id in the corresponding system_data file. The exception is the pvgis_outward data, whose file name is defined as f'{sample_id}_{bearing}_{distance}.parquet'.
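
The naming convention described above can be sketched as a small path helper. This is an illustration only: the function name `sample_file`, its parameters, and the assumption that regular samples are stored as `{sample_id}.parquet` directly inside their folder are not confirmed by the package; only the `pvgis_outward` pattern is taken from the description above.

```python
from pathlib import Path


def sample_file(data_root, sample_id, folder="pvgis", bearing=None, distance=None):
    """Build the expected parquet path for one sample (hypothetical helper)."""
    root = Path(data_root)
    if bearing is None and distance is None:
        # Assumed layout: one file per sample, named after the sample id.
        return root / folder / f"{sample_id}.parquet"
    if bearing is None or distance is None:
        raise ValueError("bearing and distance must be given together")
    # Documented pattern for the outward data set.
    return root / "pvgis_outward" / f"{sample_id}_{bearing}_{distance}.parquet"


if __name__ == "__main__":
    print(sample_file("data", 7))
    print(sample_file("data", 7, folder="pvgis_fixed_location"))
    print(sample_file("data", 7, bearing=90, distance=10))
```

In practice the data access layer resolves these files for you, so a helper like this is mainly useful for inspecting individual samples by hand.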

License

You are free to use the produced data or produce your own data using the scripts provided in the main repo.


Authors

Made in Austria by dominik.falkner@risc-software.at

If you use my work, please cite it (for details, please refer to the paper):

Falkner, D., Bögl, M., Langthallner, I., Zenisek, J., Affenzeller, M., 2023, Surrogates for Fair-Weather Photovoltaic Module Output. Lecture Notes in Computer Science
