Skip to main content

Tools for performing common tasks on solar PV data signals

Project description


Latest Release latest release
License license
Build Status documentation build status CircleCi build status
Code Quality Language grade: Python Total alerts
PyPI Downloads PyPI downloads
Conda Downloads conda-forge downloads

Tools for performing common tasks on solar PV data signals. These tasks include finding clear days in a data set, common data transforms, and fixing time stamp issues. These tools are designed to be automatic and require little if any input from the user. Libraries are included to help with data IO and plotting as well.

There is close integration between this repository and the Statistical Clear Sky repository, which provides a "clear sky model" of system output, given only measured power as an input.

See notebooks folder for examples.


Recommended: Set up conda environment with provided .yml file

Updated March 2021

We recommend setting up a fresh Python virtual environment in which to use solar-data-tools. We recommend using the Conda package management system, and creating an environment with the environment configuration file named pvi-user.yml, provided in the top level of this repository. This will install the statistical-clear-sky package as well.

Additional documentation on setting up the Conda environment is available here.

Please see the Conda documentation page, "Creating an environment from an environment.yml file" for more information.

Installing this project as PIP package

$ pip install solar-data-tools

As of March 6, 2019, it fails because scs package installed as a dependency of cxvpy expects numpy to be already installed. scs issue 85 says, it is fixed. However, it doesn't seem to be reflected in its pip package. Also, cvxpy doesn't work with numpy version less than 1.16. As a work around, install numpy separatly first and then install this package. i.e.

$ pip install 'numpy>=1.16'
$ pip install statistical-clear-sky


Currently, this sofware package requires the use of a commercial software package called MOSEK. The included YAML file will install MOSEK for you, but you will still need to obtain a license. More information is available here:

Installing this project as Anaconda package

$ conda install -c slacgismo solar-data-tools

If you are using Anaconda, the problem described in the section for PIP package above doesn't occur since numpy is already installed. And during solar-data-tools installation, numpy is upgraded above 1.16.


By default, ECOS solver is used, which is supported by cvxpy because it is Open Source.

However, it is found that Mosek solver is more stable. Thus, we encourage you to install it separately as below and obtain the license on your own.

Using this project by cloning this GIT repository

From a fresh python environment, run the following from the base project folder:

$ pip install -r requirements.txt


Users will primarily interact with this software through the DataHandler class.

from solardatatools import DataHandler
from solardatatools.dataio import get_pvdaq_data

pv_system_data = get_pvdaq_data(sysid=35, api_key='DEMO_KEY', year=[2011, 2012, 2013])

dh = DataHandler(pv_system_data)

If everything is working correctly, you should see something like the following

total time: 16.67 seconds
Preprocessing              6.52s
Cleaning                   8.62s
Filtering/Summarizing      1.53s
    Data quality           0.23s
    Clear day detect       0.19s
    Clipping detect        0.21s
    Capacity change detect 0.91s


We use Semantic Versioning for versioning. For the versions available, see the tags on this repository.


See also the list of contributors who participated in this project.


This project is licensed under the BSD 2-Clause License - see the LICENSE file for details

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for solar-data-tools, version 0.5.0
Filename, size File type Python version Upload date Hashes
Filename, size solar_data_tools-0.5.0-py3-none-any.whl (60.1 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size solar-data-tools-0.5.0.tar.gz (50.4 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page