UBC Solar's data analysis and querying tools

These details have not been verified by PyPI

Project description

UBC Solar Data Tools

A collection of data querying, analysis, and structuring tools.

Requirements

Versions for dependencies (except Python) indicated are recommended

Git [^1]
Python >=3.9 [^2]
Poetry >=1.8.3 [^3]

Installation

First, clone this repository.

git clone https://github.com/UBC-Solar/data_tools.git

Then, create and activate a virtual environment. This project uses Poetry for dependency management. Next, use Poetry to install dependencies, with --no-root so that the data_tools package does not itself get installed into your virtual environment. This can be omitted if you're sure you know what you are doing.

poetry install --no-root

Optionally, you can install dependencies for building the documentation and running tests.

poetry install --no-root --with docs --with test

Getting Started

Example of querying data and plotting it as a TimeSeries.

When the InfluxClient class is imported, data_tools will attempt to locate a .env file in order to acquire an InfluxDB API token. If you do not have a .env or it is missing an API token, you will not be able to query data. UBC Solar members can acquire an API token by speaking to their Team Lead.

from data_tools.collections.time_series import TimeSeries
from data_tools.query.influxdb_query import DBClient

client = DBClient()

# ISO 8601-compliant times corresponding to pre-competition testing
start = "2024-07-07T02:23:57Z" 
stop = "2024-07-07T02:34:15Z"

# We can, in one line, make a query to InfluxDB and parse 
# the data into a powerful format: the `TimeSeries` class.
voltage_data: TimeSeries = client.query_time_series(
    start=start,
    stop=stop,
    field="TotalPackVoltage",
    units="V"
)

# Plot the data
voltage_data.plot()

Example of using the FluxQuery module to make a Flux query that selects and aggregates some data. We will use the FluxStatement class to construct a custom Flux statement, as the aggregateWindow statement is not yet included by this API.

from data_tools.query.flux import FluxQuery, FluxStatement
from data_tools.query.influxdb_query import DBClient
from data_tools.collections.time_series import TimeSeries
import pandas as pd

client = DBClient()

# ISO 8601-compliant times corresponding to pre-competition testing
start = "2024-07-07T02:23:57Z" 
stop = "2024-07-07T02:34:15Z"

# The priority argument defines "where" in the Flux query the statement will get placed. Higher priority -> later
aggregate_flux_statement = FluxStatement('aggregateWindow(every: 10m, fn: mean, createEmpty: false)', priority=5)

query = FluxQuery()\
        .range(start=start, stop=stop)\
        .filter(field="VehicleVelocity")\
        .inject_raw(aggregate_flux_statement)

# We can inspect the Flux query
print(query.compile_query())

# Make the query, getting the data as a DataFrame
query_dataframe: pd.DataFrame = client.query_dataframe(query)

# Now, convert the data into a TimeSeries
measurement_period: float = 1.0 / 5  # VehicleVelocity is a 5Hz measurement, so period is 1.0 / 5Hz.
vehicle_velocity = TimeSeries.from_query_dataframe(query_dataframe, measurement_period, 
                                                   field="VehicleVelocity", 
                                                   units="m/s")

# Plot the data
vehicle_velocity.plot()

Appendix

[^1]: use git --version to verify version

[^2]: use python3 --version to verify version

[^3]: use poetry --version to verify version

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

1.3.1

Oct 19, 2024

1.3.0

Oct 19, 2024

1.2.5

Oct 18, 2024

1.2.4

Oct 13, 2024

1.2.3

Oct 12, 2024

1.2.2

Oct 12, 2024

1.2.1

Oct 12, 2024

1.2.0

Oct 12, 2024

1.0.1

Sep 29, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ubc_solar_data_tools-1.3.1.tar.gz (19.3 kB view details)

Uploaded Oct 19, 2024 Source

File details

Details for the file ubc_solar_data_tools-1.3.1.tar.gz.

File metadata

Download URL: ubc_solar_data_tools-1.3.1.tar.gz
Upload date: Oct 19, 2024
Size: 19.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for ubc_solar_data_tools-1.3.1.tar.gz
Algorithm	Hash digest
SHA256	`0670f0b2f22c62db57f001b6eea6486e2ed0deacb43ee15880ed330a3d56f157`
MD5	`8469d38b7c5d68a9ff1c9519582ba3e7`
BLAKE2b-256	`28f66771833723a0c6cc4dbff1499ecacc71384c25ccbc8af0c400ffcb13e7aa`