Measured data visualization and pipelines creation
Project description
Pipeline Development and Data Visualization for Time Series in Physical Measurements
Tide is a tool for developing data processing pipelines and visualizing time series data, particularly suited for physical measurements. Key features include:
-
Efficient Data Management
- Organize and select data using a tagging system
-
Pipeline Construction
- Store and retrieve pipelines easily with JSON-based dictionary structures
- Build dynamic pipelines that adjust based on the selected data
-
Interactive Visualization
- Create interactive plots to explore data (plotly)
- Visualize pipeline or slices of pipelines effects on data
-
Custom Data Enrichment
- Integrate external weather data sources
- Implement autoregressive models for gaps filling
- Develop and incorporate custom data processors
Uses pandas DataFrames and Series for robust data handling. bigtree for tags and data selection. Scikit-learn's API for pipeline construction.
Getting started
1- Install ⬇️
pip install python-tide
2- Load and format data 🌲
To begin, load your time series data into a pandas DataFrame, ensuring the index is a DateTimeIndex:
df = pd.read_csv(
"https://raw.githubusercontent.com/BuildingEnergySimulationTools/tide/main/tutorials/getting_started_ts.csv",
parse_dates=True,
index_col=0
)
Rename columns using Tide's tagging system.
The format is:
name__unit__bloc__sub_bloc with tags separated by double underscores.
The order of the tags matters.
The order of tags is important, and you can use "OTHER" as a placeholder
You can use one or several tags.
df.columns = ["Tin__°C__Building", "Text__°C__Outdoor", "Heat__W__Building"]
Plumber objects are used to help us with pipelines building and data visualization
from tide.plumbing import Plumber
plumber = Plumber(df)
Display the data organization as a tree:
plumber.show()
Select data using tags:
plumber.get_corrected_data("°C")
plumber.get_corrected_data("Building")
plumber.get_corrected_data("Tin")
3- Visualizing data 📈
Show data availability:
plumber.plot_gaps_heatmap(time_step='d')
Plot time series with missing data highlighted:
fig = plumber.plot(plot_gaps=True)
fig.show(None)
4- Building and testing Pipelines 🛠️
Create a pipeline dictionary:
pipe_dict = {
"step_1": [["Common_proc_1"], ["Common_proc_2", ["arg1", "arg2"]]],
"step_2": {
"selection_1": [["Proc_selection_1", {"arg": "arg_value"}]]
}
}
Pipeline Rules:
- Use dictionaries for pipeline description
- Keys represent pipeline steps ex.
"step_1" - Step values can be lists (apply to all columns) or dicts (filter columns)
- Processing objects are listed as [class_name, arguments]
Example Pipeline:
- Resample data to 15-minute intervals
- Interpolate temperature gaps ≤ 3 hours
- Fill large Tin gaps using Autoregressive STLForecast
pipe_dict = {
"resample_15min": [["Resample", ["15min"]]],
"interpolate_temps": {
"°C": [["Interpolate", {"gaps_lte": "3h"}]]
},
"ar_tin": {
"Tin": [
[
"FillGapsAR",
{
"model_name": "Prophet",
"resample_at_td": "1h",
"gaps_gte": "3h",
"gaps_lte": "3d"
}
]
]
}
}
plumber.pipe_dict = pipe_dict
Get pipeline using get_pipeline method.
plumber.get_pipeline(verbose=True)
Get pipelines for specific columns
plumber.get_pipeline(select="Building", verbose=True)
Visualize pipeline effects:
plumber.plot(
steps=None,
plot_gaps=True,
steps_2=slice(None, "interpolate_temps"),
plot_gaps_2=True,
verbose=True
)
Step Arguments:
None: No operation (Identity)str: Process until named steplist[str]: Perform specified stepsslice: Process a slice of the pipeline
Compare full pipeline to raw data:
plumber.plot(
steps=None,
plot_gaps=True,
steps_2=slice(None),
plot_gaps_2=True,
verbose=True
)
Sponsors
|
|
The development of this library has been supported by ENSNARE Project, which has received funding from the European Union's Horizon 2020 Research and Innovation Programme under Grant Agreement No. 953193. The sole responsibility for the content of this library lies entirely with the author's view. The European Commission is not responsible for any use that may be made of the information it contains. |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file python_tide-0.1.8.tar.gz.
File metadata
- Download URL: python_tide-0.1.8.tar.gz
- Upload date:
- Size: 71.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c6778874d4d16830ebf0b6431160fa2e79846e938c438932774484e515b197d4
|
|
| MD5 |
a5234b8ec76989690a61b7c956178468
|
|
| BLAKE2b-256 |
670cff8d3035f94337612a8405171a31a9f4895c6dc1b2c0d0c370c972f92b2f
|
Provenance
The following attestation bundles were made for python_tide-0.1.8.tar.gz:
Publisher:
build.yaml on BuildingEnergySimulationTools/tide
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
python_tide-0.1.8.tar.gz -
Subject digest:
c6778874d4d16830ebf0b6431160fa2e79846e938c438932774484e515b197d4 - Sigstore transparency entry: 1393353833
- Sigstore integration time:
-
Permalink:
BuildingEnergySimulationTools/tide@8de65e309081519fc7c7d779532f0aa1a18e274f -
Branch / Tag:
refs/tags/0.1.8 - Owner: https://github.com/BuildingEnergySimulationTools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yaml@8de65e309081519fc7c7d779532f0aa1a18e274f -
Trigger Event:
push
-
Statement type:
File details
Details for the file python_tide-0.1.8-py3-none-any.whl.
File metadata
- Download URL: python_tide-0.1.8-py3-none-any.whl
- Upload date:
- Size: 74.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
318bf6a59e09ea654d95f3975df4e10d8a04dde57649602831f7439dc699ab71
|
|
| MD5 |
b6cd8cf83e524e2d9015c8be51970690
|
|
| BLAKE2b-256 |
6aaa4721c72aa47381f2ef54440dd9ff498a6f1a648436061d434b25df5ea336
|
Provenance
The following attestation bundles were made for python_tide-0.1.8-py3-none-any.whl:
Publisher:
build.yaml on BuildingEnergySimulationTools/tide
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
python_tide-0.1.8-py3-none-any.whl -
Subject digest:
318bf6a59e09ea654d95f3975df4e10d8a04dde57649602831f7439dc699ab71 - Sigstore transparency entry: 1393353839
- Sigstore integration time:
-
Permalink:
BuildingEnergySimulationTools/tide@8de65e309081519fc7c7d779532f0aa1a18e274f -
Branch / Tag:
refs/tags/0.1.8 - Owner: https://github.com/BuildingEnergySimulationTools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yaml@8de65e309081519fc7c7d779532f0aa1a18e274f -
Trigger Event:
push
-
Statement type: