octoanalytics

Quantitative analysis for power markets

Project description

octoanalytics is a Python package by Octopus Energy that provides tools for quantitative analysis and risk calculation on energy data. It helps analyze time series of energy consumption, extract relevant features, and predict future consumption using machine learning models.

Key Features

Time-based Feature Engineering: Extracts hour, day, and year features and flags public holidays using a country calendar.
Forecasting Model: Utilizes XGBoost regression models to predict hourly energy consumption.
Model Evaluation: Computes MAPE (Mean Absolute Percentage Error) on the validation and test datasets.

Installation

To install octoanalytics, you can use pip:

pip install octoanalytics

Requirements

Python 3.7 or higher
pandas
numpy
xgboost
scikit-learn
holidays

These dependencies will be automatically installed when you install octoanalytics.

Usage

1. Importing the package

To use octoanalytics, import the eval_forecast function as shown below:

from octoanalytics import eval_forecast

2. Input Data Format

The data required for the function must be a DataFrame with the following columns:

'date': A column containing date-time values in datetime format.
'consumption': A column containing energy consumption values (the target variable).

Example of how the input data should look:

import pandas as pd

data = pd.DataFrame({
    'date': ['2025-01-01 00:00', '2025-01-01 01:00', '2025-01-01 02:00', ...],
    'consumption': [120.5, 115.3, 113.7, ...]
})

data['date'] = pd.to_datetime(data['date'])
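
The short frame above only illustrates the column layout. As a minimal sketch, the snippet below builds a longer synthetic hourly series in the same format and checks it before use. The helpers make_hourly_frame and check_input_frame are hypothetical names, not part of octoanalytics, and the sinusoidal load shape is invented purely for illustration.

```python
import numpy as np
import pandas as pd


def make_hourly_frame(start="2025-01-01", periods=24 * 30, seed=0):
    """Build a synthetic hourly frame with 'date' and 'consumption' columns (illustrative only)."""
    rng = np.random.default_rng(seed)
    dates = pd.date_range(start, periods=periods, freq=pd.Timedelta(hours=1))
    hours = dates.hour.to_numpy()
    # Made-up daily cycle around 100 units plus a little noise.
    consumption = 100 + 20 * np.sin(2 * np.pi * hours / 24) + rng.normal(0, 2, periods)
    return pd.DataFrame({"date": dates, "consumption": consumption})


def check_input_frame(df):
    """Raise if the frame does not match the format described in this section."""
    missing = {"date", "consumption"} - set(df.columns)
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    if not pd.api.types.is_datetime64_any_dtype(df["date"]):
        raise TypeError("'date' must be a datetime column; use pd.to_datetime first")


data = make_hourly_frame()
check_input_frame(data)
```

A frame built this way can stand in for real data while trying out the package; replace it with your own measurements for any actual analysis.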

3. Main Function: `eval_forecast`

The eval_forecast function trains a machine learning model to forecast energy consumption using XGBoost. Here's how to use it:

model, y_test_pred, y_test, test_mape, y_val_pred, val_mape = eval_forecast(data, country_code='FR')

Parameters

data (pd.DataFrame): A DataFrame containing the columns date and consumption.
country_code (str): The ISO country code used to detect public holidays (default is 'FR' for France).

Return Values

model: The trained XGBoost model.
y_test_pred: The model's predictions on the test set.
y_test: The actual values of the test set.
test_mape: The Mean Absolute Percentage Error (MAPE) of the model on the test set.
y_val_pred: The model's predictions on the validation set.
val_mape: The MAPE of the model on the validation set.

4. Example Usage

import pandas as pd
from octoanalytics import eval_forecast

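# Example data (three rows shown for brevity; replace with your actual dataset,
# which needs many more rows for the 60/20/20 split to be meaningful)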
data = pd.DataFrame({
    'date': ['2025-01-01 00:00', '2025-01-01 01:00', '2025-01-01 02:00'],
    'consumption': [120.5, 115.3, 113.7]
})
data['date'] = pd.to_datetime(data['date'])

# Run the forecast function
model, y_test_pred, y_test, test_mape, y_val_pred, val_mape = eval_forecast(data)

# Print the results
print(f"Validation MAPE: {val_mape:.2f}%")
print(f"Test MAPE: {test_mape:.2f}%")

Detailed Description of `eval_forecast`

The eval_forecast function is used to train a forecasting model for energy consumption using the XGBoost algorithm. Here's how it works:

Data Preprocessing: The function extracts time-based features such as hour, day of the week, month, year, and week of the year. It also adds a binary feature indicating whether a given date is a holiday in the specified country.
Data Splitting: The data is split into three sets:
- Training set: 60% of the data.
- Validation set: 20% of the data.
- Test set: 20% of the data.
Training the XGBoost Model: The model is trained on the training set, with early stopping based on validation data to prevent overfitting.
Model Evaluation: The MAPE (Mean Absolute Percentage Error) is computed on both the validation and test sets.
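
The preprocessing and splitting steps above can be sketched with pandas alone. This is an illustration under stated assumptions, not the package's actual code: the function names add_time_features and chronological_split are hypothetical, the holiday list is a one-date stand-in for a real holiday calendar, and the split is assumed to be chronological (the README does not say whether rows are shuffled).

```python
import pandas as pd

# Stand-in for a holiday calendar (assumption): only New Year's Day 2025.
ASSUMED_HOLIDAYS = [pd.Timestamp("2025-01-01")]


def add_time_features(df):
    """Add hour, day-of-week, month, year, ISO week, and a binary holiday flag."""
    out = df.copy()
    dt = out["date"].dt
    out["hour"] = dt.hour
    out["dayofweek"] = dt.dayofweek
    out["month"] = dt.month
    out["year"] = dt.year
    out["weekofyear"] = dt.isocalendar().week.astype(int)
    # Compare the calendar day (time set to midnight) against the holiday list.
    out["is_holiday"] = dt.normalize().isin(ASSUMED_HOLIDAYS).astype(int)
    return out


def chronological_split(df, train_frac=0.6, val_frac=0.2):
    """Split into train/validation/test by time order (assumed, not confirmed)."""
    df = df.sort_values("date").reset_index(drop=True)
    n = len(df)
    train_end = int(round(n * train_frac))
    val_end = train_end + int(round(n * val_frac))
    return df.iloc[:train_end], df.iloc[train_end:val_end], df.iloc[val_end:]
```

Splitting by time order keeps future observations out of the training set, which is the usual choice for forecasting problems.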

XGBoost Model Parameters

n_estimators: The number of boosting rounds (default is 100).
learning_rate: The learning rate for adjusting tree weights (default is 0.1).
max_depth: The maximum depth of the decision trees (default is 5).

These parameters can be adjusted by modifying the call to the XGBRegressor model in the eval_forecast function.
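
As a rough sketch of what such an adjustment could look like, the snippet below merges overrides into the defaults listed above and passes them to XGBRegressor. The names README_DEFAULTS, merged_params, and build_regressor are hypothetical; how eval_forecast actually constructs its model is not shown in this README.

```python
# Defaults as listed in this README.
README_DEFAULTS = {"n_estimators": 100, "learning_rate": 0.1, "max_depth": 5}


def merged_params(**overrides):
    """Return the README defaults with any keyword overrides applied."""
    return {**README_DEFAULTS, **overrides}


def build_regressor(**overrides):
    """Create an XGBRegressor from the merged parameters."""
    # Imported lazily so the parameter helpers work without xgboost installed.
    from xgboost import XGBRegressor

    return XGBRegressor(**merged_params(**overrides))


# Example: shallower trees and more boosting rounds.
params = merged_params(max_depth=3, n_estimators=300)
```

Calling build_regressor(max_depth=3, n_estimators=300) would then give a model configured with those values and the default learning rate.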

Model Evaluation

The MAPE (Mean Absolute Percentage Error) is calculated on both the validation and test sets. It is expressed as a percentage and provides an indication of how well the model is performing. A lower MAPE value indicates better model performance.
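
For reference, MAPE in its standard form can be computed as follows. This is a generic sketch with a hypothetical function name, mape; the package's own implementation may differ in details such as how zero consumption values are handled.

```python
import numpy as np


def mape(y_true, y_pred):
    """Mean Absolute Percentage Error, returned as a percentage."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Undefined when y_true contains zeros; callers should filter those out first.
    return float(np.mean(np.abs((y_true - y_pred) / y_true)) * 100)


# Errors of 10 on 100 and 20 on 200 are both 10% of the actual value.
score = mape([100, 200], [110, 180])
```

Because each error is divided by the actual value, hours with very low consumption can dominate the score.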

Developer

Author: Jean Bertin
Email: jean.bertin@octopusenergy.fr
Status: In development (planning)

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Contributions

Contributions are welcome! If you would like to suggest a feature or report a bug, please open an issue or submit a pull request.

Release history

0.3.6: Aug 14, 2025
0.3.5: Aug 14, 2025
0.3.4: Aug 12, 2025
0.3.3: Aug 11, 2025
0.3.2: Aug 11, 2025
0.3.1: Aug 11, 2025
0.3.0: Aug 5, 2025
0.2.4: Aug 4, 2025
0.2.3: Aug 1, 2025
0.2.2: Jul 28, 2025
0.2.1: Jul 28, 2025
0.2.0: Jul 28, 2025
0.1.1: May 22, 2025
0.1.0: May 22, 2025
0.0.3: May 12, 2025
0.0.2: May 12, 2025
0.0.1: May 12, 2025 (this version)

Download files

Source Distribution

octoanalytics-0.0.1.tar.gz (5.3 kB)

Uploaded May 12, 2025 Source

Built Distribution

octoanalytics-0.0.1-py3-none-any.whl (5.7 kB)

Uploaded May 12, 2025 Python 3

File details

Details for the file octoanalytics-0.0.1.tar.gz.

File metadata

Download URL: octoanalytics-0.0.1.tar.gz
Upload date: May 12, 2025
Size: 5.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for octoanalytics-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`1533fe89f9374f26f145de9910361f3cd09ed9bee049cfb3cea9991deb9baa36`
MD5	`be02d03f3879e61fec4606cc77884514`
BLAKE2b-256	`99184e22b258f520b01b2080d7e14eb130fbc7e2cc9809ab308a186c543cf2ce`

File details

Details for the file octoanalytics-0.0.1-py3-none-any.whl.

File metadata

Download URL: octoanalytics-0.0.1-py3-none-any.whl
Upload date: May 12, 2025
Size: 5.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for octoanalytics-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1169a69dc780a4d56f5789be4f0e452cffaff72928cf93252049d2677ff3fc14`
MD5	`d08bc4f8dff53bd7e12f551c482d72e2`
BLAKE2b-256	`1d3905d3f579f20f94bd120dbf1fdf99bd591e5233384b5ab561fea5ace90c84`
