Short-Term Forecasting of Regional Electrical Load Based on CatBoost Model

Project description

Short-Term Load Forecasting Based on CatBoost Model Library (STLFLib)

This is a Python STLF machine learning library designed for generating energy consumption bids for the DAM (day-ahead market). The library is distributed under the KSPEU license (RU 2025688100). For commercial use, please contact the author: caapel@mail.ru.


How to install

To install, you can use the command:

pip install stlflib

Or download the repository from GitHub (private access).


Usage

The essence of this project and its library is described in detail in the study Short-Term Forecasting of Regional Electrical Load Based on XGBoost Model.

This file does not provide detailed usage instructions; it only describes each of the library's basic modules.

Dependency

The dependency module (located in the dependency.py file) contains a complete list of dependencies. The module has only one function:

  • print_dependency() - prints the versions of installed dependencies

and five global variables (for remote control of GUI program settings):

  • GUI_model - name of the current working model
  • GUI_max_depth_br3_act - default max_depth for the br3 model
  • GUI_learn_period_br3_act - default learn_period for the br3 model
  • GUI_max_depth_br2_act - default max_depth for the br2 model
  • GUI_learn_period_br2_act - default learn_period for the br2 model
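Module-level variables like these act as a lightweight remote-control surface: a script can override them before the GUI program starts, and the GUI picks up the new defaults at launch. A minimal sketch of the pattern, using a stand-in namespace rather than the real dependency module (the variable values here are assumed, not the library's actual defaults):

```python
# Illustration of module-level settings as remote GUI defaults.
# "stlf_settings" is a stand-in namespace, NOT the real dependency module.
from types import SimpleNamespace

stlf_settings = SimpleNamespace(
    GUI_model="br3",           # name of the current working model (assumed value)
    GUI_max_depth_br3_act=6,   # placeholder default tree depth
    GUI_learn_period_br3_act=365,  # placeholder default training period, days
)

# A GUI program reads these at start-up, so overriding them before launch
# changes its defaults without touching the GUI code itself.
stlf_settings.GUI_max_depth_br3_act = 8
print(stlf_settings.GUI_max_depth_br3_act)
```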

ServiceDB

The serviceDB module (located in the serviceDB.py file) contains a set of tools for working with the database:
Create functions:

  • generate_volume_df(path) - generate a dataframe with archived energy consumption data from prepared .xls-files located at path
  • get_weather(date, verify_HTTPS_request=False, verify_result) - generate a weather archive/forecast (outside air temperature) for the specified date with a sampling frequency of 1 hour
  • get_br_feature(date) - load a BEM (Balancing Energy Market) archive/forecast for the specified date
  • get_RSV_rate(date) - load the unregulated DAM price for the specified date (per month)
  • updating_or_create_df(get_function, filename, start=datetime(2013, 1, 1).date()) - create a new (from the specified date) or replenish an existing database (filename.xlsx) with missing data up to the end of the previous month, returning the resulting dataframe
  • merge_and_export_DB(total_volume_df, df_weather, df_br_feature, path='', filename='DataBase.xlsx', update=True) - merge dataframes total_volume_df (Volume.xlsx), df_weather (Weather.xlsx), and df_br_feature (br_feature.xlsx) by the 'Date' column into one common database (by default, DataBase.xlsx)
  • database_volume_update(total_volume_df, path='', filename='DataBase.csv') - The function updates only the 'Volume' column in the selected database (DataBase.xlsx by default). A simplified version of the merge_and_export_DB function
Service functions:
  • get_empty_daily_df(date) - creates an empty dataframe (25 rows: from 0:00 to 24:00) for the specified date (for full temperature interpolation)
  • add_date_scalar(df) - adds additional categorical features to the dataframe: Day, Month, Year, WeekDay
  • is_check_DataBase(df) - checks database integrity
  • act_pred_reverse(df_br_feature) - replaces missing actual (Act) consumption and BR generation values with planned (Pred) values. This function is used to generate a forecast for the current day, when the actual ActCons and ActGen values are not yet available for the whole day
  • get_files_from_path(path='_raw_Data_TatEnergosbyt', remove=True, jupyter_widget=True) - retrieves operational data from the directory (/_raw_Data_TatEnergosbyt by default)
  • update_DataBase(total_oper_df, filename='DataBase.csv', common_check=True) - updates the database by adding operational data from total_oper_df
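The idea behind act_pred_reverse() can be shown with a minimal stand-alone sketch: for hours where actual BEM values have not yet been published, fall back to the planned ones. The column names mirror the ActCons/PredCons pair mentioned above, and the data is synthetic; the real function may differ in details.

```python
import numpy as np
import pandas as pd

# Synthetic BEM frame: actual consumption stops being available after 7 AM
df = pd.DataFrame({
    "Hour": [5, 6, 7, 8],
    "ActCons": [101.0, 99.5, np.nan, np.nan],
    "PredCons": [100.0, 100.0, 102.0, 103.0],
})

# Replace each missing actual value with the planned one for the same hour
df["ActCons"] = df["ActCons"].fillna(df["PredCons"])
print(df["ActCons"].tolist())
```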

Preprocessing

The preprocessing module (located in the preprocessing.py file) contains tools that prepare data before it is passed to the core functions (CatBoostRegressor):

  • get_type_day(df) - encoding the day type (TypeDay) based on the df.Date column of the DateTime format. The encoding is performed based on the industrial calendar of the Republic of Tatarstan
  • get_light(df) - encoding the light interval (Light) based on the df.Date column of the DateTime format. The encoding is performed based on the geographic location of the city of Kazan
  • get_season(df) - encoding seasonality based on the df.Date column of the DateTime format
  • prepareData(df, lag_start=1, lag_end=7) - data preprocessing function. Preprocessing includes: adding day type, light interval, seasonality, and energy consumption lag (default 1...7 days)
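The combination of calendar features and 1...7-day consumption lags can be sketched in a few lines of pandas. This is a self-contained illustration of the preprocessing steps described above on synthetic hourly data; prepareData() itself may implement them differently.

```python
import pandas as pd

# Ten days of synthetic hourly consumption data
idx = pd.date_range("2024-01-01", periods=24 * 10, freq="h")
df = pd.DataFrame({"Date": idx, "Volume": range(len(idx))})

# Categorical calendar features (cf. add_date_scalar)
df["WeekDay"] = df["Date"].dt.dayofweek
df["Month"] = df["Date"].dt.month

# Energy-consumption lags of 1..7 days (24 hourly rows per day)
for lag in range(1, 8):
    df[f"lag_{lag}"] = df["Volume"].shift(24 * lag)

# The first 7 days lack full lag history, so drop them
df = df.dropna()
print(len(df))
```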

Core

The core module (located in the core.py file) contains the main class of the library. It is based on CatBoostRegressor and provides a number of functions:

  • predict_volume(df_general, df_predict, max_depth, learn_period) - model training and energy consumption forecasting
  • get_df_predicted(df_general, max_depth, learn_period, model, date_start, date_end, jupyter_widget=True, disable_widget=False) - generates a data frame with the predicted energy consumption volume for the specified planning horizon
  • date_str_format(df_predicted) - generates a date string for the exported xlsx file
  • get_DAM_order(df_general, max_depth, learn_period, model, date_start, date_end, jupyter_widget=True, path) - generates a DAM order and exports it to xlsx format
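The (max_depth, learn_period) pair above controls the depth of the regressor's trees and how many days of history are used for training. A stdlib-only sketch of how a training window might be selected for a given forecast date (the function name and the exclusive upper bound are assumptions for illustration, not the library's implementation):

```python
from datetime import date, timedelta

def training_window(forecast_date: date, learn_period: int):
    """Return the [start, end) date range of the training sample."""
    end = forecast_date  # exclusive: the forecast day itself is not trained on
    start = end - timedelta(days=learn_period)
    return start, end

# One year of history before a forecast for 1 June 2024
start, end = training_window(date(2024, 6, 1), learn_period=365)
print(start.isoformat(), end.isoformat())
```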

Validating

The validating helper module (located in the validating.py file) is designed to validate the core functions:

  • get_df_val_predicted(df_general, df_general_date_index, max_depth, learn_period, model, date_start, date_end, jupyter_widget=True, disable_widget=False) - function for generating a dataframe with predicted energy consumption volumes for the specified planning horizon, adapted for validation calculations (simulating the absence of 'ActCons' and 'ActGen' data after 7 AM, offline access to the weather forecast and BEM data)
  • get_df_validate(df_general, df_general_date_index, max_depth, learn_period, model, date_start, date_end, logging=True, jupyter_widget=True) - function for validating the model for the specified time interval. Returns a validation dataframe with predicted values.
  • get_df_validate_with_loss(df_validate_result, df_RSV_vs_BR_rate) - adds a 'loss' column with BEM losses to the resulting dataframe.
  • draw_diff_predict_vs_fact(df_validate_result, fontsize=12, font='Palatino Linotype') - outputs validation results in table and graph form (Matplotlib object).
  • Grid_Search(df_general, df_general_date_index, max_depth_grid, learn_period_grid, model, date_start, date_end, jupyter_widget=True) - grid search for optimal training period and tree depth.
  • search_result_highlighting(df_search_result) - highlights the search results of the Grid_Search() function.
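Grid_Search() above performs an exhaustive search: every (max_depth, learn_period) pair from the two grids is scored and the best combination kept. The skeleton of such a search, with a toy error function standing in for the real validation run (the real scorer would call the validation functions and measure forecast error):

```python
from itertools import product

max_depth_grid = [4, 6, 8]
learn_period_grid = [180, 365, 730]

def toy_validation_error(max_depth, learn_period):
    # Stand-in for training the model and measuring validation error;
    # deliberately minimised at (6, 365) for this illustration
    return abs(max_depth - 6) + abs(learn_period - 365) / 365

# Score every combination and keep the one with the lowest error
best = min(product(max_depth_grid, learn_period_grid),
           key=lambda pair: toy_validation_error(*pair))
print(best)
```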

EDA

The EDA graphics module (located in the EDA.py file) is designed to display the results of exploratory data analysis and includes the following functions:
Utility functions:

  • save_fig_to_pdf(plt, path, filename) - exports the rendered figure to PDF format
  • rus_features(df, order='columns') - russification of dataframes for diagrams and graphs
  • cat_codes_data(df_general, model, translation) - russification of dataframes for charts and graphs
  • build_and_fit_cat_selector(df_general_cat, max_depth) - creates and trains a CatBoost feature selector
  • get_shap_values(selector, df_general_cat) - computes SHAP values for the CatBoost regressor with SHAP.Explainer
  • draw_learning_curve(df_general, max_depth, model, fontsize=15) - calculates and plots the learning curve
Visualisation functions:
  • draw_learning_curve(df_general, max_depth, model, fontsize, font, save_fig_dir) - calculates and plots the learning curve
  • graf_full_consumption(df, groupby_months, fontsize, font, save_fig_dir) - plots a year-by-year energy consumption graph (with or without grouping by month)
  • graf_year_consumption(df, year, groupby_days, fontsize, font, save_fig_dir) - plots an annual energy consumption chart traced by month (with or without grouping by days)
  • graf_temp_cons_corr(df, year, fontsize, font, save_fig_dir) - plots average daily electricity consumption against average daily temperature for a given year
  • graf_month_consumption(df, month, year, fontsize, font, save_fig_dir) - plots a monthly energy consumption graph traced by day of the week
  • draw_features_correlation_heatmaps(df_general, season, fontsize, font, translation, save_fig_dir) - plots a heat map of Pearson correlations between the target and source features
  • draw_hour_of_day_correlation_heatmaps(df_general, fontsize, font, translation, save_fig_dir) - plots a heat map of hourly Pearson correlations for the original features
  • draw_bar_CatBoost_feature_importances(selector, df_general_cat, fontsize, font, translation, save_fig_dir) - plots a feature importance diagram obtained with the built-in selector of the CatBoost regressor
  • draw_bar_SHAP_feature_importances(shap_values, fontsize, font, translation, save_fig_dir) - plots a feature importance diagram obtained with SHAP.Explainer
  • plot_and_save_shap_scatter(feature_name, df_general_cat, shap_values, interaction_index, fontsize, font, translation, save_fig_dir, filename) - plots shap.dependence_plot for the selected feature
  • draw_example_decision_tree(df_general_cat, max_depth, tree_idx) - renders an example decision tree of the CatBoost regressor
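The correlation heat maps above are built from pairwise Pearson coefficients, which pandas computes directly. A tiny stand-alone example with synthetic data standing in for the temperature and consumption columns (the negative coefficient reflects the heating-driven relationship the library visualises):

```python
import pandas as pd

# Synthetic daily averages: consumption falls as the temperature rises
df = pd.DataFrame({
    "Temp":   [-10, -5, 0, 5, 10, 15],
    "Volume": [120, 110, 100, 95, 90, 85],
})

# Pairwise Pearson correlation matrix, as used for the heat maps
corr = df.corr(method="pearson")
print(round(corr.loc["Temp", "Volume"], 2))
```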


Download files

Download the file for your platform.

Source Distribution

stlflib-1.1.2.tar.gz (41.5 kB)


Built Distribution


stlflib-1.1.2-py3-none-any.whl (40.9 kB)


File details

Details for the file stlflib-1.1.2.tar.gz.

File metadata

  • Download URL: stlflib-1.1.2.tar.gz
  • Upload date:
  • Size: 41.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.7

File hashes

Hashes for stlflib-1.1.2.tar.gz:

  • SHA256: 74f3b100691ac7d736ab4ee3a7ab22f234b1937350c285d6cfb5a159e952349b
  • MD5: 867063488d3c119f12ae21b7669a64ef
  • BLAKE2b-256: fa56fd3b892a584aa38948e561b6f4d5ea9cd9263c3546fb804a1c2bf35d90c1


File details

Details for the file stlflib-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: stlflib-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 40.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.7

File hashes

Hashes for stlflib-1.1.2-py3-none-any.whl:

  • SHA256: 353f31d593c392dd8831a1e835721ca664fddcbc3603a5ab3efbef66e995fe23
  • MD5: d215eaedebf73510be141fe12d5fe106
  • BLAKE2b-256: b76405c9b7cd3d617f8ec080162f815cab4d6d7a0dcc4df99dbb463e0102b96a

