The goal of sinergym is to create an environment following OpenAI Gym interface for wrapping simulation engines for building control using deep reinforcement learning.
Project description
Sinergym
Welcome to Sinergym!
The goal of this project is to create an environment following Gymnasium interface for wrapping simulation engines (EnergyPlus) for building control using deep reinforcement learning or any external control.
For more information about Sinergym, please visit our documentation.
To report questions and issues, please use our issue tracker. We appreciate your feedback and contributions. Check out our CONTRIBUTING.md for more details on how to contribute.
The main functionalities of Sinergym are the following:
-
Simulation Engine Compatibility: Uses EnergyPlus Python API for Python-EnergyPlus communication. Future plans include more engines like OpenModelica.
-
Benchmark Environments: Designs environments for benchmarking and testing deep RL algorithms or other external strategies, similar to Atari or Mujoco.
-
Customizable Environments: Allows easy modification of experimental settings. Users can create their own environments or modify pre-configured ones in Sinergym.
-
Customizable Components: Enables creation of new custom components for new environments, making Sinergym scalable, such as function rewards, wrappers, controllers, etc.
-
Automatic Building Model Adaptation: Sinergym automates the process of adapting the building model to user changes in the environment definition.
-
Automatic Actuators Control: Controls actuators through the Gymnasium interface based on user specification, only actuators names are required and Sinergym will do the rest.
-
Extensive Environment Information: Provides comprehensive information about Sinergym background components from the environment interface.
-
Stable Baseline 3 Integration: Customizes functionalities for easy testing of environments with SB3 algorithms, such as callbacks and customizable training real-time logging. However, Sinergym is agnostic to any DRL algorithm.
-
Google Cloud Integration: Offers guidance on using Sinergym with Google Cloud infrastructure.
-
Weights & Biases Compatibility: Automates and facilitates training, reproducibility, and comparison of agents in simulation-based building control problems. WandB assists in managing and monitoring model lifecycle.
-
Notebook Examples: Provides code in notebook format for user familiarity with the tool.
-
Extensive Documentation, Unit Tests, and GitHub Actions Workflows: Ensures Sinergym is an efficient ecosystem for understanding and development.
-
And much more!
This is a project in active development. Stay tuned for upcoming releases.
Project Structure
This repository is organized into the following directories:
sinergym/
: Contains the source code for Sinergym, including the environment, modeling, simulator, and tools such as wrappers and reward functions.docs/
: Online documentation generated with Sphinx and using Restructured Text (RST).examples/
: Jupyter notebooks illustrating use cases with Sinergym.tests/
: Unit tests for Sinergym to ensure stability.scripts/
: Scripts for various tasks such as agent training and performance checks, allowing configuration using JSON format.
Available Environments
For a complete and up-to-date list of available environments, please refer to our documentation.
Installation
Please visit INSTALL.md for detailed installation instructions.
Usage example
If you used our Dockerfile during installation, you should have the try_env.py file in your workspace as soon as you enter in. In case you have installed everything on your local machine directly, place it inside our cloned repository. In any case, we start from the point that you have at your disposal a terminal with the appropriate python version and Sinergym running correctly.
Sinergym uses the standard Gymnasium API. So a basic loop should look like:
import gymnasium as gym
import sinergym
# Create the environment
env = gym.make('Eplus-datacenter-mixed-continuous-stochastic-v1')
# Initialize the episode
obs, info = env.reset()
truncated = terminated = False
R = 0.0
while not (terminated or truncated):
a = env.action_space.sample() # random action selection
obs, reward, terminated, truncated, info = env.step(a) # get new observation and reward
R += reward
print('Total reward for the episode: %.4f' % R)
env.close()
A folder will be created in the working directory after creating the environment. It will contain the Sinergym outputs produced during the simulation.
For more examples and details, please visit our usage examples documentation section.
Google Cloud Platform support
For more information about this functionality, please, visit our documentation here.
Projects using Sinergym
The following are some of the projects benefiting from the advantages of Sinergym:
:pencil: If you want to appear in this list, do not hesitate to send us a PR and include the following badge in your repository:
Repo Activity
Citing Sinergym
If you use Sinergym in your work, please cite our paper:
@inproceedings{2021sinergym,
title={Sinergym: A Building Simulation and Control Framework for Training Reinforcement Learning Agents},
author={Jiménez-Raboso, Javier and Campoy-Nieves, Alejandro and Manjavacas-Lucas, Antonio and Gómez-Romero, Juan and Molina-Solana, Miguel},
year={2021},
isbn = {9781450391146},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3486611.3488729},
doi = {10.1145/3486611.3488729},
booktitle = {Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation},
pages = {319–323},
numpages = {5},
}
For more detailed information, please visit our documentation.
Installation
Sinergym relies on several dependencies, the specifics of which vary by version. The table below provides a summary of the versions supported by Sinergym across its releases:
Sinergym version | Ubuntu version | Python version | EnergyPlus version | Building model file |
---|---|---|---|---|
0.0 | 18.04 LTS | 3.6 | 8.3.0 | IDF |
1.1.0 | 18.04 LTS | 3.6 | 9.5.0 | IDF |
1.7.0 | 18.04 LTS | 3.9 | 9.5.0 | IDF |
1.9.5 | 22.04 LTS | 3.10.6 | 9.5.0 | IDF |
2.4.0 | 22.04 LTS | 3.10.6 | 9.5.0 | epJSON |
2.5.0 | 22.04 LTS | 3.10.6 | 23.1.0 | epJSON |
3.3.6 | 24.04 LTS | 3.12.3 | 23.1.0 | epJSON |
3.5.8 | 24.04 LTS | 3.12.3 | 24.1.0 | epJSON |
We recommend always using the latest version of Sinergym that is supported by the container. This approach helps you avoid the complexities of the installation process. However, if you prefer to manually install Sinergym on your computer, we provide the necessary documentation in the subsequent sections.
Docker container
We provide a Dockerfile to install all dependencies and prepare the image for running Sinergym. This is the recommended option, since it ensures that all dependencies and versions are correctly installed and configured.
In essence, this Dockerfile installs the compatible operating system, EnergyPlus, Python, and Sinergym, along with the necessary dependencies for its proper functioning. If you have cloned the repository, it can be used as follows:
$ docker build -t <tag_name> .
However, Sinergym has a set of optional dependencies that enhance the tool's usage. These dependencies can be installed in the following way when building the image:
$ docker build -t <tag_name> --build-arg SINERGYM_EXTRAS="drl notebooks gcloud" .
These optional dependencies allow you to use stable-baselines3, wandb, notebooks or gcloud directly.
For more information, please refer to the pyproject.toml
file at the root of the repository
([tool.poetry.extras]
section). If you desire to install all optional
packages, you can use extras
directly in the SINERGYM_EXTRAS
argument.
:memo: Note: Our container can also be directly installed from the Docker Hub repository. It contains all the project's releases with secondary dependencies or lite versions.
Once the container image is ready, you can execute any command as follows:
$ docker run -it --rm <tag_name> <command>
By default, the command executed is python scripts/try_env.py
, which is a minimal working example.
If you want to run a DRL experiment, for example, you can do it like this:
$ docker build -t example/sinergym:latest --build-arg SINERGYM_EXTRAS="drl" .
$ docker run -e WANDB_API_KEY=$WANDB_API_KEY -it --rm example/sinergym:latest python scripts/train/train_agent.py -conf scripts/train/train_agent_PPO.json
If the script you want to use requires a WandB account, remember to include the environment variable in the container with the token.
It’s also possible to keep a session open in the image to copy and run your own scripts. For more information, please refer to the Docker documentation. This can be useful when you want to run your own scripts in the container.
$ docker run -it <tag_name> /bin/bash
:memo: Note: For Visual Studio Code users, simply open the root directory and click on the Reopen in container pop-up button. This action will automatically install all dependencies and enable you to run Sinergym in an isolated environment. For more details on this feature, refer to the VSCode Containers extension documentation.
Manual installation
If you prefer not to use containers and have everything installed natively on your system, we’ll explain how to do it.
First, make sure that you meet the compatibility matrix; otherwise, no support or guarantees of functionality will be provided by the development team.
Configure Python Environment
Start by installing the desired version of Python and pip. It is recommended to set up a working environment for Python (multiple options are available). Finally, install the necessary dependencies for Sinergym in that environment:
$ pip install sinergym
You can also install the optional packages from here, just like in the Docker container:
$ pip install sinergym[extras]
If you want to install the cloned repository directly, you can do so by running the following command in its root.
$ poetry install --no-interaction --only main --extras <optional_extras>
# or
$ pip install .[<optional_extras>]
With this, you have the correct Python version and the necessary modules to run Sinergym. Let's proceed with the installation of other required programs outside of Python to run the simulations.
Install EnergyPlus 24.1.0
You need to install EnergyPlus. We have tested and confirmed compatibility
with version 24.1.0
. The code might work with other versions, but we
have not tested them.
Follow the instructions here to install
it for Linux (we only support Ubuntu). You can choose any location for the
installation. After installation, a folder named Energyplus-24-1-0
should
appear in the chosen location.
Include Energyplus Python API in Python Path
Sinergym uses the Energyplus Python API as its backend. The modules of this
API are located in the Energyplus folder that you installed in the previous
step. You must add this installation path to the PYTHONPATH
environment
variable so that the interpreter can access these modules.
Develop in Sinergym
Whether you have chosen to use Docker or a manual installation, we offer facilities for developing and extending Sinergym.
If you've used a container, Visual Studio Code will set up a development environment with all
the necessary packages automatically, including documentation, tests, DRL, etc integrated in the
IDE. If you've opted to use a container without Visual Studio Code, you can use the Dockerfile available in the
.devcontainer
folder instead of the one in the root of the repository. If you are creating your own
Dockerfile, make sure to perform the following installation so that all development modules
are available:
RUN poetry install --no-interaction
The default installation includes all development packages. To avoid this, you should specify
--only main
or --without <develop_groups>
. The development groups can also be found
in pyproject.toml
.
If you have manually installed the project, you can install the development packages from Poetry in the same way. Once the repository is cloned, run the same command we explained earlier, adding the following:
$ poetry install --no-interaction
As you can see, the command is the same as the one shown in the manual installation section, but without specifying groups or extras, so that all development packages are installed. In this case, it is not possible to use pip because it does not include information about development dependencies (except those listed in extras).
:memo: For more information on how Poetry dependencies work, visit the official documentation.
Verify Installation
This project is automatically monitored using tests specifically developed for it.
To verify that Sinergym has been installed correctly, execute pytest tests/ -vv
in the repository root. Remember to have installed test extra packaging to use it.
Furthermore, each time the Sinergym repository is updated, the tests are automatically executed in a remote container
built using the Dockerfile. This task is performed by Github Action
(refer to the Github Actions
section for more details).
Cloud Computing
You also have the option to run your experiments in the Cloud. We utilize Google Cloud
for this purpose. Our team is working on setting up an account to run our Sinergym container
with remote storage and Weights&Biases tracking.
For more information on installation and preparing the Google Cloud SDK to run your experiments,
please visit our Preparing Google Cloud
section.
Contributor Covenant Code of Conduct
Our Pledge
We as members, contributors, and leaders pledge to make participation in our community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.
We pledge to act and interact in ways that contribute to an open, welcoming, diverse, inclusive, and healthy community.
Our Standards
Examples of behavior that contributes to a positive environment for our community include:
- Demonstrating empathy and kindness toward other people
- Being respectful of differing opinions, viewpoints, and experiences
- Giving and gracefully accepting constructive feedback
- Accepting responsibility and apologizing to those affected by our mistakes, and learning from the experience
- Focusing on what is best not just for us as individuals, but for the overall community
Examples of unacceptable behavior include:
- The use of sexualized language or imagery, and sexual attention or advances of any kind
- Trolling, insulting or derogatory comments, and personal or political attacks
- Public or private harassment
- Publishing others' private information, such as a physical or email address, without their explicit permission
- Other conduct which could reasonably be considered inappropriate in a professional setting
Enforcement Responsibilities
Community leaders are responsible for clarifying and enforcing our standards of acceptable behavior and will take appropriate and fair corrective action in response to any behavior that they deem inappropriate, threatening, offensive, or harmful.
Community leaders have the right and responsibility to remove, edit, or reject comments, commits, code, wiki edits, issues, and other contributions that are not aligned to this Code of Conduct, and will communicate reasons for moderation decisions when appropriate.
Scope
This Code of Conduct applies within all community spaces, and also applies when an individual is officially representing the community in public spaces. Examples of representing our community include using an official e-mail address, posting via an official social media account, or acting as an appointed representative at an online or offline event.
Enforcement
Instances of abusive, harassing, or otherwise unacceptable behavior may be reported to the community leaders responsible for enforcement at [INSERT CONTACT METHOD]. All complaints will be reviewed and investigated promptly and fairly.
All community leaders are obligated to respect the privacy and security of the reporter of any incident.
Enforcement Guidelines
Community leaders will follow these Community Impact Guidelines in determining the consequences for any action they deem in violation of this Code of Conduct:
1. Correction
Community Impact: Use of inappropriate language or other behavior deemed unprofessional or unwelcome in the community.
Consequence: A private, written warning from community leaders, providing clarity around the nature of the violation and an explanation of why the behavior was inappropriate. A public apology may be requested.
2. Warning
Community Impact: A violation through a single incident or series of actions.
Consequence: A warning with consequences for continued behavior. No interaction with the people involved, including unsolicited interaction with those enforcing the Code of Conduct, for a specified period of time. This includes avoiding interactions in community spaces as well as external channels like social media. Violating these terms may lead to a temporary or permanent ban.
3. Temporary Ban
Community Impact: A serious violation of community standards, including sustained inappropriate behavior.
Consequence: A temporary ban from any sort of interaction or public communication with the community for a specified period of time. No public or private interaction with the people involved, including unsolicited interaction with those enforcing the Code of Conduct, is allowed during this period. Violating these terms may lead to a permanent ban.
4. Permanent Ban
Community Impact: Demonstrating a pattern of violation of community standards, including sustained inappropriate behavior, harassment of an individual, or aggression toward or disparagement of classes of individuals.
Consequence: A permanent ban from any sort of public interaction within the community.
Attribution
This Code of Conduct is adapted from the Contributor Covenant, version 2.0, available at https://www.contributor-covenant.org/version/2/0/code_of_conduct.html.
Community Impact Guidelines were inspired by Mozilla's code of conduct enforcement ladder.
For answers to common questions about this code of conduct, see the FAQ at https://www.contributor-covenant.org/faq. Translations are available at https://www.contributor-covenant.org/translations.
MIT License
Copyright (c) 2021 Javier Jiménez
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sinergym-3.6.2.tar.gz
.
File metadata
- Download URL: sinergym-3.6.2.tar.gz
- Upload date:
- Size: 3.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 285b0f22e085643ae5714051e00b824c733e2be97287affb550330c6b0e1c09f |
|
MD5 | 36e6f13362173c3b9dce745a66c2897b |
|
BLAKE2b-256 | 4cb3eefbcea1ed928d8fb62fd9098c4058d4d10a1e5fc810248ae7d440f131c6 |
File details
Details for the file sinergym-3.6.2-py3-none-any.whl
.
File metadata
- Download URL: sinergym-3.6.2-py3-none-any.whl
- Upload date:
- Size: 3.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 289222c2472ae6c68d9b7a27f714bbb012a37854881c89966835360a378e8054 |
|
MD5 | d0b965cd7853a5f7fc0c028389f1fc94 |
|
BLAKE2b-256 | dca1b461968082b908dd7ef26c4ee2c4a1c4c8a0b9e1eff5f4e8ed6db0dda183 |