A library that makes Evolutionary Strategies (ES) simple to use.
Project description
EvoStrat
A library that makes Evolutionary Strategies (ES) simple to use.
Installation
pip install evostrat
Usage
pop = PopulationImpl(...) # See complete examples for implementations.
optim = torch.optim.Adam(pop.parameters()) # Use any torch.optim optimizer
for i in range(N):
optim.zero_grads()
pop.fitness_grads(n_samples=200) # Computes approximate gradients
optim.step()
For complete examples that solves 'LunarLander-v2' see the examples/.
Description
Evolutionary Strategies is a powerful approach to solve reinforcement learning problems and other optimization problems where the gradients cannot be computed with backprop. See "Evolution strategies as a scalable alternative to reinforcement learning" for an excellent introduction.
In ES the objective is to maximize the expected fitness of a distribution over individuals, referred to as the population. With a few math tricks this objective can be maximized with gradient ascent, even if the fitness function itself is not differentiable.
This library offers
- A flexible and natural interface for ES that cleanly separates the environment, the reinforcement learning agent, the population distribution and the optimization.
- A plug-and-play approach for reinforcement learning agents with
torch.nn.Module
policy networks. See examples/lunar_lander.py and examples/normal_lunar_lander.py. - Several population distributions and variants
- Independent Normal. equivalent to OpenAI ES or PEPG depending on whether the standard deviation is fixed or learned. See examples/normal_lunar_lander.py
- Multivariate Normal with a full covariance matrix. Similar to CMA-ES. See examples/multivariate_normal_lunar_lander.py
- Categorical. For agents with categorical parameters, demonstrating the ability to handle non-normal distributions. See the examples/binary_lunar_lander.py.
- A simple interface for creating your own populations, without having to derive any gradients! Just subclass Population and implement the sampling process. See the built in populations for inspiration.
Attribution
If you use this software in your academic work please cite
@misc{palm2020,
author = {Palm, Rasmus Berg},
title = {EvoStrat},
year = {2020},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/rasmusbergpalm/evostrat}}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file evostrat-1.4.3.tar.gz
.
File metadata
- Download URL: evostrat-1.4.3.tar.gz
- Upload date:
- Size: 10.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 119bef99e980452a57873734c1ab6fd0d10597c34ed1a2c470574ea7c3335623 |
|
MD5 | cd539781dacfc8b122d93b06ffc33caf |
|
BLAKE2b-256 | 0d99ab3e420640ae76e62d906959ed8167abeb274bb9ee0155dceba588b49870 |
File details
Details for the file evostrat-1.4.3-py3-none-any.whl
.
File metadata
- Download URL: evostrat-1.4.3-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 80491f0165568b476ab61e02d687d130ff2f088fdd2b9c868ffaea63b1466679 |
|
MD5 | 09c5b3e493530d53762475c3ab37bc62 |
|
BLAKE2b-256 | 71bdc0f32d41d3cab2ee792af1346d365e567c3b30f62f80265a4399d2cd392b |