Library for Fast Offline RL Analysis with Minimum Dependencies
Project description
lightRaven -- Offline RL with Maximum Speed
This library provides convenient tools for people to create their own seldonian algorithms with optimum performance. A detailed example is also included in dynamic_training.ipynb
. Performance test is in ci_performance.ipynb
.
Dependencies
gym==0.17.3
numpy==1.19.1
scipy==1.5.2
numba == 0.51.2
Supplementary Materials
- Definition of Seldonian Framework
- Definition of different Importance Sampling estimators
- Definition of the new concentration bound
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
lightRaven-0.1.0.tar.gz
(16.0 kB
view hashes)
Built Distribution
lightRaven-0.1.0-py3-none-any.whl
(23.5 kB
view hashes)
Close
Hashes for lightRaven-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d00fee161046e6065ec5c3b0312e929b59133007840f6c107c6740d8dce7db11 |
|
MD5 | 2f5a5cea7bb50ec4d3ff601b613ae6cd |
|
BLAKE2b-256 | 06ecaa809461703f0306ef91e94535541338edc439c0bb42964d9363ad36be84 |