Skip to main content

A python module desgined for Offline RL algorithms developing and benchmarking.

Project description

OfflineRL-Lib

🚧 This repo is not ready for release, benchmarking is ongoing. 🚧

OfflineRL-Lib provides unofficial and benchmarked PyTorch implementations for selected OfflineRL algorithms, including:

Benchmark Results

When certain design choices, e.g. the choice of autodiff backend (jax or tf or pytorch) vary, the preference for each hyper-parameters may vary as well. Hence when benchmarking, we tested each algorithm's performace in three ways:

  • Paper Performance: the performance reported in white paper;
  • OfflineRL-Lib (with paper args): the performance obtained by using OfflineRL-Lib implementation and the configs in paper or original implementations;
  • OfflineRL-Lib (with CORL args): the performance obtained by using OfflineRL-Lib implementation and the configs in CORL.

For the last option, arguments are directly borrowed from CORL. CORL provides simplified single-file implementations of these algorithms as well as their finetuned hyper-parameters based on pytorch, please check their repo as well.

XQL :page_facing_up: :chart_with_upwards_trend:

Task Dataset Quality Paper Performance
(consistent)
Paper Performance
(tuned)
OfflineRL-Lib
(paper args)
(consistent)
OfflineRL-Lib
(paper args)
(tuned)
halfcheetah random-v2NANANANA
medium-v247.748.3NA47.9±0.2
medium-replay-v244.845.2NA44.3±0.4
medium-expert-v289.894.2NA92.1±1.0
hopper random-v2NANANANA
medium-v271.174.2NA67.0±6.8
medium-replay-v297.3100.7NA96.9±6.2
medium-expert-v2107.1111.2NA101.9±5.2
walker2d random-v2NANANANA
medium-v281.584.2NA83.8±0.4
medium-replay-v275.982.2NA76.5±5.2
medium-expert-v2110.1112.7NA110.1±0.4

IQL :page_facing_up: :chart_with_upwards_trend:

Task Dataset Quality Paper Performance OfflineRL-Lib
(with paper args)
OfflineRL-Lib
(with CORL args)
halfcheetah random-v2NA9.4±3.913.5±3.9
medium-v247.447.3±0.248.6±0.2
medium-replay-v244.243.7±0.744.3±0.4
medium-expert-v286.789.7±2.993.9±1.6
full-replay-v2NA73.5±0.874.9±0.3
expert-v2NA94.8±0.495.7±2.6
hopper random-v2NA7.9±0.37.3±0.1
medium-v266.364.8±7.254.5±1.6
medium-replay-v294.793.4±7.941.5±23.0
medium-expert-v291.597.8±9.0108.1±3.1
full-replay-v2NA104.5±6.0106.3±1.0
expert-v2NA110.1±0.8103.8±7.9
walker2d random-v2NA6.0±1.03.1±0.9
medium-v278.383.5±2.281.3±8.7
medium-replay-v273.966.6±16.277.0±7.3
medium-expert-v2109.6108.9±2.5112.4±0.8
full-replay-v2NA92.9±3.599.2±0.7
expert-v2NA109.7±0.3112.6±0.4

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

offlinerllib-0.0.3.tar.gz (21.3 kB view details)

Uploaded Source

File details

Details for the file offlinerllib-0.0.3.tar.gz.

File metadata

  • Download URL: offlinerllib-0.0.3.tar.gz
  • Upload date:
  • Size: 21.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.2

File hashes

Hashes for offlinerllib-0.0.3.tar.gz
Algorithm Hash digest
SHA256 52c2fe37f7119cd61ca8dd02d81d9e2a05a36155ad5a61866e00c41a5d3d904a
MD5 886d30ee4262bba3e54c932cbb875078
BLAKE2b-256 702e3304079852331441a87c10189e241b7a447419bd48b6af87e8b319eb15bc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page