The Cross-Entropy Method for either rare-event sampling or optimization.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

The Cross Entropy Method

The Cross Entropy Method (CE or CEM) is an approach for optimization or rare-event sampling in a given class of distributions {D_p} and a score function R(x).

In its sampling version, it is given a reference p0 and aims to sample from the tail of the distribution x ~ (D_p0 | R(x)<q), where q is defined as either a numeric value q or a quantile alpha (where q=q_alpha(R)).
In its optimization version, it aims to find argmin_x{R(x)}.

The exact implementation of the CEM depends on the distributions family {D_p} as defined in the problem. This repo provides a general implementation as an abstract class, where a concrete use requires writing a simple, small inherited class. The attached tutorial.ipynb provides a more detailed background on the CEM and on this package, along with usage examples.

Installation: pip install cross-entropy-method.


CEM for sampling (left): the mean of the sample distribution (blue) aims to coincide with the mean of the tail of the original distribution (black). CEM for optimization (right): the mean of the sample distribution aims to be minimized. (images from `tutorial.ipynb`)

Supporting non-stationary score functions

On top of the standard CEM, we also support a non-stationary score function R. This affects the reference distribution of scores and thus the quantile threshold q (if specified as a quantile). Thus, we have to repeatedly re-estimate q, using importance-sampling correction to compensate for the CEM distributional shift.

In our separate work, we demonstrate the use of the CEM for the more realistic problem of sampling high-risk environment-conditions in risk-averse reinforcement learning. There, D_p determines the distribution of the environment-conditions, p0 corresponds to the original distribution (or test distribution), and R(x; agent) is the return function of the agent given the conditions x. Note that since the agent evolves with the training, the score function is indeed non-stationary.

Cite this repo

@misc{cross_entropy_method,
  title={Cross Entropy Method with Non-stationary Score Function},
  author={Ido Greenberg},
  howpublished={\url{https://pypi.org/project/cross-entropy-method/}},
  year={2022}
}

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.1.1

Nov 2, 2022

0.1.0

Oct 10, 2022

0.0.9

Sep 13, 2022

0.0.8

Aug 14, 2022

0.0.7

Aug 13, 2022

This version

0.0.6

Aug 9, 2022

0.0.5

Jul 31, 2022

0.0.4

Jul 8, 2022

0.0.3

Jun 8, 2022

0.0.2

Jun 6, 2022

0.0.1

May 18, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cross-entropy-method-0.0.6.tar.gz (11.6 kB view hashes)

Uploaded Aug 9, 2022 Source

Built Distribution

cross_entropy_method-0.0.6-py3-none-any.whl (17.8 kB view hashes)

Uploaded Aug 9, 2022 Python 3

Hashes for cross-entropy-method-0.0.6.tar.gz

Hashes for cross-entropy-method-0.0.6.tar.gz
Algorithm	Hash digest
SHA256	`b3e158e7e584a8ddb3025a0625b720fb8af50c9f2fedbd6bbb597ee0610354f3`
MD5	`b1f68d792a046382052e6d5c8ce58f67`
BLAKE2b-256	`94b062070ff8d3a1cb8416db069b9ebb8707c224fad006098e3947d77f7b4c37`

Hashes for cross_entropy_method-0.0.6-py3-none-any.whl

Hashes for cross_entropy_method-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8fd5acf56aa1c08aeb5d41300d97acae532b8fbb2318b99b51eda5bf2233e81c`
MD5	`df72d75f653127c2086cbf9773696e80`
BLAKE2b-256	`cbbc5aa92bc4095f1113b0b72cdeba37863d6e47629cefa62c67177c7e346966`