Skip to main content

Sample in hypercubes, select diverse subsets, and measure diversity

Project description

Description

diversipy is a collection of algorithms dealing with three different but related topics. The first topic is super-uniform sampling of the unit hypercube. ‘Super-uniform’ in this context means that the obtained point sample should be more uniform than a random uniform sample, which is a desirable property in many applications. One such application is the design of computer experiments, where typically space-filling experimental designs are used. After creation, the samples can be transformed from the unit hypercube to arbitrary cuboids.

The task of subset selection is defined as follows: suppose you have a set of points in R^n and want to select a sample of them distributed as uniformly as possible. This may be necessary because the original set is too large to be processed entirely. The selection problem is related to clustering, with the difference that when using clustering, you usually want to retain the structure of the original point set.

Once one has created (or obtained from somewhere) a point set, one may want to assess its properties. Therefore, diversipy contains several functions to measure diversity and a few related concepts. Several different indicators are offered because they have different advantages and disadvantages (in terms of run time and what they measure).

Example

>>> from diversipy import *
>>> design = transform_spread_out(lhd_matrix(50, 2)) # create latin hypercube design
>>> subset = psa_select(design, 10) # select subset, for whatever reason
>>> unanchored_L2_discrepancy(subset) # calculate discrepancy

Note that points are stored row-wise, in accordance with numpy convention.

Documentation

The documentation is located at https://www.simonwessing.de/diversipy/doc/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diversipy-0.9.tar.gz (27.7 kB view details)

Uploaded Source

File details

Details for the file diversipy-0.9.tar.gz.

File metadata

  • Download URL: diversipy-0.9.tar.gz
  • Upload date:
  • Size: 27.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.11.2

File hashes

Hashes for diversipy-0.9.tar.gz
Algorithm Hash digest
SHA256 165a31e946aa360d49b7dc1470524e2da9582aa774490cb934294040f7222f9c
MD5 11f1f2aac07cae160a5918599ce9d18d
BLAKE2b-256 53f6136e03239c2205497b482b04e05c961add0638e92d26b0d13b6499fedb40

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page