edo · PyPI

Generating artificial datasets through evolution.

These details have not been verified by PyPI

Project links

Homepage

Project description

https://github.com/daffidwilde/edo/workflows/CI/badge.svg

https://img.shields.io/badge/code%20style-black-000000.svg

Evolutionary Dataset Optimisation

A library for generating artificial datasets through evolution.

The edo library provides an evolutionary algorithm that optimises any real-valued function over a subset of the space of all possible datasets that we call Evolutionary Dataset Optimisation. The output of the algorithm is a bank of effective datasets for which the provided function performs well that can then be studied.

The applications of this method are varied but an important and relevant one is in learning an algorithm’s strengths and weaknesses.

When determining the quality of an algorithm, the standard route is to run the comparable algorithms on a finite set of existing (or newly simulated) datasets and calculating some metric. The algorithm(s) with the smallest value of this metric are chosen to be the best performing.

An issue with this approach is that it pays little regard to the reliability and quality of the datasets being used, which begs the question: what makes a dataset “good” for an algorithm? Or, why is it that an algorithm performs well on some datasets but not others?

By passing the objective function of the algorithm to the edo.DataOptimiser class, questions like these can be answered by studying the properties of the resultant datasets. Beyond that, a combination of objective functions could be used to determine how an algorithm performs against any number of other algorithms. A comprehensive description of the evolutionary algorithm and an examplar case study is available at https://doi.org/10.1007/s10489-019-01592-4.

Installation

The edo library requires Python 3.6+ and is pip-installable:

$ python -m pip install edo

To install from source then clone the GitHub repo:

$ git clone https://github.com/daffidwilde/edo.git
$ cd edo
$ python setup.py install

A command line tool has been developed to make using edo for larger experiments easier: https://github.com/daffidwilde/edolab

Publications and documentation

Full documentation for the library is available at https://edo.readthedocs.io.

An article on the theory behind the algorithm has been published:

Wilde, H., Knight, V. & Gillard, J. Evolutionary dataset optimisation: learning algorithm quality through evolution. Appl Intell 50, 1172-1191 (2020). https://doi.org/10.1007/s10489-019-01592-4

Citation instructions

Citing the library

Please use the following to cite the library:

@misc{edo-library,
    author = {{The EDO library developers}},
    title = {edo: <RELEASE TITLE>},
    year = <RELEASE YEAR>,
    doi = {<DOI INFORMATION>},
    url = {http://doi.org/<DOI INFORMATION>}
}

To check the relevant details (i.e. RELEASE TITLE, RELEASE YEAR and DOI NUMBER) head to the library’s Zenodo page:

Citing the paper

If you wish to cite the paper, then use the following:

@article{edo-paper,
    title = {Evolutionary dataset optimisation: learning algorithm quality
             through evolution},
    author = {Wilde, Henry and Knight, Vincent and Gillard, Jonathan},
    journal = {Applied Intelligence},
    year = 2020,
    volume = 50,
    pages = {1172--1191},
    doi = {10.1007/s10489-019-01592-4},
}

Contributing to the library

Contributions are always welcome whether they come in the form of providing a fix for a current issue, reporting a bug or implementing an enhancement to the library code itself. Pull requests (PRs) will be reviewed and collaboration is encouraged.

To make a contribution via a PR, follow these steps:

Make a fork of the GitHub repo and clone your fork locally:
```
$ git clone https://github.com/<your-username>/edo.git
```
Install the library in development mode. If you use Anaconda, there is a conda environment file (environment.yml) with all of the development dependencies:
```
$ cd edo
$ conda env create -f environment.yml
$ conda activate edo-dev
$ python setup.py develop
```
Make your changes and write tests to go with them. Ensure that they pass and you have 100% coverage:
```
$ python -m pytest --cov=edo --cov-fail-under=100 tests
```
Push to your fork and open a pull request.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.3.6

Jan 3, 2021

0.3.5

Aug 5, 2020

0.3.4

Jul 29, 2020

0.3.3

Jul 28, 2020

0.3.2

Jul 28, 2020

0.3.1

Jul 28, 2020

0.3.0

Jul 27, 2020

0.2.1

Apr 25, 2019

0.2

Apr 15, 2019

0.1

Feb 5, 2019

0.0.4

Jan 30, 2019

0.0.4a0 pre-release

Jan 31, 2019

0.0.3

Jan 30, 2019

0.0.2

Jan 30, 2019

0.0.1a0 pre-release

Aug 24, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edo-0.3.6.tar.gz (30.4 kB view details)

Uploaded Jan 3, 2021 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

edo-0.3.6-py3-none-any.whl (23.2 kB view details)

Uploaded Jan 3, 2021 Python 3

File details

Details for the file edo-0.3.6.tar.gz.

File metadata

Download URL: edo-0.3.6.tar.gz
Upload date: Jan 3, 2021
Size: 30.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0.post20200714 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for edo-0.3.6.tar.gz
Algorithm	Hash digest
SHA256	`7489c2c90da2ab350bfbebea533afff8ee4c6a364aabc80d51fe835411c9253e`
MD5	`b7b056540d2efc494513557c3ff84f46`
BLAKE2b-256	`1cff7014894ed1e4c0c23c31d31ea03a4daa66603091723d2d02b6236d9eedd8`

See more details on using hashes here.

File details

Details for the file edo-0.3.6-py3-none-any.whl.

File metadata

Download URL: edo-0.3.6-py3-none-any.whl
Upload date: Jan 3, 2021
Size: 23.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0.post20200714 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for edo-0.3.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`41319427e21a34808ccb624c1730cd36725847f3d87d2fb5de60ab0f3f9e3395`
MD5	`6f9c4558fe3073f0e70bb68eab3803df`
BLAKE2b-256	`d23fe5c98b7304540899037b479bb02466561c0b9e16c87ab773b99b979468cf`

See more details on using hashes here.

edo 0.3.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Evolutionary Dataset Optimisation

A library for generating artificial datasets through evolution.

Installation

Publications and documentation

Citation instructions

Citing the library

Citing the paper

Contributing to the library

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes