Skip to main content

Randomly generate populations.

Project description

CI Status Documentation Status

This package can be used to generate random populations based on (publicly availabled) aggregated data.

These populations can then for example be used for experimentation in the field of machine learning or simulation studies.

The simago package is available on PyPI, the source code on Github and the documentation on ReadTheDocs. The package is released under the GPLv3 licence and tested for Python 3.7 and up.

See the changes in the changelog.

Usage

The easiest way to get started, after installing the package with pip install simago, is to use the function generate_population() from simago.population. This function creates an instance of the PopulationClass object with the generated population Pandas DataFrame as the PopulationClass.population attribute.

The population DataFrame contains a row for every person and a column for each property. The values for these properties are randomly drawn from probability distributions defined by the supplied data. This is done by supplying a settings (YAML) file, a data (CSV) file and possibly a conditions (CSV) file. For more guidance see the example in the documentation.

How to contribute

If you would like to contribute to this project, please check out the contributing guidelines.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simago-0.3.2.tar.gz (131.3 kB view hashes)

Uploaded Source

Built Distribution

simago-0.3.2-py3-none-any.whl (23.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page