Skip to main content

Generate synthetic data with clusters.

Project description

Tests Coverage

repliclust (pronounced rep-lee-clust, from replicate and cluster) is a Python package for generating synthetic data sets with clusters.

The package is based on data set archetypes, high-level geometric descriptions of whole classes of data sets. Our framework allows you to generate similar but distinct data sets, making benchmarks more informative and reproducible; repliclust builds upon SciPy and NumPy and is meant to complement scikit-learn.

For a full documentation, visit the project website: https://repliclust.org.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repliclust-0.0.3.tar.gz (38.3 MB view hashes)

Uploaded Source

Built Distribution

repliclust-0.0.3-py3-none-any.whl (42.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page