Skip to main content

clustering packages with DDCAL algorithm

Project description

Overview

A heuristic one dimensional clustering algorithm called DDCAL (Density Distribution Cluster Algorithm) that is based on iterative feature scaling. The algorithm aims as first order to even distribute data into clusters by considering as well as second order to minimize the variance inside each cluster and maximizing the distances between clusters.

The algorithm is designed to be used for visualization, e.g., on choropleth maps.

Basic Usage

pip install -i https://pypi.org/simple/ ddcal
from clustering.ddcal import DDCAL

# load data
frequencies = [0, 1, 1, 1, 5, 5, 5, 30, 88]

# initialize parameters
ddcal = DDCAL(n_clusters=3, feature_boundary_min=0.1, feature_boundary_max=0.49,
                  num_simulations=20, q_tolerance=0.45, q_tolerance_increase_step=0.5)

# execute DDCAL algorithm
ddcal.fit(frequencies)

# print/use results
print(ddcal.sorted_data)
print(ddcal.labels_sorted_data)

Supplemental Material

Supplemental material for the paper DDCAL: Evenly Distributing Data into High Density Clusters based on Iterative Feature Scaling can be found in the folder:

supplemental

Synthetic Data Sets

The synthetic data sets, which were used in the paper DDCAL: Evenly Distributing Data into High Density Clusters based on Iterative Feature Scaling which includes a description on each data set, can be found in the folder:

tests/data

Acknowledgements

"ICT of the Future" program - an initiative of the Federal Ministry for Climate Protection, Environment, Energy, Mobility, Innovation and Technology (BMK)

accessibility text

SPRINGER NATURE Link/DOI

https://doi.org/10.1007/s00357-022-09428-6

Citation

@article{cite-key,
	author = {Lux, Marian and Rinderle-Ma, Stefanie},
	journal = {Journal of Classification},
	number = {1},
	pages = {106--144},
	title = {DDCAL: Evenly Distributing Data into Low Variance Clusters Based on Iterative Feature Scaling},
	volume = {40},
	year = {2023}}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ddcal-1.2.1.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ddcal-1.2.1-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file ddcal-1.2.1.tar.gz.

File metadata

  • Download URL: ddcal-1.2.1.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for ddcal-1.2.1.tar.gz
Algorithm Hash digest
SHA256 6066027be675f500ee3c1529c8f593894e6268cb0b0c7f22ac854069b301fb5e
MD5 f9c1a919a5cd00da4fff5bd22ad1468b
BLAKE2b-256 23494d80a45245244053ea6c33a54658d14b6d4f566df45266fc4a06d5972ed3

See more details on using hashes here.

File details

Details for the file ddcal-1.2.1-py3-none-any.whl.

File metadata

  • Download URL: ddcal-1.2.1-py3-none-any.whl
  • Upload date:
  • Size: 6.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for ddcal-1.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f91d846c5b918ec857d20e2df94a0fb5160ed81327485bc33be2f2751c068aa9
MD5 742af5d4fe6d9316ff51e8ec51b68c2e
BLAKE2b-256 a0fedd886ef1922fde08293d474d634f487d9f4796691c299275aba3cc9a957d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page