Skip to main content

Adaptation of X means algorithm for circular data

Project description

circular_clustering

Adaptation of X means algorithm for circular data

Install the package using:

pip install circular-clustering

X means algorithm with quantiles

The class CircularXMeansQuantiles contains the X means algorithm for circular data. The use is similar to the clustering algorithms in scipy.

To import it:

from circular_clustering.circular_x_means_quantiles import CircularXMeansQuantiles

To invoke the class:

circXmeans = CircularXMeansQuantiles(x, kmax=8, confidence=0.99, use_optimal_k_means=True)
  • x must be a one-dimensional NumPy array of angles between and π.
  • kmax= sets the maximum number of clusters.

To fit the algorithm:

circXmeans.fit()

Centroids are available at circXmeans.centroids, and labels at circXmeans.labels.

Example (circular data):

import numpy as np
import matplotlib.pyplot as plt

from circular_clustering.circular_x_means_quantiles import CircularXMeansQuantiles

x = np.array([ 1.658,  1.369,  1.783,  1.587,  0.942,  1.268,
               1.740,  2.245,  1.955,  1.132, -1.694, -1.121,
              -1.249, -1.834, -1.868, -1.351, -1.492, -1.607,
              -1.323, -1.913,  0.099,  0.060, -0.074, -0.127,
               0.179,  0.006,  0.273, -0.285,  0.080,  0.301])

circXmeans = CircularXMeansQuantiles(x, kmax=8, confidence=0.99, use_optimal_k_means=True)
circXmeans.fit()

plt.figure(figsize=(5,5))
plt.axes().set_aspect('equal', 'datalim')
plt.scatter(np.cos(x), np.sin(x))

for c in circXmeans.centroids:
    plt.scatter(np.cos(c), np.sin(c), c="r")

for cl in circXmeans.cluster_points:
    plt.scatter(np.cos(cl), np.sin(cl), c=np.random.rand(3,))

plt.show()

Cylindrical clustering with HDR-based XMeans

The XMeansHDR class supports clustering in cylindrical coordinates, where data has both an angular and linear component (θ, y). Clustering is performed using HDR-based region separation and a custom cylindrical distance metric.

To import:

from circular_clustering.cylindrical_hdr_x_means import XMeansHDR

Example (cylindrical data):

import numpy as np
import matplotlib.pyplot as plt
from circular_clustering import XMeansHDR

def make_cluster(center_theta, center_y, spread_theta, spread_y, n):
    angles = np.random.vonmises(center_theta, 1 / (spread_theta ** 2), size=n)
    heights = np.random.normal(center_y, spread_y, size=n)
    return np.column_stack([angles, heights])

# Simulation data
np.random.seed(123)
true_k = 4
points_per_cluster = 200
spread_theta = 0.25
spread_y = 0.4
alpha = 0.1
confidence = 1 - alpha

centers_theta = np.random.uniform(-np.pi, np.pi, true_k)
centers_y = np.random.uniform(-3, 3, true_k)

clusters = [
    make_cluster(c_theta, c_y, spread_theta, spread_y, points_per_cluster)
    for c_theta, c_y in zip(centers_theta, centers_y)
]
X = np.vstack(clusters)

# Fit HDR XMeans
xmeans = XMeansHDR(X, kmax=6, confidence=confidence)
xmeans.fit()

# Plot result
colors = plt.cm.tab10.colors
plt.figure(figsize=(8, 6))
for i in range(xmeans.k):
    cluster_points = X[xmeans.labels == i]
    plt.scatter(cluster_points[:, 0], cluster_points[:, 1],
                color=colors[i % 10], alpha=0.6, label=f"Cluster {i}")

plt.xlabel("Angle θ (radians)")
plt.ylabel("Height y")
plt.title(f"Cylindrical XMeans Clustering\nTrue: {true_k}, Found: {xmeans.k}")
plt.grid(True)
plt.legend()
plt.show()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

circular_clustering-0.0.7.tar.gz (10.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

circular_clustering-0.0.7-py3-none-any.whl (12.1 kB view details)

Uploaded Python 3

File details

Details for the file circular_clustering-0.0.7.tar.gz.

File metadata

  • Download URL: circular_clustering-0.0.7.tar.gz
  • Upload date:
  • Size: 10.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for circular_clustering-0.0.7.tar.gz
Algorithm Hash digest
SHA256 4dab87b770ebd8b6372079edcab9f7f76ba0f506e522ae1cf961d6b64c6a10aa
MD5 b269ca05fe02ddbb28d87641e3af1529
BLAKE2b-256 c2373eeda27f76e73a0f18bb1e30a41941668e273ba0df42bd0d932555ba4f63

See more details on using hashes here.

File details

Details for the file circular_clustering-0.0.7-py3-none-any.whl.

File metadata

File hashes

Hashes for circular_clustering-0.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 7116dd446442e9face30438190e713f027f31bc4a70f9d5fbb7e1ac1c0858d07
MD5 44bb21525797bf97f8028dfc10a2166a
BLAKE2b-256 55148c89ff929b1c90da46bad6805cc95a49aee8bbb7805495f79c73c5c0f955

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page