Skip to main content

Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.

Project description

Enterprise Cluster Solution

A Python package that launches a local web UI to build and run clustering pipelines with minimal code.

Installation

python3 -m pip install --break-system-packages -e .

Quickstart (Web UI)

from enterprise_cluster_solution import ClusteringSolution

# Binds to 0.0.0.0 by default; shows a localhost URL
model = ClusteringSolution()
url = model.launch_ui()
print("UI available at:", url)
  • To override host/port, use environment variables or constructor:
    • ECS_HOST or HOST (e.g., 0.0.0.0 for external access)
    • ECS_PORT or PORT

Python Workflow (No UI)

import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment

# dataframe or CSV path
df = pd.DataFrame({
    'age':[25,37,29,41,33,52,47,23,39,31],
    'income':[40,72,50,90,60,120,95,35,70,55],
    'city':['A','B','A','B','A','B','B','A','B','A']
})

# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])

# Or manual pipeline
pipeline = run_pipeline(
    df,
    preprocessing={
        'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
        'encoding': 'onehot',
        'scaling': 'standard',
        'outliers': {'method': 'isoforest', 'contamination': 'auto'}
    },
    clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
    visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])

Notes on access from containers/remote

  • If running UI in a container or remote VM, bind to 0.0.0.0 and open http://<public-ip>:<port>.

Features

  • Dtype editing; per-column and global imputation; include/exclude columns
  • Outlier handling (IsolationForest), separate outliers artifact
  • Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
  • Visualization: PCA, t-SNE, UMAP
  • Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
  • Auto Segment: tries algorithms/params and selects best silhouette
  • Artifacts: segmented.csv, outliers.csv

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoaicluster-0.1.2.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoaicluster-0.1.2-py3-none-any.whl (2.5 kB view details)

Uploaded Python 3

File details

Details for the file autoaicluster-0.1.2.tar.gz.

File metadata

  • Download URL: autoaicluster-0.1.2.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.2.tar.gz
Algorithm Hash digest
SHA256 8c43eeb3877befb6b7b61b6cd2ea6bffc9e0ca0c58e6bbd6504ad9a040c67beb
MD5 d974fe46dc2fa4a5af57ef9c7b111cb9
BLAKE2b-256 afcc0034e42a362f6a3333893ed4e4aa8464f206337ee2831c37f353e4b7f8bf

See more details on using hashes here.

File details

Details for the file autoaicluster-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: autoaicluster-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 2.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8ccdbae6eea111c882c59a74f3a78934a55496ca225f97409090846065f2dca4
MD5 121933127fbab6f4dd7dfaa90cbfb617
BLAKE2b-256 81fda5fa318ed2399d3d5c43ac9dd63c46e7e976e0afff58b625e7a6b2de9671

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page