Skip to main content

Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.

Project description

Enterprise Cluster Solution

A Python package that launches a local web UI to build and run clustering pipelines with minimal code.

Installation

pip install autoaicluster

Quickstart (Web UI)

from enterprise_cluster_solution import ClusteringSolution

# Binds to 0.0.0.0 by default; shows a localhost URL
model = ClusteringSolution()
url = model.launch_ui()
print("UI available at:", url)
  • To override host/port, use environment variables or constructor:
    • ECS_HOST or HOST (e.g., 0.0.0.0 for external access)
    • ECS_PORT or PORT

Python Workflow (No UI)

import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment

# dataframe or CSV path
df = pd.DataFrame({
    'age':[25,37,29,41,33,52,47,23,39,31],
    'income':[40,72,50,90,60,120,95,35,70,55],
    'city':['A','B','A','B','A','B','B','A','B','A']
})

# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])

# Or manual pipeline
pipeline = run_pipeline(
    df,
    preprocessing={
        'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
        'encoding': 'onehot',
        'scaling': 'standard',
        'outliers': {'method': 'isoforest', 'contamination': 'auto'}
    },
    clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
    visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])

Notes on access from containers/remote

  • If running UI in a container or remote VM, bind to 0.0.0.0 and open http://<public-ip>:<port>.

Features

  • Dtype editing; per-column and global imputation; include/exclude columns
  • Outlier handling (IsolationForest), separate outliers artifact
  • Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
  • Visualization: PCA, t-SNE, UMAP
  • Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
  • Auto Segment: tries algorithms/params and selects best silhouette
  • Artifacts: segmented.csv, outliers.csv

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoaicluster-0.1.3.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoaicluster-0.1.3-py3-none-any.whl (2.5 kB view details)

Uploaded Python 3

File details

Details for the file autoaicluster-0.1.3.tar.gz.

File metadata

  • Download URL: autoaicluster-0.1.3.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.3.tar.gz
Algorithm Hash digest
SHA256 7721245a603e8e792923af14a9c31d124ab7763fee71ef273f64b90bc38bfb2e
MD5 62bf942660921b531e34cf9be34a45c5
BLAKE2b-256 dfc9d07d76edcb8ec0fd2ad8afe87a59ebe556a35a35f1a23fc59d20673e1b47

See more details on using hashes here.

File details

Details for the file autoaicluster-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: autoaicluster-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 2.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 072a6d38258c08631d561ebcad4d273683257b49ac3e508ba85a2f721ea20e8d
MD5 24c75089c6e0af466949b46bd9f1c5cc
BLAKE2b-256 46952f1a35d9f30e6d7c36082c9058f3ac37e7c08dd5a2e0c129ccda790aa3bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page