Skip to main content

Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.

Project description

Enterprise Cluster Solution

A Python package that launches a local web UI to build and run clustering pipelines with minimal code.

Installation

pip install autoaicluster

Quickstart (Web UI)

from autoaicluster import AutoAI

# Binds to 0.0.0.0 by default; shows a localhost URL
model = AutoAI()
url = model.launch_ui()
print("UI available at:", url)
  • To override host/port, use environment variables or constructor:
    • ECS_HOST or HOST (e.g., 0.0.0.0 for external access)
    • ECS_PORT or PORT

Python Workflow (No UI)

import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment

# dataframe or CSV path
df = pd.DataFrame({
    'age':[25,37,29,41,33,52,47,23,39,31],
    'income':[40,72,50,90,60,120,95,35,70,55],
    'city':['A','B','A','B','A','B','B','A','B','A']
})

# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])

# Or manual pipeline
pipeline = run_pipeline(
    df,
    preprocessing={
        'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
        'encoding': 'onehot',
        'scaling': 'standard',
        'outliers': {'method': 'isoforest', 'contamination': 'auto'}
    },
    clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
    visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])

Notes on access from containers/remote

  • If running UI in a container or remote VM, bind to 0.0.0.0 and open http://<public-ip>:<port>.

Features

  • Dtype editing; per-column and global imputation; include/exclude columns
  • Outlier handling (IsolationForest), separate outliers artifact
  • Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
  • Visualization: PCA, t-SNE, UMAP
  • Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
  • Auto Segment: tries algorithms/params and selects best silhouette
  • Artifacts: segmented.csv, outliers.csv

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoaicluster-0.1.6.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoaicluster-0.1.6-py3-none-any.whl (2.5 kB view details)

Uploaded Python 3

File details

Details for the file autoaicluster-0.1.6.tar.gz.

File metadata

  • Download URL: autoaicluster-0.1.6.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.6.tar.gz
Algorithm Hash digest
SHA256 db8c91cbb8a2566ceae8dc18a9df2d7191f78a3c671484cacce09ad9499b0cde
MD5 f015b1cb1bfb7da5469826c2c45040ce
BLAKE2b-256 955ce69c36c0b3f0937b5d78fc48f9e9d9d1a1a0b6434224318f70e62622c501

See more details on using hashes here.

File details

Details for the file autoaicluster-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: autoaicluster-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 2.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 e48bae488dd6aa6bf227f784b8c2cd9e5b2f9e759a11f869d4f33eb60fdd73d1
MD5 d3a7b8c6df9e096086613e0a3cef4f8f
BLAKE2b-256 15887b409082ce8e4642398127b871316ff9389b8fd389284e149b486faf5afc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page