Skip to main content

Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.

Project description

Enterprise Cluster Solution

A Python package that launches a local web UI to build and run clustering pipelines with minimal code.

Installation

pip install autoaicluster

Quickstart (Web UI)

from autoai import AutoAi

# Binds to 0.0.0.0 by default; shows a localhost URL
model = AutoAi`()
url = model.launch_ui()
print("UI available at:", url)
  • To override host/port, use environment variables or constructor:
    • ECS_HOST or HOST (e.g., 0.0.0.0 for external access)
    • ECS_PORT or PORT

Python Workflow (No UI)

import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment

# dataframe or CSV path
df = pd.DataFrame({
    'age':[25,37,29,41,33,52,47,23,39,31],
    'income':[40,72,50,90,60,120,95,35,70,55],
    'city':['A','B','A','B','A','B','B','A','B','A']
})

# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])

# Or manual pipeline
pipeline = run_pipeline(
    df,
    preprocessing={
        'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
        'encoding': 'onehot',
        'scaling': 'standard',
        'outliers': {'method': 'isoforest', 'contamination': 'auto'}
    },
    clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
    visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])

Notes on access from containers/remote

  • If running UI in a container or remote VM, bind to 0.0.0.0 and open http://<public-ip>:<port>.

Features

  • Dtype editing; per-column and global imputation; include/exclude columns
  • Outlier handling (IsolationForest), separate outliers artifact
  • Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
  • Visualization: PCA, t-SNE, UMAP
  • Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
  • Auto Segment: tries algorithms/params and selects best silhouette
  • Artifacts: segmented.csv, outliers.csv

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoaicluster-0.1.4.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoaicluster-0.1.4-py3-none-any.whl (2.5 kB view details)

Uploaded Python 3

File details

Details for the file autoaicluster-0.1.4.tar.gz.

File metadata

  • Download URL: autoaicluster-0.1.4.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.4.tar.gz
Algorithm Hash digest
SHA256 bca6fc66396098d118b5c9f517e13bc1e1c301dffe8112ad53855731ed72b4fd
MD5 2809b09aa820ae829154f4fe4afd9ce8
BLAKE2b-256 2f6065b750db73b9181091beb8e99ff0fa777b437c23e4ba08ac4041ad2dadfa

See more details on using hashes here.

File details

Details for the file autoaicluster-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: autoaicluster-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 2.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e52747ce277fdae2a70251e35ccde20bee1d18908f8b58b561f4b03ed207ab5b
MD5 51ae750a2b121eb4fd7c4c88dc8ae6cf
BLAKE2b-256 858acac58efc423a74fb1a2d0f5fad1b09419876e69af232747784ddc07db42b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page