Skip to main content

Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.

Project description

Enterprise Cluster Solution

A Python package that launches a local web UI to build and run clustering pipelines with minimal code.

Installation

pip install autoaicluster

Quickstart (Web UI)

from autoai import AutoAI

# Binds to 0.0.0.0 by default; shows a localhost URL
model = AutoAI()
url = model.launch_ui()
print("UI available at:", url)
  • To override host/port, use environment variables or constructor:
    • ECS_HOST or HOST (e.g., 0.0.0.0 for external access)
    • ECS_PORT or PORT

Python Workflow (No UI)

import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment

# dataframe or CSV path
df = pd.DataFrame({
    'age':[25,37,29,41,33,52,47,23,39,31],
    'income':[40,72,50,90,60,120,95,35,70,55],
    'city':['A','B','A','B','A','B','B','A','B','A']
})

# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])

# Or manual pipeline
pipeline = run_pipeline(
    df,
    preprocessing={
        'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
        'encoding': 'onehot',
        'scaling': 'standard',
        'outliers': {'method': 'isoforest', 'contamination': 'auto'}
    },
    clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
    visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])

Notes on access from containers/remote

  • If running UI in a container or remote VM, bind to 0.0.0.0 and open http://<public-ip>:<port>.

Features

  • Dtype editing; per-column and global imputation; include/exclude columns
  • Outlier handling (IsolationForest), separate outliers artifact
  • Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
  • Visualization: PCA, t-SNE, UMAP
  • Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
  • Auto Segment: tries algorithms/params and selects best silhouette
  • Artifacts: segmented.csv, outliers.csv

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoaicluster-0.1.5.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoaicluster-0.1.5-py3-none-any.whl (2.5 kB view details)

Uploaded Python 3

File details

Details for the file autoaicluster-0.1.5.tar.gz.

File metadata

  • Download URL: autoaicluster-0.1.5.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.5.tar.gz
Algorithm Hash digest
SHA256 4c918e69e7d6e73a02b7e452bf93ab6b45ae8fe440adab89806547d4498a331b
MD5 b11128a36be57e3fdc9657b18dee2cc8
BLAKE2b-256 4cf12d60fc3112a11138333e99067fe324cedda25f6cd0097e2a874ce5338525

See more details on using hashes here.

File details

Details for the file autoaicluster-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: autoaicluster-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 2.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for autoaicluster-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 8384fda93ed4fe9a869b2e5ae79eec440fdf1ea45a4ce93ecff1c01e3add8131
MD5 61f9a0301c5f61b943293bd4d0cb7b75
BLAKE2b-256 cc210d8c9c0f901e136e29700b3f304edc1c8bd66ab1eb90b9638f8b625fd09e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page