Enterprise-grade clustering solution with local web UI for preprocessing, clustering, and visualization.
Project description
Enterprise Cluster Solution
A Python package that launches a local web UI to build and run clustering pipelines with minimal code.
Installation
python3 -m pip install --break-system-packages -e .
Quickstart (Web UI)
from enterprise_cluster_solution import ClusteringSolution
# Binds to 0.0.0.0 by default; shows a localhost URL
model = ClusteringSolution()
url = model.launch_ui()
print("UI available at:", url)
- To override host/port, use environment variables or constructor:
ECS_HOSTorHOST(e.g., 0.0.0.0 for external access)ECS_PORTorPORT
Python Workflow (No UI)
import pandas as pd
from enterprise_cluster_solution import run_pipeline, auto_segment
# dataframe or CSV path
df = pd.DataFrame({
'age':[25,37,29,41,33,52,47,23,39,31],
'income':[40,72,50,90,60,120,95,35,70,55],
'city':['A','B','A','B','A','B','B','A','B','A']
})
# AutoML-style segmentation
result = auto_segment(df)
print('Algorithm:', result['algorithm'])
print('Silhouette:', result['silhouette'])
print('Counts:', result['label_counts'])
# Or manual pipeline
pipeline = run_pipeline(
df,
preprocessing={
'imputation': {'numeric': 'mean', 'categorical': 'most_frequent', 'fill_value': 0},
'encoding': 'onehot',
'scaling': 'standard',
'outliers': {'method': 'isoforest', 'contamination': 'auto'}
},
clustering={'algorithm': 'kmeans', 'params': {'n_clusters': 3}},
visualization={'method': 'pca', 'n_components': 2}
)
print('KMeans silhouette:', pipeline['silhouette'])
Notes on access from containers/remote
- If running UI in a container or remote VM, bind to
0.0.0.0and openhttp://<public-ip>:<port>.
Features
- Dtype editing; per-column and global imputation; include/exclude columns
- Outlier handling (IsolationForest), separate outliers artifact
- Algorithms: KMeans, K-Medoids, Agglomerative, BIRCH, DBSCAN, OPTICS, GMM
- Visualization: PCA, t-SNE, UMAP
- Metrics/insights: Silhouette, per-point silhouette, top features, cluster profiles
- Auto Segment: tries algorithms/params and selects best silhouette
- Artifacts: segmented.csv, outliers.csv
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
autoaicluster-0.1.0.tar.gz
(19.9 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file autoaicluster-0.1.0.tar.gz.
File metadata
- Download URL: autoaicluster-0.1.0.tar.gz
- Upload date:
- Size: 19.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
78ded4299b572a0a5040dc750ca5ccef7a0b22ab93351df456b7a6769c21d691
|
|
| MD5 |
e9c7845f44eaa3dd2b2c6226c15bd661
|
|
| BLAKE2b-256 |
9ffb9b2c99683d494622406466940414865cad62abbe3602624a809abd7d1234
|
File details
Details for the file autoaicluster-0.1.0-py3-none-any.whl.
File metadata
- Download URL: autoaicluster-0.1.0-py3-none-any.whl
- Upload date:
- Size: 17.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f921eb83f5c5921a56d440464b2d61ac0fcffdcd85b7ead16524fed8116ac4ca
|
|
| MD5 |
2c7064d67223a91d0293b89a2d4cfb21
|
|
| BLAKE2b-256 |
7208b1932049d71fa5c8ec401268aced44955a1938d02ac5d6431c185c469981
|