Swarmchestrate cluster builder

Project description

Swarmchestrate - Cluster Builder

This repository contains the codebase for [cluster-builder], which builds K3s clusters for Swarmchestrate using OpenTofu.

Key features:

Create: Provisions infrastructure using OpenTofu and installs K3s.
Add: Add worker or HA nodes to existing clusters.
Remove: Selectively remove nodes from existing clusters.
Delete: Destroys the provisioned infrastructure when no longer required.

Prerequisites

Before proceeding, ensure the following prerequisites are installed:

Git: For cloning the repository.
Python: Version 3.9 or higher.
pip: Python package manager.
OpenTofu: Version 1.6 or higher for infrastructure provisioning.
Make: To run the provided Makefile.
PostgreSQL: For storing OpenTofu state.
(Optional) Docker: To create a dev Postgres
For detailed instructions on edge device requirements, refer to the Edge Device Requirements document.

Getting Started

1. Clone the Repository

To get started, clone this repository:

git clone https://github.com/Swarmchestrate/cluster-builder.git

2. Navigate to the Project Directory

cd cluster-builder

3. Install Dependencies and Tools

Run the Makefile to install all necessary dependencies, including OpenTofu:

 make install

This command will:

Install Python dependencies listed in requirements.txt.
Download and configure OpenTofu for infrastructure management.

 make db

This command will:

Spin up an empty dev Postgres DB (in Docker) for storing state

in ths makefile database details are provide you update or use that ones name pg-db -e POSTGRES_USER=admin -e POSTGRES_PASSWORD=adminpass -e POSTGRES_DB=swarmchestrate

For database setup as a service, refer to the database setup as service document

4. Populate .env file with access config

The .env file is used to store environment variables required by the application. It contains configuration details for connecting to your cloud providers, the PostgreSQL database, and any other necessary resources.

4.1. Rename or copy the example file to .env

cp .env_example .env

4.2. Open the .env file and add the necessary configuration for your cloud providers and PostgreSQL:

## PG Configuration
POSTGRES_USER=postgres
POSTGRES_PASSWORD=secret
POSTGRES_HOST=db.example.com
POSTGRES_DATABASE=terraform_state
POSTGRES_SSLMODE=prefer

## AWS Auth
TF_VAR_aws_region=us-west-2
TF_VAR_aws_access_key=AKIAXXXXXXXXXXXXXXXX
TF_VAR_aws_secret_key=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

## OpenStack Auth - AppCreds Mode
TF_VAR_openstack_auth_method=appcreds
TF_VAR_openstack_auth_url=https://openstack.example.com:5000
TF_VAR_openstack_application_credential_id=fdXXXXXXXXXXXXXXXX
TF_VAR_openstack_application_credential_secret=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
TF_VAR_openstack_region=RegionOne

## OpenStack Auth - User/Pass Mode
# TF_VAR_openstack_auth_method=userpass
# TF_VAR_openstack_auth_url=https://openstack.example.com:5000
# TF_VAR_openstack_region=RegionOne
# TF_VAR_openstack_user_name=myuser
# TF_VAR_openstack_password=mypassword
# TF_VAR_openstack_project_id=project-id-123
# TF_VAR_openstack_user_domain_name=Default

Basic Usage

Initialisation

from cluster_builder import Swarmchestrate

# Initialise the orchestrator
orchestrator = Swarmchestrate(
    template_dir="/path/to/templates",
    output_dir="/path/to/output"
)

Creating a New Cluster

To create a new k3s cluster, use the add_node method with the master role:

# Configuration for a new cluster
config = {
    "cloud": "aws",  # Can be 'aws', 'openstack', or 'edge'
    "k3s_role": "master",  # Role can be 'master', 'worker', or 'ha'
    "ha": False,  # Set to True for high availability (HA) deployments
    "instance_type": "t2.small",  # AWS instance type
    "ssh_key_name": "g",  # SSH key name for AWS or OpenStack
    "ssh_user": "ec2-user",  # SSH user for the instance
    "ssh_private_key_path": "/workspaces/cluster-builder/scripts/g.pem",  # Path to SSH private key
    "ami": "ami-0c0493bbac867d427",  # AMI ID for AWS (specific to region)
    "tcp_ports": [10020],  # Optional list of TCP ports to open
    "udp_ports": [1003]  # Optional list of UDP ports to open
}

# Create the cluster (returns the cluster name)
cluster_name = orchestrator.add_node(config)
print(f"Created cluster: {cluster_name}")

Adding Nodes to an Existing Cluster

To add worker or high-availability nodes to an existing cluster:

# Configuration for adding a worker node
worker_config = {
    "cloud": "aws",  # Cloud provider (can be 'aws', 'openstack', or 'edge')
    "k3s_role": "worker",  # Role can be 'worker' or 'ha'
    "ha": False,  # Set to True for high availability (HA) deployments
    "instance_type": "t2.small",  # AWS instance type
    "ssh_key_name": "g",  # SSH key name
    "ssh_user": "ec2-user",  # SSH user for the instance
    "ssh_private_key_path": "/workspaces/cluster-builder/scripts/g.pem",  # Path to SSH private key
    "ami": "ami-0c0493bbac867d427",  # AMI ID for AWS
    # Optional parameters:
    # "master_ip": "12.13.14.15",  # IP address of the master node (required for worker/HA roles)
    # "cluster_name": "elastic_mcnulty",  # Name of the cluster
    # "security_group_id": "sg-xxxxxxxxxxxxxxx",  # Security group ID for AWS or OpenStack
    # "tcp_ports": [80, 443],  # List of TCP ports to open
    # "udp_ports": [53]  # List of UDP ports to open
}

# Add the worker node
cluster_name = orchestrator.add_node(worker_config)
print(f"Added worker node to cluster: {cluster_name}")

Removing a Specific Node

To remove a specific node from a cluster:

# Remove a node by its resource name
orchestrator.remove_node(
    cluster_name="your-cluster-name",
    resource_name="aws_eloquent_feynman"  # The resource identifier of the node
)

The remove_node method:

Destroys the node's infrastructure resources
Removes the node's configuration from the cluster

Destroying an Entire Cluster

To completely destroy a cluster and all its nodes:

# Destroy the entire cluster
orchestrator.destroy(
    cluster_name="your-cluster-name"
)

The destroy method:

Destroys all infrastructure resources associated with the cluster
Removes the cluster directory and configuration files

Note for Edge Devices: Since the edge device is already provisioned, the destroy method will not remove K3s directly from the edge device. You will need to manually uninstall K3s from your edge device after the cluster is destroyed.

Important Configuration Requirements

High Availability Flag (ha):

For k3s_role="worker" or k3s_role="ha", you must specify a master_ip (the IP address of the master node).
For k3s_role="master", you must not specify a master_ip.
The ha flag should be set to True for high availability deployment (usually when adding a ha or worker node to an existing master).

SSH Credentials:

For all roles (k3s_role="master", k3s_role="worker", k3s_role="ha"), you must specify both ssh_user and ssh_private_key_path except for edge.
The ssh_private_key_path should be the path to your SSH private key file. Ensure that the SSH key is copied to the specified path before running the script.
The ssh_key_name and the ssh_private_key_path are different—ensure that your SSH key is placed correctly at the provided ssh_private_key_path.

Ports:

You can specify custom ports for your nodes in the tcp_ports and udp_ports fields. However, certain ports are required for Kubernetes deployment (even if not specified explicitly):

TCP Ports:

2379-2380: For etcd communication
6443: K3s API server
10250: Kubelet metrics
51820-51821: WireGuard (for encrypted networking)
22: SSH access
80, 443: HTTP/HTTPS access
53: DNS (CoreDNS)
5432: PostgreSQL access (master node)

UDP Ports:

8472: VXLAN for Flannel
53: DNS

OpenStack:

When provisioning on OpenStack, you should provide the value for 'floating_ip_pool' from which floating IPs can be allocated for the instance. If not specified, OpenTofu will not assign floating IP.

Advanced Usage

Dry Run Mode

All operations support a dryrun parameter, which validates the configuration without making changes. A node created with dryrun should be removed with dryrun.

# Validate configuration without deploying
orchestrator.add_node(config, dryrun=True)

# Validate removal without destroying
orchestrator.remove_node(cluster_name, resource_name, dryrun=True)

# Validate destruction without destroying
orchestrator.destroy(cluster_name, dryrun=True)

Custom Cluster Names

By default, cluster names are generated automatically. To specify a custom name:

config = {
    "cloud": "aws",
    "k3s_role": "master",
    "cluster_name": "production-cluster",
    # ... other configuration ...
}

orchestrator.add_node(config)

Template Structure

Templates should be organised as follows:

templates/ - Base directory for templates
templates/{cloud}/ - Terraform modules for each cloud provider
templates/{role}_user_data.sh.tpl - Node initialisation scripts
templates/{cloud}_provider.tf.j2 - Provider configuration templates

DEMO

Some test scripts have been created for demonstrating the functionality of the cluster builder. These scripts can be referred to for understanding how the system works and for testing various configurations.

For detailed service deployment examples and to explore the test scripts, refer to the test scripts document

Project details

Release history Release notifications | RSS feed

0.5.0

May 1, 2026

0.4.5

Feb 12, 2026

0.4.4

Jan 19, 2026

0.4.3

Jan 17, 2026

0.4.2

Jan 15, 2026

0.4.1

Nov 14, 2025

0.4.0

Sep 19, 2025

0.3.3

Sep 17, 2025

0.3.2

Sep 16, 2025

This version

0.3.1

Sep 16, 2025

0.3.0

Mar 28, 2025

0.2.1

Mar 28, 2025

0.2.0

Mar 28, 2025

0.1.2

Mar 28, 2025

0.1.1

Mar 27, 2025

0.1.0

Mar 27, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cluster_builder-0.3.1.tar.gz (31.2 kB view details)

Uploaded Sep 16, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cluster_builder-0.3.1-py3-none-any.whl (33.9 kB view details)

Uploaded Sep 16, 2025 Python 3

File details

Details for the file cluster_builder-0.3.1.tar.gz.

File metadata

Download URL: cluster_builder-0.3.1.tar.gz
Upload date: Sep 16, 2025
Size: 31.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cluster_builder-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`ad76baaf61bead3a437b09f31ff151d977317be86b9b1d4fef43d53cf0da9da4`
MD5	`93ddf00efb8f3b78bb4130ba89333c1f`
BLAKE2b-256	`39f9b810ec865383b565753b5e204d1b1810937ac2115b27f56ee0ce865759c8`

See more details on using hashes here.

Provenance

The following attestation bundles were made for cluster_builder-0.3.1.tar.gz:

Publisher: release.yml on Swarmchestrate/cluster-builder

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: cluster_builder-0.3.1.tar.gz
- Subject digest: ad76baaf61bead3a437b09f31ff151d977317be86b9b1d4fef43d53cf0da9da4
- Sigstore transparency entry: 524110146
- Sigstore integration time: Sep 16, 2025
Source repository:
- Permalink: Swarmchestrate/cluster-builder@2bd2b5c7d23bb72c89fda3d4fd608f65d25d8bb6
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/Swarmchestrate
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2bd2b5c7d23bb72c89fda3d4fd608f65d25d8bb6
- Trigger Event: push

File details

Details for the file cluster_builder-0.3.1-py3-none-any.whl.

File metadata

Download URL: cluster_builder-0.3.1-py3-none-any.whl
Upload date: Sep 16, 2025
Size: 33.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cluster_builder-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d1f54b327cb1416e4bb7c4da709f007b9e95c045863abd39975eee43e12a242`
MD5	`b6fe418a4acaa444b0c77ff42c986efd`
BLAKE2b-256	`efef9901975555b157de0fdf01dae57bc44a1195829bea7a3aeb601a0f7bfa59`

See more details on using hashes here.

Provenance

The following attestation bundles were made for cluster_builder-0.3.1-py3-none-any.whl:

Publisher: release.yml on Swarmchestrate/cluster-builder

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: cluster_builder-0.3.1-py3-none-any.whl
- Subject digest: 7d1f54b327cb1416e4bb7c4da709f007b9e95c045863abd39975eee43e12a242
- Sigstore transparency entry: 524110151
- Sigstore integration time: Sep 16, 2025
Source repository:
- Permalink: Swarmchestrate/cluster-builder@2bd2b5c7d23bb72c89fda3d4fd608f65d25d8bb6
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/Swarmchestrate
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2bd2b5c7d23bb72c89fda3d4fd608f65d25d8bb6
- Trigger Event: push

cluster-builder 0.3.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Swarmchestrate - Cluster Builder

Prerequisites

Getting Started

1. Clone the Repository

2. Navigate to the Project Directory

3. Install Dependencies and Tools

4. Populate .env file with access config

4.1. Rename or copy the example file to .env

4.2. Open the .env file and add the necessary configuration for your cloud providers and PostgreSQL:

Basic Usage

Initialisation

Creating a New Cluster

Adding Nodes to an Existing Cluster

Removing a Specific Node

Destroying an Entire Cluster

Important Configuration Requirements

High Availability Flag (ha):

SSH Credentials:

Ports:

OpenStack:

Advanced Usage

Dry Run Mode

Custom Cluster Names

Template Structure

DEMO

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance