A modular, pure-NumPy federated learning simulation and benchmark framework for research and education

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thecola13

These details have not been verified by PyPI

Project description

fed-playground

A modular federated learning simulation framework for research and education.

Overview
Project Structure
Requirements
Installation
Configuration
Usage
Running Tests
Contributing
License

Overview

fed-playground is a pure-Python federated learning playground designed to make it easy to prototype, analyse, and teach the core ideas behind federated learning (FL) — without requiring a cluster or real network infrastructure.

The library implements a strategy-pattern architecture: every key concern (encryption, aggregation, model type, data loading) is expressed as an abstract base class that can be swapped out independently. This makes it straightforward to answer questions such as "what happens to model divergence as I increase the number of parties?" or "how does closed-form averaging compare to gradient descent in a federated setting?"

Out of the box the framework ships with:

Models — gradient descent linear regression, closed-form linear regression, L2-regularised ridge regression, logistic regression, and a single-hidden-layer MLP (all pure NumPy).
Aggregation strategies — FedAvg (mean), coordinate-wise median, and trimmed mean for Byzantine robustness.
Encryption schemes — passthrough baseline, Gaussian differential-privacy noise, and additive secret sharing; plus an interface compatible with Concrete ML when the optional [fhe] extra is installed.
Four visualizers — training history curves, model comparison bar charts, and divergence analysis plots with per-party breakdowns.

Research-grounded components

Beyond the baselines above, the framework ships implementations of published algorithms across every strategy type. Each class names its source in its docstring:

Type	Class	Reference
Aggregation	`KrumAggregation`	Blanchard et al., Byzantine-Tolerant Gradient Descent, NeurIPS 2017
Aggregation	`BulyanAggregation`	El Mhamdi et al., The Hidden Vulnerability of Distributed Learning in Byzantium, ICML 2018
Aggregation	`GeometricMedianAggregation` (RFA)	Pillutla et al., Robust Aggregation for Federated Learning, IEEE TSP 2022
Aggregation	`MedianOfMeansAggregation`	Nemirovski & Yudin 1983; Lugosi & Mendelson 2019
Aggregation	`CenteredClippingAggregation`	Karimireddy, He & Jaggi, Learning from History for Byzantine Robust Optimization, ICML 2021
Model	`SVMModel`	Shalev-Shwartz et al., Pegasos, Math. Programming 2011
Model	`LassoRegressionModel`	Tibshirani 1996; Friedman, Hastie & Tibshirani 2010
Model	`ElasticNetRegressionModel`	Zou & Hastie, Elastic Net, JRSS-B 2005
Model	`PoissonRegressionModel`	Nelder & Wedderburn, GLMs, JRSS-A 1972
Model	`HuberRegressionModel`	Huber, Robust Estimation of a Location Parameter, Ann. Math. Stat. 1964
Encryption	`LaplaceDPEncryption`	Dwork et al., Calibrating Noise to Sensitivity, TCC 2006
Encryption	`PairwiseMaskingEncryption`	Bonawitz et al., Practical Secure Aggregation, ACM CCS 2017
Visualizer	`PrivacyUtilityVisualizer`	Abadi et al., Deep Learning with Differential Privacy, ACM CCS 2016

Robust aggregators and additive-masking schemes carry an is_linear_only flag: masking schemes (secret sharing, pairwise masks) hide individual updates, so order/distance-based aggregators (Krum, median, Bulyan, …) correctly refuse them — only linear aggregation (MeanAggregation) is sound over masked shares.

Runnable demos for all of the above live in examples/ (e.g. example_byzantine_robust.py, example_svm_lasso.py, example_dp_and_secure_agg.py, example_elasticnet_poisson.py, example_bulyan_mom.py, example_privacy_utility_curve.py).

Benchmarking (`fedbench`)

Describe an experiment in TOML and run the whole sweep with one command:

fedbench run benchmarks/robustness.toml      # attack × defense matrix
fedbench run benchmarks/impossibility.toml   # privacy × robustness frontier
fedbench run benchmarks/privacy.toml         # DP utility cost

Each run writes a results CSV and a Markdown leaderboard (NaN cells = incompatible combinations, e.g. masking secure-aggregation × order-statistic defense). The committed benchmarks/STUDY.md interprets the three — reproducible byte-for-byte under the configs' fixed seeds.

Optional: Fully Homomorphic Encryption (FHE)

The [fhe] optional extra installs Concrete ML (Zama's FHE library). This requires Python 3.11 or 3.12 (concrete-ml does not yet support 3.13+). When installed, custom EncryptionScheme subclasses can perform genuine ciphertext-level operations during aggregation. Without it the framework operates entirely in plaintext — which is correct for educational purposes.

Project Structure

fed_env/
├── fed_playground/             # Installable Python package
│   ├── __init__.py             # Public API — import everything from here
│   └── src/
│       ├── aggregation.py      # AggregationStrategy ABC + MeanAggregation
│       ├── dataloader.py       # DataLoader: CSV / DataFrame / numpy inputs
│       ├── encryption.py       # EncryptionScheme ABC + NoEncryption baseline
│       ├── environment.py      # Environment: high-level simulation driver
│       ├── models.py           # Model ABC + Linear / ClosedForm implementations
│       ├── orchestrator.py     # Orchestrator: broadcast + aggregate
│       ├── party.py            # Party: local training + encrypted updates
│       ├── utils_data.py       # Synthetic data generation and splitting
│       └── visualization.py   # TrainingHistory / Comparison / Divergence plots
├── examples/
│   ├── basic_simulation.py     # Minimal end-to-end FL demo
│   ├── visualization_demo.py   # All three visualizers in one script
│   ├── divergence_analysis.py  # CLI tool for divergence experiments
│   └── test_data.csv           # Bundled 200-sample toy dataset
├── tests/                      # pytest test suite (mirrors src/ layout)
├── .env.example                # Template for environment variables
├── pyproject.toml              # Project metadata, dependencies, tool config
└── LICENSE                     # MIT

Requirements

Python ≥ 3.11, < 3.13
System: no compiled extensions required for the core library
Optional FHE: concrete-ml ≥ 1.9.0 (Python 3.11 or 3.12 only, macOS/Linux)

Core Python dependencies (installed automatically):

Package	Purpose
`numpy ≥ 1.24`	Numerical computing
`pandas ≥ 2.0`	Data loading and manipulation
`matplotlib ≥ 3.7`	Visualizations
`tqdm ≥ 4.65`	Progress bars in example scripts

Installation

Using uv (recommended)

uv is the fastest way to get started — it manages the Python version and virtual environment automatically.

# 1. Clone the repository
git clone https://github.com/thecola13/fed-playground.git
cd fed-playground

# 2. Install dependencies and create the virtual environment
uv sync

# 3. (Optional) Include development tools
uv sync --extra dev

# 4. (Optional) Include FHE support (Python 3.11 or 3.12 required)
uv sync --extra fhe

Run scripts directly without activating the environment:

uv run python examples/basic_simulation.py --parties 4 --rounds 10
uv run pytest

Or activate the environment first:

source .venv/bin/activate          # Windows: .venv\Scripts\activate
python examples/basic_simulation.py
pytest

Using pip (alternative)

# 1. Clone the repository
git clone https://github.com/thecola13/fed-playground.git
cd fed-playground

# 2. Create and activate a virtual environment
python3.11 -m venv .venv
source .venv/bin/activate          # Windows: .venv\Scripts\activate

# 3. Install the library in editable mode
pip install -e .

# 4. (Optional) Install development tools
pip install -e ".[dev]"

# 5. (Optional) FHE support
pip install -e ".[fhe]"

Verify the installation:

python -c "import fed_playground; print('fed-playground installed successfully')"

Configuration

The library reads a small number of optional environment variables. Copy .env.example to .env and edit as needed — the file is git-ignored.

Variable	Required	Default	Description
`THESIS_DATA_DIR`	No	—	Path to an external research data directory. Used by private example scripts only; not needed for the published examples.
`FED_LOG_LEVEL`	No	`WARNING`	Python logging level for the `fed_playground` logger (`DEBUG`, `INFO`, `WARNING`, `ERROR`).

The library uses Python's standard logging module. To see round-by-round progress, set FED_LOG_LEVEL=INFO or configure logging in your script:

import logging
logging.basicConfig(level=logging.INFO)

Usage

Basic simulation

from fed_playground import (
    Environment,
    NoEncryption,
    MeanAggregation,
    ClosedFormLinearRegressionModel,
    DataLoader,
)

# Load data from a CSV file
loader = DataLoader(file_path="examples/test_data.csv", target_column="target")

# Build and run a 3-party federated simulation for 10 rounds
env = Environment(
    n_parties=3,
    encryption_scheme=NoEncryption(),
    aggregation_strategy=MeanAggregation(),
    model_class=ClosedFormLinearRegressionModel,
    data_loader=loader,
)

history = env.run_simulation(rounds=10)

print(history["global_loss"])   # per-round MSE on the held-out test set
print(history["party_loss"])    # per-round average local training MSE

Using synthetic data

env = Environment(
    n_parties=5,
    n_features=10,
    n_samples=500,
    encryption_scheme=NoEncryption(),
    aggregation_strategy=MeanAggregation(),
    model_class=ClosedFormLinearRegressionModel,
)
history = env.run_simulation(rounds=20)

Visualizing results

from fed_playground import TrainingHistoryVisualizer, ComparisonVisualizer

# Plot training curves (displayed interactively)
viz = TrainingHistoryVisualizer()
viz.plot(
    data={"Global Loss": history["global_loss"], "Party Loss": history["party_loss"]},
    title="Federated Training",
    xlabel="Round",
    ylabel="MSE",
)

# Save to a file instead
viz_save = TrainingHistoryVisualizer(save_dir="./plots")
viz_save.plot(data=..., filename="training.png")

Extending with a custom model

import numpy as np
from fed_playground import Model, Environment, NoEncryption, MeanAggregation

class RidgeRegressionModel(Model):
    def __init__(self, input_dim: int, alpha: float = 0.01) -> None:
        self.input_dim = input_dim
        self.alpha = alpha
        self.params = np.zeros(input_dim + 1)

    def train(self, X, y):
        n = X.shape[0]
        X_b = np.hstack([X, np.ones((n, 1))])
        I = np.eye(X_b.shape[1])
        I[-1, -1] = 0  # do not regularize bias term
        self.params = np.linalg.solve(X_b.T @ X_b + self.alpha * I, X_b.T @ y)

    def get_parameters(self): return self.params
    def set_parameters(self, p): self.params = p
    def predict(self, X): return np.hstack([X, np.ones((X.shape[0], 1))]) @ self.params
    def evaluate(self, X, y): return float(np.mean((y - self.predict(X)) ** 2))

env = Environment(
    n_parties=4,
    n_features=5,
    n_samples=200,
    model_class=RidgeRegressionModel,
    model_params={"alpha": 0.1},
)
env.run_simulation(rounds=5)

CLI example scripts

# Basic simulation with a table of per-round metrics
python examples/basic_simulation.py --parties 4 --rounds 10

# Divergence analysis: vary party count, display plots interactively
python examples/divergence_analysis.py \
    --data-path examples/test_data.csv \
    --features feature_1 feature_2 feature_3 feature_4 feature_5 \
    --target target \
    --instances-diff --min-instances 2 --max-instances 8

# Save plots to disk instead of showing them
python examples/divergence_analysis.py \
    --data-path examples/test_data.csv \
    --features feature_1 feature_2 feature_3 feature_4 feature_5 \
    --target target \
    --instances-diff \
    --save-dir ./results

Running Tests

# Run the full suite
uv run pytest

# Run a specific module
uv run pytest tests/test_environment.py

# Run with coverage report
uv run pytest --cov=fed_playground --cov-report=term-missing

A passing run looks like:

tests/test_aggregation.py    .....
tests/test_dataloader.py     ............
tests/test_encryption.py     ......
tests/test_environment.py    .......
tests/test_models.py         ...........
tests/test_orchestrator.py   .....
tests/test_party.py          .......
tests/test_utils_data.py     .......

Contributing

Setting up a dev environment

git clone https://github.com/thecola13/fed-playground.git
cd fed-playground
uv sync --extra dev

Branch naming

Type	Prefix	Example
New feature	`feat/`	`feat/add-dp-noise`
Bug fix	`fix/`	`fix/dataloader-elif`
Documentation	`docs/`	`docs/update-readme`
Refactoring	`refactor/`	`refactor/split-orchestrator`

Commit style

Follow Conventional Commits:

feat(aggregation): add weighted FedAvg strategy
fix(dataloader): correct elif branch for DataFrame input
docs(readme): add custom model example

Pull request checklist

pytest passes with zero failures
ruff check . reports zero violations
black --check . reports no formatting issues
New public functions/classes have Google-style docstrings
New behaviour is covered by at least one test

License

This project is licensed under the MIT License — see LICENSE for the full text.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thecola13

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.1

Jun 17, 2026

This version

0.2.0

Jun 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fed_playground-0.2.0.tar.gz (63.5 kB view details)

Uploaded Jun 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fed_playground-0.2.0-py3-none-any.whl (51.2 kB view details)

Uploaded Jun 17, 2026 Python 3

File details

Details for the file fed_playground-0.2.0.tar.gz.

File metadata

Download URL: fed_playground-0.2.0.tar.gz
Upload date: Jun 17, 2026
Size: 63.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for fed_playground-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`c58f2f7312cc5c30a372c125f56ff57dc4291ca599d78848e77e2e33a8f8f491`
MD5	`01b84c6e1cd6040d46f2ad365cdd27f7`
BLAKE2b-256	`cbe10ac8409cd7a5c7b17d8279c313a496a46d8cd7a368c630fd26bca389f8fb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fed_playground-0.2.0.tar.gz:

Publisher: release.yml on thecola13/fed-playground

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fed_playground-0.2.0.tar.gz
- Subject digest: c58f2f7312cc5c30a372c125f56ff57dc4291ca599d78848e77e2e33a8f8f491
- Sigstore transparency entry: 1852902839
- Sigstore integration time: Jun 17, 2026
Source repository:
- Permalink: thecola13/fed-playground@c2d950b589415660910fe5bd838decc80d13ed9b
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/thecola13
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@c2d950b589415660910fe5bd838decc80d13ed9b
- Trigger Event: push

File details

Details for the file fed_playground-0.2.0-py3-none-any.whl.

File metadata

Download URL: fed_playground-0.2.0-py3-none-any.whl
Upload date: Jun 17, 2026
Size: 51.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for fed_playground-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`972e3b15da5ab356b014cfb4e7c1342e873a3492eecdb40670295d2b9bda377a`
MD5	`fc9446c6af32a7a10031d62217d99df6`
BLAKE2b-256	`be63ddea00339bb6eeeccf58e9b80c78110250307ffd6bcc2f51af9216374276`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fed_playground-0.2.0-py3-none-any.whl:

Publisher: release.yml on thecola13/fed-playground

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fed_playground-0.2.0-py3-none-any.whl
- Subject digest: 972e3b15da5ab356b014cfb4e7c1342e873a3492eecdb40670295d2b9bda377a
- Sigstore transparency entry: 1852903221
- Sigstore integration time: Jun 17, 2026
Source repository:
- Permalink: thecola13/fed-playground@c2d950b589415660910fe5bd838decc80d13ed9b
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/thecola13
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@c2d950b589415660910fe5bd838decc80d13ed9b
- Trigger Event: push

fed-playground 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

fed-playground

Table of Contents

Overview

Research-grounded components

Benchmarking (fedbench)

Optional: Fully Homomorphic Encryption (FHE)

Project Structure

Requirements

Installation

Using uv (recommended)

Using pip (alternative)

Configuration

Usage

Basic simulation

Using synthetic data

Visualizing results

Extending with a custom model

CLI example scripts

Running Tests

Contributing

Setting up a dev environment

Branch naming

Commit style

Pull request checklist

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Benchmarking (`fedbench`)