A benchmark for proximity-based non-IID Federated Learning
Project description
ProFed: A Benchmark for Proximity-based Federated Learning
🔗 ProFed: A Benchmark for Proximity‑based Non‑IID Federated Learning
ProFed is a framework for evaluating federated learning (FL) systems under realistic, geographically clustered, non‑IID data scenarios. It simulates clients grouped into regions such that data is IID within regions but non‑IID across regions.
🚀 Features
- Built‑in datasets: Support for MNIST, FashionMNIST, CIFAR‑10, CIFAR‑100 and UTKFace via PyTorch/TorchVision.
- Flexible partitioning: Implements Dirichlet-based splits, hard label skews, and can model arbitrary proximity-driven distribution skews.
- Customizable proximity modeling: Define how many geographic clusters (regions) to simulate and control skew intensity (e.g., Dirichlet α).
🔧 Getting Started
Prerequisites
- Python ≥ 3.12
Installation
ProFed is publicly released on PyPi, to install ProFed on your machine:
pip install ProFed
API Explanation
1. Downloading and importing the dataset
train_data, test_data = download_dataset('EMNIST')
2. Splitting into train & validation sets
train_data, validation_data = split_train_validation(train_data, 0.8)
3. Partitioning into geographic “regions” (i.i.d. internally)
environment = partition_to_subregions(
train_data,
validation_data,
dataset_name = 'EMNIST',
partitioning_method = 'Hard',
number_of_regions = 5,
seed = 42,
)
-
method: partition strategy ('Hard', 'Dirichlet', or 'IID')
-
number_of_regions: how many simulated geographic clusters
-
dirichlet_alpha: optional, controls how concentrated the Dirichlet splits are
-
min_region_size: optional, retries Dirichlet sampling until each region has at least this many samples
-
Returns an Environment object. Each region within it contains IID data internally, but non-IID across regions.
Example for Dirichlet:
environment = partition_to_subregions(
train_data,
validation_data,
dataset_name = 'CIFAR100',
partitioning_method = 'Dirichlet',
number_of_regions = 8,
seed = 42,
dirichlet_alpha = 0.3,
min_region_size = 20,
)
1. Distributing region data across devices
mapping = {}
for region_id, devices in mapping_devices_area.items():
mapping_devices_data = environment.from_subregion_to_devices(
region_id,
len(devices)
)
for device_index, data in mapping_devices_data.items():
device_id = devices[device_index]
mapping[device_id] = data
Splits the region’s IID data equally among its devices, assigning each a local subset.
The result is a mapping:
device_id → local_dataset
✍🏻 Examples of usage
- Proximity-based Self-Federated Learning
- SParSeFuL
- Baselines implementation
- ProFed Startup Experiemnts
📄 License
MIT License — feel free to freely use, modify, and distribute.
📬 Contact
For questions, issues, or contributions, feel free to reach out:
- Author: Davide Domini
- Email: davide.domini@unibo.it
- GitHub: domm99
You can also open an issue or submit a pull request on GitHub!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file profed-0.7.6.tar.gz.
File metadata
- Download URL: profed-0.7.6.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.13 Linux/6.14.0-1017-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ea924ae191a7632b10c59b9e7e59c737fbeb929b07a9ae9ca2af1610a9c27c71
|
|
| MD5 |
1639420e681c4a3fc0e00e14324af7ce
|
|
| BLAKE2b-256 |
49acfffcd9120f398525165f833a9f8f6398e335506668ae698963e25500a2d1
|
File details
Details for the file profed-0.7.6-py3-none-any.whl.
File metadata
- Download URL: profed-0.7.6-py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.13 Linux/6.14.0-1017-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0b981e516ae37b902fc0b46e63d14f2e21c49589fd1dcfb48ea98852d7119032
|
|
| MD5 |
9c4578347881d8bde2d1fbdd3aa0e6c0
|
|
| BLAKE2b-256 |
13ae067aed12b4bfb211631e58e984476ce27626869db143af1273d4800d3f5a
|