Skip to main content

A benchmark for proximity-based non-IID Federated Learning

Project description

ProFed: A Benchmark for Proximity-based Federated Learning

🔗 ProFed: A Benchmark for Proximity‑based Non‑IID Federated Learning

ProFed is a framework for evaluating federated learning (FL) systems under realistic, geographically clustered, non‑IID data scenarios. It simulates clients grouped into regions such that data is IID within regions but non‑IID across regions.


🚀 Features

  • Built‑in datasets: Support for MNIST, FashionMNIST, CIFAR‑10, CIFAR‑100 and UTKFace via PyTorch/TorchVision.
  • Flexible partitioning: Implements Dirichlet-based splits, hard label skews, and can model arbitrary proximity-driven distribution skews.
  • Customizable proximity modeling: Define how many geographic clusters (regions) to simulate and control skew intensity (e.g., Dirichlet α).

🔧 Getting Started

Prerequisites

  • Python ≥ 3.12

Installation

ProFed is publicly released on PyPi, to install ProFed on your machine:

pip install ProFed

API Explanation

1. Downloading and importing the dataset

train_data, test_data = download_dataset('EMNIST')

2. Splitting into train & validation sets

train_data, validation_data = split_train_validation(train_data, 0.8)

3. Partitioning into geographic “regions” (i.i.d. internally)

environment = partition_to_subregions(
    train_data,
    validation_data,
    partitioning_method = 'Hard',
    number_subregions = 5,
    seed = 42
)
  • method: partition strategy ('Hard', 'Dirichlet', or 'IID')

  • number_subregions: how many simulated geographic clusters

  • Returns an Environment object. Each region within it contains IID data internally, but non-IID across regions.

1. Distributing region data across devices

mapping = {}
for region_id, devices in mapping_devices_area.items():
    mapping_devices_data = environment.from_subregion_to_devices(
        region_id,
        len(devices)
    )
    for device_index, data in mapping_devices_data.items():
        device_id = devices[device_index]
        mapping[device_id] = data

Splits the region’s IID data equally among its devices, assigning each a local subset.

The result is a mapping:

device_id  local_dataset

📄 License

MIT License — feel free to freely use, modify, and distribute.

📬 Contact

For questions, issues, or contributions, feel free to reach out:

You can also open an issue or submit a pull request on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

profed-0.7.3.tar.gz (6.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

profed-0.7.3-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file profed-0.7.3.tar.gz.

File metadata

  • Download URL: profed-0.7.3.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.11 Linux/6.11.0-1018-azure

File hashes

Hashes for profed-0.7.3.tar.gz
Algorithm Hash digest
SHA256 b1cd14098b01dcb2e94e82e4db264c42d4301d8db53df7470714fe97f85a036b
MD5 04f379af0542018ce26b61aa38238a9c
BLAKE2b-256 28db768a97d30b7ac48e6e351d339cbd5609e0045e2f81395bd7eed4af273396

See more details on using hashes here.

File details

Details for the file profed-0.7.3-py3-none-any.whl.

File metadata

  • Download URL: profed-0.7.3-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.11 Linux/6.11.0-1018-azure

File hashes

Hashes for profed-0.7.3-py3-none-any.whl
Algorithm Hash digest
SHA256 d2713d7171e665637926ade45b5a6065dc37439d3acfe1664b8aa8f3f5dfa975
MD5 3e94c0b37877b7c7286ece677b240b55
BLAKE2b-256 f65eb1c544c4c0911cbbc6f016592abeb5521bc11d06237e8638e9e1a0932324

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page