Skip to main content

A benchmark for proximity-based non-IID Federated Learning

Project description

ProFed: A Benchmark for Proximity-based Federated Learning

🔗 ProFed: A Benchmark for Proximity‑based Non‑IID Federated Learning

ProFed is a framework for evaluating federated learning (FL) systems under realistic, geographically clustered, non‑IID data scenarios. It simulates clients grouped into regions such that data is IID within regions but non‑IID across regions.


🚀 Features

  • Built‑in datasets: Support for MNIST, FashionMNIST, CIFAR‑10, CIFAR‑100 and UTKFace via PyTorch/TorchVision.
  • Flexible partitioning: Implements Dirichlet-based splits, hard label skews, and can model arbitrary proximity-driven distribution skews.
  • Customizable proximity modeling: Define how many geographic clusters (regions) to simulate and control skew intensity (e.g., Dirichlet α).

🔧 Getting Started

Prerequisites

  • Python ≥ 3.12

Installation

ProFed is publicly released on PyPi, to install ProFed on your machine:

pip install ProFed

API Explanation

1. Downloading and importing the dataset

train_data, test_data = download_dataset('EMNIST')

2. Splitting into train & validation sets

train_data, validation_data = split_train_validation(train_data, 0.8)

3. Partitioning into geographic “regions” (i.i.d. internally)

environment = partition_to_subregions(
    train_data,
    validation_data,
    partitioning_method = 'Hard',
    number_subregions = 5,
    seed = 42
)
  • method: partition strategy ('Hard', 'Dirichlet', or 'IID')

  • number_subregions: how many simulated geographic clusters

  • Returns an Environment object. Each region within it contains IID data internally, but non-IID across regions.

1. Distributing region data across devices

mapping = {}
for region_id, devices in mapping_devices_area.items():
    mapping_devices_data = environment.from_subregion_to_devices(
        region_id,
        len(devices)
    )
    for device_index, data in mapping_devices_data.items():
        device_id = devices[device_index]
        mapping[device_id] = data

Splits the region’s IID data equally among its devices, assigning each a local subset.

The result is a mapping:

device_id  local_dataset

✍🏻 Examples of usage

  1. Proximity-based Self-Federated Learning
  2. SParSeFuL
  3. Baselines implementation
  4. ProFed Startup Experiemnts

📄 License

MIT License — feel free to freely use, modify, and distribute.

📬 Contact

For questions, issues, or contributions, feel free to reach out:

You can also open an issue or submit a pull request on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

profed-0.7.4.tar.gz (6.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

profed-0.7.4-py3-none-any.whl (7.8 kB view details)

Uploaded Python 3

File details

Details for the file profed-0.7.4.tar.gz.

File metadata

  • Download URL: profed-0.7.4.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.12 Linux/6.11.0-1018-azure

File hashes

Hashes for profed-0.7.4.tar.gz
Algorithm Hash digest
SHA256 5e4611864f19330e811882fe1f0315b6b50939ac7728cdf43deeba546856d518
MD5 9962f2f6f7fa71c4ac0db22b88ce6868
BLAKE2b-256 4cd1ebf6a66ed76c0d50bffd9ffc21fef8b782e8d35d4421fa28da00306a9fc9

See more details on using hashes here.

File details

Details for the file profed-0.7.4-py3-none-any.whl.

File metadata

  • Download URL: profed-0.7.4-py3-none-any.whl
  • Upload date:
  • Size: 7.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.12 Linux/6.11.0-1018-azure

File hashes

Hashes for profed-0.7.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c0ff76796c6ffd0e30940b9927c16a8038fe4f789dc862a2b4f086b704bea655
MD5 6990d1d6009d1469eb55af2f5f78c689
BLAKE2b-256 79c113656c38f36da45bccc274557545e80d70f1932247b45d5e5539ed5a8055

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page