Skip to main content

Python Open-source package for simulating federated learning and differential privacy

Project description

MEDfl: A Collaborative Framework for Federated Learning in Medicine

Python Versions License: GPL v3 Build Status Contributors PyPI version Downloads


📚 Table of Contents

  1. Introduction
  2. Key Features
  3. Installation
  4. Modes of Operation
  5. Quick Start
  6. Documentation
  7. Contributing
  8. Acknowledgment
  9. Authors

1. Introduction

MEDfl is an open-source Federated Learning (FL) framework designed for both simulation and real-world distributed trainingin the medical and healthcare domains. It integrates Differential Privacy (DP), Transfer Learning (TL), and secure communication to enable privacy-preserving model training across multiple institutions—particularly suited for medical and clinical data.


2. Key Features

  • 🧩 Two Operation Modes

    • Simulation Mode: Run FL experiments locally for testing and benchmarking.
    • Real-World Mode: Connect remote clients for production-grade FL.
  • 🔒 Differential Privacy (Opacus Integration)
    Ensures client updates are mathematically protected against data leakage.

  • 🧠 Transfer Learning Integration
    Improve convergence and accuracy in small or heterogeneous datasets.

  • ⚙️ Modular Design
    Plug-and-play components for models, optimizers, datasets, and aggregation strategies.


3. Installation

pip install medfl

✅ Requires Python 3.9+.

If you prefer the development version:

git clone https://github.com/MEDomics-UdeS/MEDfl.git
cd MEDfl
pip install -e .

4. Modes of Operation

Mode Description Typical Use Case
Simulation FL Runs all clients locally in a controlled environment. Benchmarking, debugging, or prototyping.
Real-World FL Connects distributed client machines. Multi-institution collaboration, production deployments.

5. Quick Start

🌍 Real-World Federated Learning Example

Server Setup

from MEDfl.rw.server import FederatedServer, Strategy

custom_strategy = Strategy(
    name="FedAvg",
    fraction_fit=1,
    min_fit_clients=1,
    min_evaluate_clients=1,
    min_available_clients=1,
    local_epochs=1,
    threshold=0.5,
    learning_rate=0.01,
    optimizer_name="SGD",
    saveOnRounds=3,
    savingPath="./",
    total_rounds=10,
    datasetConfig={"isGlobal": True, "globalConfig": {"target": "label", "testFrac": 0.2}},
)

server = FederatedServer(
    host="0.0.0.0",
    port=8080,
    num_rounds=10,
    strategy=custom_strategy,
)
server.start()

Client Setup

from MEDfl.rw.client import FlowerClient, DPConfig

# Example: XGBoost client
xgb_params = {
    "objective": "binary:logistic",
    "eval_metric": "logloss",
    "eta": 0.1,
    "max_depth": 6,
    "subsample": 0.8,
    "colsample_bytree": 0.8,
    "tree_method": "hist",  # GPU: "gpu_hist"
}

client = FlowerClient(
    server_address="100.65.215.27:8080",
    data_path="../data/client1.csv",
    dp_config=None,            # DP only applies to neural networks
    model_type="xgb",
    xgb_params=xgb_params,
    xgb_rounds=10,
)
client.start()

💡 Tip:
Use Tailscale to connect clients and server under the same secure VPN for real-world deployments.


6. Documentation

You can generate and host the documentation locally with Sphinx:

cd docs
make clean && make html
cd _build/html
python -m http.server

7. Contributing

We welcome contributions of all kinds — from bug fixes to new modules.

  1. Fork the repo and create a feature branch.
  2. Run tests and format your code with black and flake8.
  3. Submit a Pull Request with clear details on your changes.

8. Acknowledgment

MEDfl is part of the MEDomicsLab initiative at the Université de Sherbrooke.
It was developed to enable secure, privacy-preserving, and reproducible machine learning across distributed medical datasets.


9. Authors


If you find this project useful, please consider starring it on GitHub to support continued development.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

medfl-2.0.5.dev6.tar.gz (60.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

medfl-2.0.5.dev6-py3-none-any.whl (67.3 kB view details)

Uploaded Python 3

File details

Details for the file medfl-2.0.5.dev6.tar.gz.

File metadata

  • Download URL: medfl-2.0.5.dev6.tar.gz
  • Upload date:
  • Size: 60.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for medfl-2.0.5.dev6.tar.gz
Algorithm Hash digest
SHA256 c34574f1ff18818be74ca2a247c385d8b61d6c57536dda7dfc0c1bcbb7df7ad4
MD5 8525d1de8921903e63d6e3c3e4386472
BLAKE2b-256 317c0e4042712f3ba6fef7d50c58624ddf3ef87d7161668b3d57673a27f54bab

See more details on using hashes here.

File details

Details for the file medfl-2.0.5.dev6-py3-none-any.whl.

File metadata

  • Download URL: medfl-2.0.5.dev6-py3-none-any.whl
  • Upload date:
  • Size: 67.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for medfl-2.0.5.dev6-py3-none-any.whl
Algorithm Hash digest
SHA256 072af667faf288eefba4612645a2186977770f44f267607f02aac92ab0d6e143
MD5 cb96a25a8738e3e1cec38b7c8892c318
BLAKE2b-256 e637674636d3cafa680364a398052a18e40bbf13a0c385c6571f16ac2c4de600

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page