Skip to main content

RayLEAF: a flexible, highly-scalable benchmark for federated learning

Project description

LEAF: A Benchmark for Federated Settings

Resources

Datasets

  1. FEMNIST
  • Overview: Image Dataset
  • Details: 62 different classes (10 digits, 26 lowercase, 26 uppercase), images are 28 by 28 pixels (with option to make them all 128 by 128 pixels), 3500 users
  • Task: Image Classification
  1. Sentiment140
  • Overview: Text Dataset of Tweets
  • Details 660120 users
  • Task: Sentiment Analysis
  1. Shakespeare
  • Overview: Text Dataset of Shakespeare Dialogues
  • Details: 1129 users (reduced to 660 with our choice of sequence length. See bug.)
  • Task: Next-Character Prediction
  1. Celeba
  1. Synthetic Dataset
  • Overview: We propose a process to generate synthetic, challenging federated datasets. The high-level goal is to create devices whose true models are device-dependant. To see a description of the whole generative process, please refer to the paper
  • Details: The user can customize the number of devices, the number of classes and the number of dimensions, among others
  • Task: Classification
  1. Reddit
  • Overview: We preprocess the Reddit data released by pushshift.io corresponding to December 2017.
  • Details: 1,660,820 users with a total of 56,587,343 comments.
  • Task: Next-word Prediction.

Notes

  • Install the libraries listed in requirements.txt
    • I.e. with pip: run pip3 install -r requirements.txt
  • Go to directory of respective dataset for instructions on generating data
    • in MacOS check if wget is installed and working
  • models directory contains instructions on running baseline reference implementations

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rayleaf-0.0.2.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rayleaf-0.0.2-py3-none-any.whl (36.9 kB view details)

Uploaded Python 3

File details

Details for the file rayleaf-0.0.2.tar.gz.

File metadata

  • Download URL: rayleaf-0.0.2.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for rayleaf-0.0.2.tar.gz
Algorithm Hash digest
SHA256 d0bc95f5edad39b46dba6e9e8c5d682bbae9cb81d14f4b85f057076038b9cd8a
MD5 71542a37c8f7dd28c1af58174cb039f7
BLAKE2b-256 cc5daa7aa7400473b6d3ebc203272c724775f446a24c087ae222d370e7ee0f5c

See more details on using hashes here.

File details

Details for the file rayleaf-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: rayleaf-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 36.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for rayleaf-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 28c397d770d93230c2f7cb5406c839e204d8bba6e4b4f6e4c45c68df177efe9a
MD5 efb292704b3b2886930bd57e35e23792
BLAKE2b-256 9b0cab6e953638c0cc3f7e9aae2f06ed733bce4a2a66792e7a191aa8292463b0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page