Flower Datasets

These details have not been verified by PyPI

Project links

Project description

Flower Datasets

Build Downloads

Flower Datasets (flwr-datasets) is a library to quickly and easily create datasets for federated learning, federated evaluation, and federated analytics. It was created by the Flower Labs team that also created Flower: A Friendly Federated AI Framework.

[!TIP] For complete documentation that includes API docs, how-to guides and tutorials, please visit the Flower Datasets Documentation and for full FL example see the Flower Examples page.

Installation

For a complete installation guide visit the Flower Datasets Documentation

pip install flwr-datasets[vision]

Overview

Flower Datasets library supports:

downloading datasets - choose the dataset from Hugging Face's datasets,
partitioning datasets - customize the partitioning scheme,
creating centralized datasets - leave parts of the dataset unpartitioned (e.g. for centralized evaluation).

Thanks to using Hugging Face's datasets used under the hood, Flower Datasets integrates with the following popular formats/frameworks:

Hugging Face,
PyTorch,
TensorFlow,
Numpy,
Pandas,
Jax,
Arrow.

Create custom partitioning schemes or choose from the implemented partitioning schemes:

Partitioner (the abstract base class) Partitioner
IID partitioning IidPartitioner(num_partitions)
Dirichlet partitioning DirichletPartitioner(num_partitions, partition_by, alpha)
Distribution partitioning DistributionPartitioner(distribution_array, num_partitions, num_unique_labels_per_partition, partition_by, preassigned_num_samples_per_label, rescale)
InnerDirichlet partitioning InnerDirichletPartitioner(partition_sizes, partition_by, alpha)
Pathological partitioning PathologicalPartitioner(num_partitions, partition_by, num_classes_per_partition, class_assignment_mode)
Natural ID partitioning NaturalIdPartitioner(partition_by)
Size based partitioning (the abstract base class for the partitioners dictating the division based the number of samples) SizePartitioner
Linear partitioning LinearPartitioner(num_partitions)
Square partitioning SquarePartitioner(num_partitions)
Exponential partitioning ExponentialPartitioner(num_partitions)
more to come in the future releases (contributions are welcome).

Comparison of Partitioning Schemes on CIFAR10

PS: This plot was generated using a library function (see flwr_datasets.visualization package for more).

Usage

Flower Datasets exposes the FederatedDataset abstraction to represent the dataset needed for federated learning/evaluation/analytics. It has two powerful methods that let you handle the dataset preprocessing: load_partition(partition_id, split) and load_split(split).

Here's a basic quickstart example of how to partition the MNIST dataset:

from flwr_datasets import FederatedDataset
from flwr_datasets.partitioners import IidPartitioner

# The train split of the MNIST dataset will be partitioned into 100 partitions
partitioner = IidPartitioner(num_partitions=100)
fds = FederatedDataset("ylecun/mnist", partitioners={"train": partitioner})

partition = fds.load_partition(0)

centralized_data = fds.load_split("test")

For more details, please refer to the specific how-to guides or tutorials. They showcase customization and more advanced features.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.6.0

Feb 9, 2026

0.5.0

Dec 20, 2024

0.4.0

Oct 22, 2024

0.3.0

Jul 24, 2024

0.2.0

Jun 24, 2024

0.1.0

Mar 14, 2024

0.0.2

Nov 15, 2023

0.0.1

Sep 22, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flwr_datasets-0.6.0.tar.gz (54.2 kB view details)

Uploaded Feb 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

flwr_datasets-0.6.0-py3-none-any.whl (93.4 kB view details)

Uploaded Feb 9, 2026 Python 3

File details

Details for the file flwr_datasets-0.6.0.tar.gz.

File metadata

Download URL: flwr_datasets-0.6.0.tar.gz
Upload date: Feb 9, 2026
Size: 54.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.3 CPython/3.10.19 Darwin/25.2.0

File hashes

Hashes for flwr_datasets-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`b7a5953fed49f54342286bcd98af6e95bc4dcd6aff57500589b62e837c20c0fa`
MD5	`c9a0293baf569377b9a5e648ece199a4`
BLAKE2b-256	`30fc41b31a2cd57ca8bfa813b029e06a9e157b02d870ba14ea1ef4a12b28bd00`

See more details on using hashes here.

File details

Details for the file flwr_datasets-0.6.0-py3-none-any.whl.

File metadata

Download URL: flwr_datasets-0.6.0-py3-none-any.whl
Upload date: Feb 9, 2026
Size: 93.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.3 CPython/3.10.19 Darwin/25.2.0

File hashes

Hashes for flwr_datasets-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`69d36c58ed0ccb9f90443827e2a840aa038b3f528e7315a20bca892dc27d3ae7`
MD5	`df92c8787d477b29deac38580c889bba`
BLAKE2b-256	`32da8cdd9799f48617ff6c3307d259f6bbca9aec68d70f61cd20c090756050b7`

See more details on using hashes here.

flwr-datasets 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Flower Datasets

Installation

Overview

Usage

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes