Structured random rotation (srr) based compression tools

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Structured Random Rotation-based Compression (srrcomp)

srrcomp offers compression techniques grounded in structured random rotation, with strong theoretical guarantees, as detailed in the following publications:

Shay Vargaftik, Ran Ben-Basat, Amit Portnoy, Gal Mendelson, Yaniv Ben-Itzhak, and Michael Mitzenmacher. "DRIVE: One-bit Distributed Mean Estimation." Advances in Neural Information Processing Systems 34 (2021): 362-377.
Shay Vargaftik, Ran Ben Basat, Amit Portnoy, Gal Mendelson, Yaniv Ben Itzhak, and Michael Mitzenmacher. "EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Learning." In International Conference on Machine Learning, pp. 21984-22014. PMLR, 2022.

Also, see the following blog for a high-level overview: "Pushing the Limits of Network Efficiency for Federated Machine Learning"

In particular, srrcomp can be used for:

Fast and efficient lossy compression.
Unbiased estimates.
Distributed mean estimation.
Compressing gradient updates in distributed and federated learning.

The implementation is torch-based and thus supports CPU and GPU.

Compression and decompression operations are carried out on the device where the associated vector is located.

srrcomp currently contains the implementation of EDEN.

CUDA acceleration

srrcomp offers some functions in CUDA for faster execution (up to an order of magnitude). This acceleration requires local compilation with nvcc/torch/python compatible versions.

The 'gpuacctype' argument, which specifies the GPU acceleration type, defaults to 'cuda' but can be changed to 'torch' to utilize the torch-based implementation.

The torch-based implementation is utilized when CUDA acceleration is unavailable, such as when working with CPU-based vectors or when local CUDA compilation hasn't been performed.

Pre-requisites

torch

numpy

[Optional] nvcc for compiling the aforementioned CUDA functions for faster execution

Installation

Install from pip

Linux: $ pip install srrcomp

Windows: $ pip install srrcomp --extra-index-url https://download.pytorch.org/whl/ --no-cache

If the message "Faster CUDA implementation for Hadamard and bit packing is not available. Using torch implementation instead." appears when importing srrcomp on a GPU machine, try installing srrcomp from source.

Install from source

For Windows and Ubuntu versions earlier than 22.04, download source from the official repository and run $ python setup.py install

For Ubuntu 22.04 use build and pip and other standards-based tools to build and install from source.

Testing

Basic

Execute from \tests folder: $ python basic_test.py

dim, bits, and seed variables can be modified within the script.

Distributed Mean Estimation (DME)

Execute from \tests folder: $ python dme_test.py

Use $ python dme_test.py -h to get the test options

The team

Shay Vargaftik (VMware Research), shayv@vmware.com

Yaniv Ben-Itzhak (VMware Research), ybenitzhak@vmware.com

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.3

Sep 14, 2023

0.1.2

May 24, 2023

0.1.1

Dec 26, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

srrcomp-0.1.3.tar.gz (13.2 kB view hashes)

Uploaded Sep 14, 2023 Source

Hashes for srrcomp-0.1.3.tar.gz

Hashes for srrcomp-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`8cc5c71bc805525ff0b2cc256514ac7118fdf27cb50565491df105f35752f947`
MD5	`149a67e602a7a3e2149716f3c4dd753f`
BLAKE2b-256	`9071cc8a67f5ea4941fcac95c7dd9694acb552b78e640508d968540a64e4a4ec`