Skip to main content

Secure Data Exploratory Analyses on Vertically Partitioned Data

Project description

TNO MPC Lab - MPyC - Statistics

The TNO MPC lab consists of generic software components, procedures, and functionalities developed and maintained on a regular basis to facilitate and aid in the development of MPC solutions. The lab is a cross-project initiative allowing us to integrate and reuse previously developed MPC functionalities to boost the development of new protocols and solutions.

The package tno.mpc.mpyc.statistics is part of the TNO Python Toolbox.

Within the LANCELOT project, a collaboration between TNO, IKNL and Janssen, TNO developed and implemented secure statistics. LANCELOT is partly funded by PPS-surcharge for Research and Innovation of the Dutch Ministry of Economic Affairs and Climate Policy. The Appl.AI project SELECTED, partly funded by NLAIC, also contributed to specific components in secure statistics (correlation, covariance).

Limitations in (end-)use: the content of this software package may solely be used for applications that comply with international export control laws.
This implementation of cryptographic software has not been audited. Use at your own risk.

Documentation

Documentation of the tno.mpc.mpyc.statistics package can be found here.

Install

Easily install the tno.mpc.mpyc.statistics package using pip:

$ python -m pip install tno.mpc.mpyc.statistics

If you wish to run the tests you can use:

$ python -m pip install 'tno.mpc.mpyc.statistics[tests]'

Note:

A significant performance improvement can be achieved by installing the GMPY2 library.

$ python -m pip install 'tno.mpc.mpyc.statistics[gmpy]'

Usage

The statistics module can be used as follows:

import numpy as np
from mpyc.runtime import mpc
from tno.mpc.mpyc.statistics import covariance


secnum = mpc.SecFxp(l=64, f=32)


def get_mpc_data(row_1, row_2):
    row_1_mpc = [secnum(x) for x in row_1]
    row_2_mpc = [secnum(y) for y in row_2]
    return row_1_mpc, row_2_mpc


def distribute_data_over_players(row_1_mpc, row_2_mpc):
    row_1_mpc_shared = mpc.input(row_1_mpc, senders=0)
    row_2_mpc_shared = mpc.input(row_2_mpc, senders=0)
    return row_1_mpc_shared, row_2_mpc_shared


async def covariance_example():
    print("Covariance example")

    row_1 = [1.0, 3.0, 2.0, 1.0, 5.0, 6.0, 3.0]
    row_2 = [2.0, 11.0, 9.0, 0.0, 8.0, 2.0, 2.1]

    row_1_np = np.array(row_1)
    row_2_np = np.array(row_2)

    row_1_mpc, row_2_mpc = get_mpc_data(row_1_np, row_2_np)

    async with mpc:
        row_1_mpc_shared, row_2_mpc_shared = distribute_data_over_players(
            row_1_mpc, row_2_mpc
        )

    secure_cov = covariance(row_1_mpc_shared, row_2_mpc_shared)
    revealed_cov = await mpc.output(secure_cov)

    np_cov = np.cov(row_1, row_2)[0][1]

    print("Secure Covariance: ", revealed_cov)
    print("Numpy Covariance:", np_cov)


if __name__ == "__main__":
    mpc.run(covariance_example())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

tno.mpc.mpyc.statistics-0.1.1-py3-none-any.whl (18.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page