Secure Data Exploratory Analyses on Vertically Partitioned Data
Project description
TNO MPC Lab - MPyC - Statistics
The TNO MPC lab consists of generic software components, procedures, and functionalities developed and maintained on a regular basis to facilitate and aid in the development of MPC solutions. The lab is a cross-project initiative allowing us to integrate and reuse previously developed MPC functionalities to boost the development of new protocols and solutions.
The package tno.mpc.mpyc.statistics is part of the TNO Python Toolbox.
Within the LANCELOT project, a collaboration between TNO, IKNL and Janssen, TNO developed and implemented secure statistics. LANCELOT is partly funded by PPS-surcharge for Research and Innovation of the Dutch Ministry of Economic Affairs and Climate Policy. The Appl.AI project SELECTED, partly funded by NLAIC, also contributed to specific components in secure statistics (correlation, covariance).
Limitations in (end-)use: the content of this software package may solely be used for applications that comply with international export control laws.
This implementation of cryptographic software has not been audited. Use at your own risk.
Documentation
Documentation of the tno.mpc.mpyc.statistics package can be found here.
Install
Easily install the tno.mpc.mpyc.statistics package using pip:
$ python -m pip install tno.mpc.mpyc.statistics
If you wish to run the tests you can use:
$ python -m pip install 'tno.mpc.mpyc.statistics[tests]'
Note:
A significant performance improvement can be achieved by installing the GMPY2 library.
$ python -m pip install 'tno.mpc.mpyc.statistics[gmpy]'
Usage
The statistics module can be used as follows:
import numpy as np
from mpyc.runtime import mpc
from tno.mpc.mpyc.statistics import covariance
secnum = mpc.SecFxp(l=64, f=32)
def get_mpc_data(row_1, row_2):
row_1_mpc = [secnum(x) for x in row_1]
row_2_mpc = [secnum(y) for y in row_2]
return row_1_mpc, row_2_mpc
def distribute_data_over_players(row_1_mpc, row_2_mpc):
row_1_mpc_shared = mpc.input(row_1_mpc, senders=0)
row_2_mpc_shared = mpc.input(row_2_mpc, senders=0)
return row_1_mpc_shared, row_2_mpc_shared
async def covariance_example():
print("Covariance example")
row_1 = [1.0, 3.0, 2.0, 1.0, 5.0, 6.0, 3.0]
row_2 = [2.0, 11.0, 9.0, 0.0, 8.0, 2.0, 2.1]
row_1_np = np.array(row_1)
row_2_np = np.array(row_2)
row_1_mpc, row_2_mpc = get_mpc_data(row_1_np, row_2_np)
async with mpc:
row_1_mpc_shared, row_2_mpc_shared = distribute_data_over_players(
row_1_mpc, row_2_mpc
)
secure_cov = covariance(row_1_mpc_shared, row_2_mpc_shared)
revealed_cov = await mpc.output(secure_cov)
np_cov = np.cov(row_1, row_2)[0][1]
print("Secure Covariance: ", revealed_cov)
print("Numpy Covariance:", np_cov)
if __name__ == "__main__":
mpc.run(covariance_example())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for tno.mpc.mpyc.statistics-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c7db34d9fbebdc182e6d0754dcc038ab78740d31e3ab9761b0f582cd2534d54 |
|
MD5 | 3404c636ede2afbae3d8cb355edb90f4 |
|
BLAKE2b-256 | 5dcd60a57e1098e431cb5411d29b305291e9501595e615f2251133dfe3a87ae0 |