Perform numpy-like analysis on data that remains in someone elses server
Project description
Perform numpy-like analysis on data that remains in someone else's server
Quickstart
✅ Linux ✅ macOS* ✅ Windows†‡
-
Install our handy 🛵 cli tool which makes deploying a Domain or Network server a one-liner:
pip install -U hagrid -
Then run our interactive jupyter Install 🧙🏽♂️ WizardBETA:
hagrid quickstart
-
In the tutorial you will learn how to install and deploy:
PySyft= ournumpy-like 🐍 Python library for computing onprivate datain someone else'sDomainPyGrid= our 🐳docker/k8s/ 🐧vmDomain&NetworkServers whereprivate datalives -
During quickstart we will deploy
PyGridto localhost with 🐳docker, however 🛵 HAGrid can deploy tok8sor a 🐧ubuntuVM onazure/gcp/ANY_IP_ADDRESSby using 🔨ansible†
Install Notes
- HAGrid Requires: 🐍
python🐙git- Run:pip install -U hagrid - Interactive Install 🧙🏽♂️ WizardBETA Requires 🛵
hagrid: - Run:hagrid quickstart
†Windowsdoes not supportansible, preventing some remote deployment targets - PySyft Requires: 🐍
python 3.8+- Run:pip install -U syft
*macOSApple Silicon users need cmake:brew install cmake
‡Windowsusers must run this first:pip install jaxlib==0.3.14 -f https://whls.blob.core.windows.net/unstable/index.html - PyGrid Requires: 🐳
docker/k8sor 🐧ubuntuVM - Run:hagrid launch ...
Versions
0.7.0 - Course 3 Updated
0.6.0 - Course 3
0.5.1 - Course 2 + M1 Hotfix
0.2.0 - 0.5.0 Deprecated
PySyft and PyGrid use the same version and its best to match them up where possible. We release weekly betas which can be used in each context:
PySyft: pip install -U syft --pre
PyGrid: hagrid launch ... tag=latest
HAGrid is a cli / deployment tool so the latest version of hagrid is usually the best.
What is Syft?
Syft is OpenMined's open source stack that provides secure and private Data Science in Python. Syft decouples private data from model training, using techniques like Federated Learning, Differential Privacy, and Encrypted Computation. This is done with a numpy-like interface and integration with Deep Learning frameworks, so that you as a Data Scientist can maintain your current workflow while using these new privacy-enhancing techniques.
Why should I use Syft?
Syft allows a Data Scientist to ask questions about a dataset and, within privacy limits set by the data owner, get answers to those questions, all without obtaining a copy of the data itself. We call this process Remote Data Science. It means in a wide variety of domains across society, the current risks of sharing information (copying data) with someone such as, privacy invasion, IP theft and blackmail will no longer prevent the vast benefits such as innovation, insights and scientific discovery which secure access will provide.
No more cold calls to get access to a dataset. No more weeks of wait times to get a result on your query. It also means 1000x more data in every domain. PySyft opens the doors to a streamlined Data Scientist workflow, all with the individual's privacy at its heart.
Tutorials
|
Data Owner |
Data Scientist |
Data Engineer |
|---|---|---|
|
|
|
Terminology
|
👨🏻💼 Data Owners |
👩🏽🔬 Data Scientists |
|---|---|
|
Provide |
Are end |
|
🏰 Domain Server |
🔗 Network Server |
|
Manages the |
Provides services to a group of |
Community
Courses
|
|
|
|
|---|
Contributors
OpenMined and Syft appreciates all contributors, if you would like to fix a bug or suggest a new feature, please see our guidelines.
Supporters
|
|
|
|
|
|
|
|
|
|
|---|
Open Collective
OpenMined is a fiscally sponsored 501(c)(3) in the USA. We are funded by our generous supporters on Open Collective.
Disclaimer
Syft is under active development and is not yet ready for pilots on private data without our assistance. As early access participants, please contact us via Slack or email if you would like to ask a question or have a use case that you would like to discuss.
License
Apache License 2.0
Person icons created by Freepik - Flaticon
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file syft-0.7.0.tar.gz.
File metadata
- Download URL: syft-0.7.0.tar.gz
- Upload date:
- Size: 7.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7ec293966ee6c648e41507564bc4cc693129cb22abc29ca58748a75e047a38fa
|
|
| MD5 |
5d9dfe411f1a1b5706ffcbbfddff1c24
|
|
| BLAKE2b-256 |
6cdceb3a8f50ceb2d287833d2729936f304910603fd0bea95d2027ca0a9d9a65
|
File details
Details for the file syft-0.7.0-py2.py3-none-any.whl.
File metadata
- Download URL: syft-0.7.0-py2.py3-none-any.whl
- Upload date:
- Size: 7.8 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
31bc19487ec61ead6e40e07a51368b442fe0aebffc20061c239e663791b92f88
|
|
| MD5 |
14cd3afa89ac0362ce0916275febd125
|
|
| BLAKE2b-256 |
7acd361d05b32fc53a24a46247d89ce9f3907516be4fbb05366c61325542bec0
|