Skip to main content

Perform numpy-like analysis on data that remains in someone elses server

Project description



Syft Logo

Data Science on data you are not allowed to see

PySyft enables a new way to do data science, where you can use non-public information, without seeing nor obtaining a copy of the data itself. All you need is to connect to a Datasite!

Datasites are like websites, but for data. Designed with the principles of structured transparency, they enable data owners to control how their data is protected and data scientists to use data without obtaining a copy.

PySyft supports any statistical analysis or machine learning, offering support for directly running Python code - even using third-party Python libraries.

Supported on:

✅ Linux ✅ macOS ✅ Windows ✅ Docker ✅ Kubernetes

Quickstart

Try out your first query against a live demo Datasite!

Install Client

pip install -U "syft[data_science]"

More instructions are available here.

Launch Server

Launch a development server directly in your Jupyter Notebook:

import syft as sy

sy.requires(">=0.9,<0.9.1")

server = sy.orchestra.launch(
    name="my-datasite",
    port=8080,
    create_producer=True,
    n_consumers=1,
    dev_mode=False,
    reset=True, # resets database
)

or from the command line:

$ syft launch --name=my-datasite --port=8080 --reset=True

Starting syft-datasite server on 0.0.0.0:8080

Datasite servers can be deployed as a single container using Docker or directly in Kubernetes. Check out our deployment guide.

Launch Client

Main way to use a Datasite is via our Syft client, in a Jupyter Notebook. Check out our PySyft client guide:

import syft as sy

sy.requires(">=0.9,<0.9.1")

datasite_client = sy.login(
    port=8080,
    email="info@openmined.org",
    password="changethis"
)

PySyft - Getting started 📝

Learn about PySyft via our getting started guide:

PySyft In-depth

📚 Check out our docs website.

Quick PySyft components links:

Why use PySyft?

In a variety of domains across society, data owners have valid concerns about the risks associated with sharing their data, such as legal risks, privacy invasion (misuing the data), or intellectual property (copying and redistributing it).

Datasites enable data scientists to answer questions without even seeing or acquiring a copy of the data, within the data owners's definition of acceptable use. We call this process Remote Data Science.

This means that the current risks of sharing information with someone will no longer prevent the vast benefits such as innovation, insights and scientific discovery. With each Datasite, data owners are able to enable 1000x more accesible data in each scientific field and lead, together with data scientists, breakthrough innovation.

Learn more about our work on our website.

Support

For questions about PySyft, reach out via #support on Slack.

Syft Versions

:exclamation: PySyft and Syft Server must use the same version.

Latest Stable

  • 0.9.0 (Stable) - Docs
  • Install PySyft (Stable): pip install -U syft

Latest Beta

  • 0.9.1 (Beta) - dev branch 👈🏽
  • Install PySyft (Beta): pip install -U syft --pre

Find more about previous releases here.

Community

Supported by the OpenMined Foundation, the OpenMined Community is an online network of over 17,000 technologists, researchers, and industry professionals keen to unlock 1000x more data in every scientific field and industry.

Courses

Contributors

OpenMined and Syft appreciates all contributors, if you would like to fix a bug or suggest a new feature, please reach out via Github or Slack!

Contributors

About OpenMined

OpenMined is a non-profit foundation creating technology infrastructure that helps researchers get answers from data without needing a copy or direct access. Our community of technologists is building Syft.

Supporters

License

Apache License 2.0
Person icons created by Freepik - Flaticon

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

syft-0.9.1b7.tar.gz (632.5 kB view details)

Uploaded Source

Built Distribution

syft-0.9.1b7-py2.py3-none-any.whl (731.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file syft-0.9.1b7.tar.gz.

File metadata

  • Download URL: syft-0.9.1b7.tar.gz
  • Upload date:
  • Size: 632.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for syft-0.9.1b7.tar.gz
Algorithm Hash digest
SHA256 dfd8a359fab1f31c7dee5564000f109dbe1a28a0355f2035f96beb2a89c36ec2
MD5 dacc8856eabe61411046627ff1d024d1
BLAKE2b-256 86506ae2d12617cd3f4a10ed5c9079ccc513720844b036b2609a7513d2779996

See more details on using hashes here.

File details

Details for the file syft-0.9.1b7-py2.py3-none-any.whl.

File metadata

  • Download URL: syft-0.9.1b7-py2.py3-none-any.whl
  • Upload date:
  • Size: 731.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for syft-0.9.1b7-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 7546f9ddd57bae7429f8afccb3fcd9a8871f727f408d6b60d90bc38c64a47f2b
MD5 650c0698fe9456f54d2e2e5f1d931574
BLAKE2b-256 02248caf3b5661e308fb5f6652cfb94f042ee3ca846731f5086a661be5c29e5f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page