aqueduct-sdk

Python SDK for the Aqueduct prediction infrastructure

These details have been verified by PyPI

Maintainers

andre.aqueducthq aqeunice aqueduct_engineering cgwu cwinddavid hsubbaraj jerome65 kenxu sauravchh vsreekanti

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Aqueduct: Prediction Delivery for Data Teams

Aqueduct is prediction delivery made simple. With Aqueduct, data scientists can instantaneously deploy machine learning models to the cloud, connect those models to data and business systems, and gain visibility into everything from inference latency to model accuracy -- all with Python.

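```python
from aqueduct import Client, op, metric, get_apikey

# Connect to a locally running Aqueduct server.
client = Client(get_apikey(), "localhost:8080")

@op
def transform_data(reviews):
    # Add a column holding the character length of each review.
    reviews['strlen'] = reviews['review'].str.len()
    return reviews

# Load the input table from the demo database.
demo_db = client.integration("aqueduct_demo")
reviews_table = demo_db.sql("select * from hotel_reviews;")

# Apply the operator and write the result back to the database.
strlen_table = transform_data(reviews_table)
strlen_table.save(demo_db.config(table="strlen_table", update_mode="replace"))

client.publish_flow(name="review_strlen", artifacts=[strlen_table])
```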

You can run the full Aqueduct server in a Google Colab notebook. Our examples directory has a few more detailed prediction pipelines.

Getting Started

To get started with Aqueduct:

Ensure that you meet the basic requirements.
Install the Aqueduct server and UI by running:
```
pip3 install aqueduct-ml
```
Launch both the server and the UI by running:
```
aqueduct start
```
Get your API Key by running:
```
aqueduct apikey
```

The core abstraction in Aqueduct is a Workflow, which is a sequence of Artifacts (data) that are transformed by Operators (compute). The input Artifacts for a Workflow are typically loaded from a database, and the output Artifacts are typically persisted back to a database. Each Workflow can either be run on a fixed schedule or triggered on demand.
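To make the Workflow, Artifact, and Operator relationship concrete, here is a toy model in plain Python. It is only a sketch of the idea and is not Aqueduct's API: the `Artifact` and `Operator` classes, the `run_workflow` helper, and the sample rows are all hypothetical names invented for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass
class Artifact:
    """Hypothetical stand-in for a piece of data in a workflow."""
    name: str
    data: Any


@dataclass
class Operator:
    """Hypothetical stand-in for a compute step applied to an Artifact."""
    name: str
    fn: Callable[[Any], Any]

    def __call__(self, artifact: Artifact) -> Artifact:
        # Produce a new Artifact rather than modifying the input in place.
        return Artifact(
            name=f"{self.name}({artifact.name})",
            data=self.fn(artifact.data),
        )


def run_workflow(source: Artifact, operators: List[Operator]) -> Artifact:
    """Pass the source Artifact through each Operator in order."""
    current = source
    for operator in operators:
        current = operator(current)
    return current


# Stand-in for rows that would normally be loaded from a database table.
reviews = Artifact(
    "hotel_reviews",
    [{"review": "Great stay"}, {"review": "Too noisy"}],
)

# Operator that adds the length of each review, mirroring the example above.
add_strlen = Operator(
    "add_strlen",
    lambda rows: [{**row, "strlen": len(row["review"])} for row in rows],
)

result = run_workflow(reviews, [add_strlen])
print(result.name)
print(result.data)
```

In a real deployment the final Artifact would be saved back to a database instead of printed, and the server would handle scheduling.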

Why Aqueduct?

The existing tools for deploying models are not designed with data scientists in mind -- they assume the user will casually build Docker containers, deploy Kubernetes clusters, and write thousands of lines of YAML to deploy a single model. Data scientists are by and large not interested in doing that, and there are better uses for their skills.

Aqueduct is designed for data scientists, with three core design principles in mind:

Simplicity: Data scientists should be able to deploy models with tools they're comfortable with and without having to learn how to use complex, low-level infrastructure systems.
Connectedness: Data science and machine learning can have the greatest impact when everyone in the business has access, and data scientists shouldn't have to bend over backwards to make this happen.
Confidence: Having the whole organization benefit from your work means that data scientists should be able to sleep peacefully, knowing that things are working as expected -- and they'll be alerted as soon as that changes.

What's next?

Interested in learning more? Check out our documentation.

If you have questions or comments or would like to learn more about what we're building, please reach out, join our Slack channel, or start a conversation on GitHub. We'd love to hear from you!

Project details

Release history

Version	Release date
0.3.6	Jun 7, 2023
0.3.5	May 31, 2023
0.3.4	May 24, 2023
0.3.3	May 17, 2023
0.3.2	May 10, 2023
0.3.1	May 4, 2023
0.2.12	Apr 26, 2023
0.2.11	Apr 18, 2023
0.2.10	Apr 11, 2023
0.2.9	Apr 5, 2023
0.2.8	Mar 29, 2023
0.2.7	Mar 23, 2023
0.2.6	Mar 14, 2023
0.2.5	Mar 7, 2023
0.2.4	Mar 1, 2023
0.2.3	Feb 22, 2023
0.2.2	Feb 15, 2023
0.2.1	Feb 8, 2023
0.2.0	Feb 1, 2023
0.1.11	Jan 24, 2023
0.1.10	Jan 17, 2023
0.1.9	Jan 11, 2023
0.1.8	Dec 21, 2022
0.1.7	Dec 14, 2022
0.1.6	Dec 14, 2022
0.1.5	Nov 30, 2022
0.1.4	Nov 15, 2022
0.1.3	Nov 8, 2022
0.1.2	Nov 1, 2022
0.1.1	Oct 26, 2022
0.1.0	Oct 18, 2022
0.0.16	Sep 26, 2022
0.0.15	Sep 21, 2022
0.0.14	Sep 12, 2022
0.0.13	Sep 7, 2022
0.0.12	Aug 25, 2022
0.0.11	Aug 23, 2022
0.0.10	Aug 23, 2022
0.0.9	Aug 16, 2022
0.0.8	Aug 9, 2022
0.0.7	Aug 1, 2022
0.0.6	Jul 25, 2022
0.0.5 (this version)	Jul 14, 2022
0.0.4	Jul 8, 2022
0.0.3	Jun 22, 2022
0.0.2	Jun 9, 2022
0.0.1	May 26, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aqueduct-sdk-0.0.5.tar.gz (59.8 kB)

Uploaded Jul 14, 2022 Source

Built Distribution

aqueduct_sdk-0.0.5-py3-none-any.whl (73.7 kB)

Uploaded Jul 14, 2022 Python 3

Hashes for aqueduct-sdk-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`32bafcbb254011ff3947bd152efa59405f240b6eed202a9604cddc7bd1651146`
MD5	`8f763b4d0566974730a08f584764039a`
BLAKE2b-256	`d4f49cb32f060a0a4461ba728449e25148b4c48effdc293a01df3e3efa357546`

Hashes for aqueduct_sdk-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7b4f08d05b4304282dd20d206be929931abca5706174e3310e9f757c6cf95cee`
MD5	`5b490a85020cca4573395c2e2f183d8f`
BLAKE2b-256	`bd54cbfa96bd44f4a200337024512b734e3ec4a9e1a8ee083cde26c55bde033d`
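
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` and the local file path are assumptions for illustration, and the expected value is the SHA256 digest listed above for the source distribution.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for aqueduct-sdk-0.0.5.tar.gz.
EXPECTED_SHA256 = "32bafcbb254011ff3947bd152efa59405f240b6eed202a9604cddc7bd1651146"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the archive was saved.
    archive = Path("aqueduct-sdk-0.0.5.tar.gz")
    if archive.exists():
        print("OK" if matches_published_hash(archive) else "MISMATCH")
```

To check the wheel instead, pass its SHA256 digest from the second table as `expected`.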