Skip to main content

SageWorks: A Python WorkBench for creating and deploying AWS SageMaker Models

Project description

SageWorksTM

DataSource_EDA

SageWorks: The scientist's workbench powered by AWS® for scalability, flexibility, and security.

SageWorks is a medium granularity framework that manages and aggregates AWS® Services into classes and concepts. When you use SageWorks you think about DataSources, FeatureSets, Models, and Endpoints. Underneath the hood those classes handle all the details around updating and managing a complex set of AWS Services. All the power and none of the pain so that your team can Do Science Faster!

Full SageWorks OverView

SageWorks Architected FrameWork

Why SageWorks?

  • The AWS SageMaker® ecosystem is awesome but has a large number of services with significant complexity
  • SageWorks provides rapid prototyping through easy to use classes and transforms
  • SageWorks provides visibility and transparency into AWS SageMaker® Pipelines
    • What S3 data sources are getting pulled?
    • What Features Store/Group is the Model Using?
    • What's the Provenance of a Model in Model Registry?
    • What SageMaker Endpoints are associated with this model?

Single Pane of Glass

Visibility into the AWS Services that underpin the SageWorks Classes. We can see that SageWorks automatically tags and tracks the inputs of all artifacts providing 'data provenance' for all steps in the AWS modeling pipeline.

Top Dashboard

Clearly illustrated: SageWorks provides intuitive and transparent visibility into the full pipeline of your AWS Sagemaker Deployments.

Getting Started

SageWorks Zen

  • The AWS SageMaker® set of services is vast and complex.
  • SageWorks Classes encapsulate, organize, and manage sets of AWS® Services.
  • Heavy transforms typically use AWS Athena or Apache Spark (AWS Glue/EMR Serverless).
  • Light transforms will typically use Pandas.
  • Heavy and Light transforms both update AWS Artifacts (collections of AWS Services).
  • Quick prototypes are typically built with the light path and then flipped to the heavy path as the system matures and usage grows.

Classes and Concepts

The SageWorks Classes are organized to work in concert with AWS Services. For more details on the current classes and class hierarchies see SageWorks Classes and Concepts.

Contributions

If you'd like to contribute to the SageWorks project, you're more than welcome. All contributions will fall under the existing project license. If you are interested in contributing or have questions please feel free to contact us at sageworks@supercowpowers.com.

SageWorks Alpha Testers Wanted

Our experienced team can provide development and consulting services to help you effectively use Amazon’s Machine Learning services within your organization.

The popularity of cloud based Machine Learning services is booming. The problem many companies face is how that capability gets effectively used and harnessed to drive real business decisions and provide concrete value for their organization.

Using SageWorks will minimize the time and manpower needed to incorporate AWS ML into your organization. If your company would like to be a SageWorks Alpha Tester, contact us at sageworks@supercowpowers.com.

® Amazon Web Services, AWS, the Powered by AWS logo, are trademarks of Amazon.com, Inc. or its affiliates.

Readme change

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sageworks-0.2.15.tar.gz (3.8 MB view hashes)

Uploaded Source

Built Distribution

sageworks-0.2.15-py2.py3-none-any.whl (183.6 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page