aiSSEMBLE™ Model Training API

This module contains the implementation and baseline Docker image for the aiSSEMBLE model training service. This service allows you to create model training jobs, list jobs, retrieve job logs, and kill jobs.

Model Training API

POST /training-jobs?pipeline=PIPELINE_NAME

  • Request body contains all key/value pairs required for model training, such as model hyperparameters
  • Functionality:
    • Spawns the appropriate model training Kubernetes job
      • Checks that a model training image exists with the naming convention "model-training-PIPELINE_NAME"
        • Returns an error if the image is not present
      • Job naming convention: "model-training-PIPELINE_NAME-RANDOM_UUID"
    • Passes the user-provided parameters to the job
  • Returns the model training job name
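
As a client-side illustration of the route above, the following minimal sketch builds (without sending) a job-creation request using only the Python standard library. The base URL, the example pipeline name, the hyperparameter names, and the use of JSON for the request body are all assumptions made for illustration; the description above only states that the body carries key/value pairs.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical address; substitute wherever the service is exposed in your cluster.
BASE_URL = "http://localhost:8080"


def build_create_job_request(pipeline_name: str, params: dict) -> urllib.request.Request:
    """Build (but do not send) a POST /training-jobs?pipeline=PIPELINE_NAME request."""
    query = urllib.parse.urlencode({"pipeline": pipeline_name})
    # Assumption: the key/value pairs are sent as a JSON object.
    body = json.dumps(params).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/training-jobs?{query}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Example values are hypothetical.
request = build_create_job_request("demo-pipeline", {"learning_rate": 0.01, "epochs": 5})
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would send it; per the description above, the response carries the new job's name, though its exact format is not specified here.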

GET /training-jobs/TRAINING_JOB_NAME

  • Returns the logs from the pod running the model training job, or an error if the job doesn't exist

GET /training-jobs

  • Returns a list of all model training jobs (active, failed, and completed) with their statuses
  • Selects these jobs by filtering all jobs in the cluster on the reserved job name prefix "model-training"

GET /training-jobs?pipeline=PIPELINE_NAME

  • Returns a list of all model training jobs (active, failed, and completed) with their statuses for the given pipeline
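
The prefix-based filtering described for the two listing routes can be sketched as follows. This is not the service's actual implementation: the helper names are hypothetical, and the pipeline lookup assumes the random suffix is a canonical 36-character UUID string, which the naming convention suggests but does not guarantee.

```python
JOB_PREFIX = "model-training"  # reserved job name prefix from the description above
UUID_LENGTH = 36               # assumption: suffix is a canonical UUID string


def is_training_job(job_name: str) -> bool:
    """True if the job name starts with the reserved prefix."""
    return job_name.startswith(f"{JOB_PREFIX}-")


def pipeline_of(job_name: str) -> str:
    """Recover PIPELINE_NAME from "model-training-PIPELINE_NAME-RANDOM_UUID"."""
    start = len(JOB_PREFIX) + 1  # skip "model-training-"
    end = -(UUID_LENGTH + 1)     # drop "-RANDOM_UUID"
    return job_name[start:end]


def filter_jobs(job_names, pipeline_name=None):
    """Keep training jobs, optionally restricted to a single pipeline."""
    jobs = [name for name in job_names if is_training_job(name)]
    if pipeline_name is not None:
        jobs = [name for name in jobs if pipeline_of(name) == pipeline_name]
    return jobs


# Hypothetical job names for illustration.
names = [
    "model-training-churn-123e4567-e89b-12d3-a456-426614174000",
    "model-training-churn-v2-123e4567-e89b-12d3-a456-426614174001",
    "data-ingest-job",
]
all_training = filter_jobs(names)
churn_only = filter_jobs(names, "churn")
```

Stripping the UUID suffix before comparing, rather than matching on the prefix "model-training-churn-", keeps a pipeline named "churn" from also matching jobs that belong to "churn-v2".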

DELETE /training-jobs/TRAINING_JOB_NAME

  • Deletes the specified Kubernetes job
  • Returns an error if the job does not exist
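
The two job-specific routes (log retrieval and deletion) differ only in HTTP method, so a client can build both from one helper. The sketch below is a hedged example using the standard library; the base URL, the helper name, and the example job name are hypothetical.

```python
import urllib.parse
import urllib.request

# Hypothetical address; substitute wherever the service is exposed in your cluster.
BASE_URL = "http://localhost:8080"


def build_job_request(job_name: str, method: str) -> urllib.request.Request:
    """Build (but do not send) a GET or DELETE /training-jobs/TRAINING_JOB_NAME request."""
    if method not in ("GET", "DELETE"):
        raise ValueError(f"unsupported method: {method}")
    # Percent-encode the job name so it is safe to use as a single path segment.
    path = urllib.parse.quote(job_name, safe="")
    return urllib.request.Request(f"{BASE_URL}/training-jobs/{path}", method=method)


# Hypothetical job name following the documented naming convention.
job = "model-training-churn-123e4567-e89b-12d3-a456-426614174000"
logs_request = build_job_request(job, "GET")       # fetch pod logs
delete_request = build_job_request(job, "DELETE")  # delete the Kubernetes job
```

When such a request is sent with `urllib.request.urlopen`, an error response (for example, for a job that does not exist) surfaces as `urllib.error.HTTPError`; the specific status code the service returns is not stated in this description.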

Remaining Items

  • Ensure appropriate Kubernetes RBAC config in Helm charts
  • Deploy model training API in downstream projects with ML training step(s)
  • In downstream projects, ensure model training image is generated into "model-training-PIPELINE_NAME"
  • In downstream projects, ensure embeddings deployment name is "PIPELINE_NAME-STEP_NAME"
  • Configure permissions/implement PDP authorization for each API route
