Skip to main content

No project description provided

Project description

aiSSEMBLE™ Model Training API

PyPI PyPI - Python Version PyPI - Wheel

This module contains the implementation and baseline Docker image for the aiSSEMBLE model training service. This service allows you to create model training jobs, list jobs, retrieve job logs, and kill jobs.

Model Training API

POST /training-jobs?pipeline=PIPELINE_NAME

  • Request body contains all key/value pairs required for model training, such as model hyperparameters
  • Functionality:
    • Spawns appropriate model training Kubernetes job
      • Checks for existence of model training image with naming convention: "model-training-PIPELINE_NAME"
        • Returns error if not present
      • Job naming convention: "model-training-PIPELINE_NAME-RANDOM_UUID"
    • Passes in user-provided parameters
  • Returns model training job name

GET /training-jobs/TRAINING_JOB_NAME

  • Returns logs from pod running model training job or error if job doesn't exist

GET /training-jobs

  • Returns list of all model training jobs (active, failed, and completed) and statuses
  • Filters all jobs in cluster by reserved job name prefix "model-training"

GET /training-jobs?pipeline=PIPELINE_NAME

  • Returns list of all model training jobs (active, failed, and completed) and statuses for a given pipeline

DELETE /training-jobs/TRAINING_JOB_NAME

  • Deletes specified Kubernetes job
  • Returns error if job does not exist

Remaining Items

  • Ensure appropriate Kubernetes RBAC config in Helm charts
  • Deploy model training API in downstream projects with ML training step(s)
  • In downstream projects, ensure model training image is generated into "model-training-PIPELINE_NAME"
  • In downstream projects, ensure embeddings deployment name is "PIPELINE_NAME-STEP_NAME"
  • Configure permissions/implement PDP authorization for each API route

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file aissemble_foundation_model_training_api-1.10.0rc2.tar.gz.

File metadata

File hashes

Hashes for aissemble_foundation_model_training_api-1.10.0rc2.tar.gz
Algorithm Hash digest
SHA256 06fff7433df87eedb8bcc82d07f1fa9808ca3843729821fb2ba9cbdc4515a037
MD5 04b523c575cf2eefef015f4de830b522
BLAKE2b-256 1fe57246d0c7d5d2f44998a4f2d1e8eec1c09175e96b30754fb20d18d6ecde36

See more details on using hashes here.

File details

Details for the file aissemble_foundation_model_training_api-1.10.0rc2-py3-none-any.whl.

File metadata

File hashes

Hashes for aissemble_foundation_model_training_api-1.10.0rc2-py3-none-any.whl
Algorithm Hash digest
SHA256 9c65a8a18880203c8dde54a462157c71158820031558737ad8772465eaf50fc4
MD5 cc4ea29522295f42a2897ce1a18a7f64
BLAKE2b-256 a0584a99d0b2eb791f638a0dd1704a604b976d0bbc4f5bb03c7f4d8871beaad3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page