Skip to main content

The quality gate for testing MLRun/Iguazio solution.

Project description

License PyPI version fury.io coverage GitHub commit activity GitHub release

QGate-Sln-MLRun

The Quality Gate for solution MLRun (and Iguazio). The main aims of the project are:

  • independent quality test (function, integration, acceptance, ... tests)
  • deeper quality checks before full rollout/use in company environments
  • identification of possible compatibility issues (if any)
  • external and independent test coverage
  • etc.

The tests use these key components, MLRun solution see GIT mlrun, sample meta-data model see GIT qgate-model and this project.

Test scenarios

The quality gate covers these test scenarios (✅ done, ✔ in-progress, ❌ planned):

  • 01 - Project
    • ✅ TS101: Create project(s)
    • ✅ TS102: Delete project(s)
  • 02 - Feature set
    • ✅ TS201: Create feature set(s)
    • ✅ TS202: Create feature set(s) & Ingest from DataFrame source (one step)
    • ✅ TS203: Create feature set(s) & Ingest from CSV source (one step)
    • ✅ TS204: Create feature set(s) & Ingest from Parquet source (one step)
    • ✅ TS205: Create feature set(s) & Ingest from SQL source (one step)
    • ✔ TS206: Create feature set(s) & Ingest from Kafka source (one step)
    • ❌ TS207: Create feature set(s) & Ingest from HTTP source (one step)
  • 03 - Ingest data
    • ✔ TS301: Ingest data (Preview)
    • ✅ TS302: Ingest data to feature set(s) from DataFrame source
    • ✅ TS303: Ingest data to feature set(s) from CSV source
    • ✅ TS304: Ingest data to feature set(s) from Parquet source
    • ✅ TS305: Ingest data to feature set(s) from SQL source
    • ✔ TS306: Ingest data to feature set(s) from Kafka source
    • ❌ TS307: Ingest data to feature set(s) from HTTP source
  • 04 - Feature vector
    • ✅ TS401: Create feature vector(s)
  • 05 - Get data from vector
    • ✅ TS501: Get data from off-line feature vector(s)
    • ✅ TS502: Get data from on-line feature vector(s)
  • 06 - Pipeline
    • ✔ TS601: Simple pipeline(s) (HTTP call)
    • ❌ TS602: Simple pipeline for CSV source
    • ❌ TS603: Complex pipeline for DataFrame source
    • ❌ TS604: Complex pipeline for CSV source
  • 07 - Build model
    • ✅ TS701: Build CART model
    • ❌ TS702: Build XGBoost model
    • ❌ TS703: Build DNN model
  • 08 - Serve model
    • ✅ TS801: Serving score from CART
    • ❌ TS802: Serving score from XGBoost
    • ❌ TS803: Serving score from DNN
  • 09 - Model monitoring/drifting
    • ❌ TS901: Real-time monitoring
    • ❌ TS902: Batch monitoring

NOTE: Each test scenario contains addition specific test cases (e.g. with different targets for feature sets, etc.).

Test inputs/outputs

The quality gate tests these inputs/outputs (✅ done, ✔ in-progress, ❌ planned):

  • Outputs (targets)
    • ✅ RedisTarget, ✅ SQLTarget/MySQL, ✔ SQLTarget/Postgres, ✅ KafkaTarget
    • ✅ ParquetTarget, ✅ CSVTarget
    • ✅ File system, ❌ S3, ❌ BlobStorage
  • Inputs (sources)
    • ✅ Pandas/DataFrame, ✅ SQLSource/MySQL, ❌ SQLSource/Postgres, ❌ KafkaSource
    • ✅ ParquetSource, ✅ CSVSource
    • ✅ File system, ❌ S3, ❌ BlobStorage

The current supported sources/targets in MLRun.

Sample of outputs

Sample of outputs

The reports in original form, see:

Usage

You can easy use this solution in four steps:

  1. Download content of these two GIT repositories to your local environment
  2. Update file qgate-sln-mlrun.env from qgate-model
    • Update variables for MLRun/Iguazio, see MLRUN_DBPATH, V3IO_USERNAME, V3IO_ACCESS_KEY, V3IO_API
      • setting of V3IO_* is needed only in case of Iguazio installation (not for pure free MLRun)
    • Update variables for QGate, see QGATE_* (basic description directly in *.env)
  3. Run from qgate-sln-mlrun
    • python main.py
  4. See outputs (location is based on QGATE_OUTPUT in configuration)
    • './output/qgt-mlrun- .html'
    • './output/qgt-mlrun- .txt'

Precondition: You have available MLRun or Iguazio solution (MLRun is part of that), see official installation steps, or directly installation for Desktop Docker.

Tested with

The project was tested with these MLRun versions (see change log):

  • MLRun (in Desktop Docker)
    • MLRun 1.7.0 (plan 05-06/2024)
    • MLRun 1.6.3 (plan 05/2024), 1.6.2, 1.6.1, 1.6.0
    • MLRun 1.5.2, 1.5.1, 1.5.0
    • MLRun 1.4.1, 1.3.0
  • Iguazio (k8s, on-prem, VM on VMware)
    • Iguazio 3.5.3 (with MLRun 1.4.1)
    • Iguazio 3.5.1 (with MLRun 1.3.0)

NOTE: Current state, only the last MLRun/Iguazio versions are valid for testing (these tests are without back-compatibilities).

Others

  • To-Do, the list of expected/future improvements, see
  • Applied limits, the list of applied limits, see
  • How can you test the solution?, you have to focus on Linux env. or Windows with WSL2 (see step by step tutorial)
  • MLRun/Iguazio, the key changes in a nutshell, see

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

qgate_sln_mlrun-0.2.2-py3-none-any.whl (27.0 kB view details)

Uploaded Python 3

File details

Details for the file qgate_sln_mlrun-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for qgate_sln_mlrun-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 31c6eefbe44ca5bdfeac19410152962b1d13ce64deb2cdad4fd8e00ef5b03cdf
MD5 e48492bd222c88eeb4a7ee7600463c71
BLAKE2b-256 1ee57e261ee58c0606b6e391fd16d146047a58b5456c169213e10e0bdff78c44

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page