TrustyAI LM-Eval as an Out-of-Tree Llama Stack Provider

About

This repository implements TrustyAI's LM-Eval as an out-of-tree Llama Stack remote provider.

Use

Prerequisites

  • Admin access to an OpenShift cluster with RHOAI installed
  • Log in to your OpenShift cluster with oc login --token=<TOKEN> --server=<SERVER>
  • Installation of uv
  • Installation of the oc CLI tool
  • Installation of the Llama Stack CLI tool (llama)
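
Before cloning the repository, it can help to confirm that the command-line tools listed above are on your PATH. The following is a minimal sketch, not part of this repository: the function names missing_tools and check_prereqs are hypothetical, and the sketch assumes the three tools are invoked as uv, oc, and llama.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not shipped with this repository).

# Print the name of every tool in the argument list that is not on PATH.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Return non-zero and list the missing tools on stderr if any are absent.
check_prereqs() {
  local missing
  missing=$(missing_tools "$@")
  if [ -n "$missing" ]; then
    printf 'Missing required tools:\n%s\n' "$missing" >&2
    return 1
  fi
  printf 'All required tools found.\n'
}
```

After sourcing the functions, run check_prereqs uv oc llama. To confirm that you are logged in to the cluster, oc whoami prints the current user when a session is active.
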
  1. Clone this repository

    git clone https://github.com/trustyai-explainability/llama-stack-provider-lmeval.git
    
  2. Set llama-stack-provider-lmeval as your working directory.

    cd llama-stack-provider-lmeval
    
  3. Deploy microsoft/Phi-3-mini-4k-instruct on vLLM Serving Runtime

    a. Create a namespace with a name of your choice

    TEST_NS=<NAMESPACE>
    oc create ns $TEST_NS
    oc get ns $TEST_NS
    

    b. Deploy the model via vLLM

    oc apply -k demos/resources
    
  4. Before continuing, perform a sanity check to make sure the model was successfully deployed

    oc get pods | grep "predictor"
    

    Expected output:

    phi-3-predictor-00002-deployment-794fb6b4b-clhj7   3/3     Running   0          5h55m
    
  5. Create and activate a virtual environment

    uv venv .llamastack-venv
    
    source .llamastack-venv/bin/activate
    
  6. Install the required libraries

    uv pip install -e .
    
  7. Define the following environment variables

    export VLLM_URL=https://$(oc get $(oc get ksvc -o name | grep predictor) --template={{.status.url}})/v1/completions
    
    export TRUSTYAI_LM_EVAL_NAMESPACE=$(oc project | cut -d '"' -f2)
    
  8. Start the Llama Stack server in a virtual environment

    llama stack run run.yaml --image-type venv
    

    Expected output:

    INFO:     Application startup complete.
    INFO:     Uvicorn running on http://['::', '0.0.0.0']:8321 (Press CTRL+C to quit)
    
  9. Navigate to demos/ to run the demo notebooks
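
The variables defined in step 7 are built from oc command output, so a failed lookup can silently produce a malformed value. The sketch below is an illustration, not part of this repository: check_vllm_url and check_env are hypothetical names, and the checks assume that VLLM_URL should be a single http(s) URL with a host, ending in /v1/completions. Running it before step 8 catches an empty host or a doubled scheme (for example, when the looked-up URL already includes https://).

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not shipped with this repository).

# Validate the shape of a vLLM completions URL passed as the first argument.
check_vllm_url() {
  local url="$1"
  case "$url" in
    "")
      echo "VLLM_URL is empty" >&2; return 1 ;;
    http:///*|https:///*)
      echo "VLLM_URL has no host: $url" >&2; return 1 ;;
    http://*://*|https://*://*)
      echo "VLLM_URL has a doubled scheme: $url" >&2; return 1 ;;
    http://*/v1/completions|https://*/v1/completions)
      return 0 ;;
    *)
      echo "VLLM_URL should look like https://<host>/v1/completions: $url" >&2
      return 1 ;;
  esac
}

# Check both variables from step 7 in the current shell.
check_env() {
  check_vllm_url "${VLLM_URL:-}" || return 1
  if [ -z "${TRUSTYAI_LM_EVAL_NAMESPACE:-}" ]; then
    echo "TRUSTYAI_LM_EVAL_NAMESPACE is not set" >&2
    return 1
  fi
  echo "Environment looks ready."
}
```

After sourcing the functions in the same shell where the variables were exported, run check_env and fix any reported value before starting the server.
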
