W-Train Utils for MLflow Triton Plugin
Project description
w-train-utils-mlflow-triton-plugin
가상환경 설정
pyenv install 3.8.18
pyenv virtualenv 3.8.18 wtrainclient3.8
pyenv activate wtrainclient3.8
Triton Inference Server 실행
$ docker run --rm -p8000:8000 -p8001:8001 -p8002:8002 \
-e AWS_ACCESS_KEY_ID=<AccessKey> \
-e AWS_SECRET_ACCESS_KEY=<SecretKey> \
nvcr.io/nvidia/tritonserver:24.01-py3 \
tritonserver --model-repository=s3://https://kitech-minio-api.wimcorp.dev:443/triton \
--model-control-mode=explicit \
--log-verbose=1
환경 변수 설정
프로젝트를 실행하기 전에 아래의 환경 변수들을 설정해야 합니다:
환경변수 | 설명 | 예시 |
---|---|---|
MLFLOW_S3_ENDPOINT_URL | MLflow가 저장소로 사용하고있는 MinIO 엔드포인트 URL | http://localhost:9000 |
MLFLOW_TRACKING_URI | MLflow 트래킹 서버의 URI | http://localhost:5001 |
AWS_ACCESS_KEY_ID | MinIO 서버 접근을 위한 AWS 호환 액세스 키 | minio |
AWS_SECRET_ACCESS_KEY | MinIO 서버 접근을 위한 AWS 호환 시크릿 액세스 키 | miniostorage |
TRITON_URL | Triton Inference Server 의 grpc 엔드포인트 URL | http://localhost:8001 |
TRITON_MODEL_REPO | Triton Inference Server 의 모델저장소 URL | s3://http://localhost:9000/triton |
패키지 빌드 및 업로드
# 필요한 의존성 설치
pip install wheel setuptools twine
vi ~/.pypirc
[distutils]
index-servers =
pypi
pypi-repository
[pypi]
username = __token__
password = <token>
[pypi-repository]
repository: https://<domain>/repository/<pypi-hosted>/
username: <username>
password: <password>
sh scripts/build.sh
sh scripts/deploy.sh
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for wtu-mlflow-triton-plugin-0.0.14.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | f20fadbf622e7f71139fc5307866f072e3dd345c46182acc0a5be7addff3df36 |
|
MD5 | 0352a1dbaed115433c90533251b4a0c7 |
|
BLAKE2b-256 | 0663feb8a606348d197a44a8ffa3c9c273a8e0e7ba288844cefcebcc8816e512 |
Close
Hashes for wtu_mlflow_triton_plugin-0.0.14-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ab80f5e2b0c09de720d7dfd305e82c450dc2ce64c992017759215c37977ce34b |
|
MD5 | 502b21ff1e1174f65adeb0fc430679e8 |
|
BLAKE2b-256 | f2ba2f14b0f0f7cd9402304c50fed90fe9ecb80bcee7deaebffbce00256b10b7 |