🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel

model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b3.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 4a46c31f1df6e6727a40e0aa738259fe6a535bed5d5975e9062810f248277a44
MD5 | 5c1c73611de316cfc64f038dec8fd00d
BLAKE2b-256 | 900d01bf11f3e2644b0f80cdbdde5cdd0175a9674ce554f93d2144b77f98336a
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b3-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 6a066cb17b623a8487fbf4d24ea0f891b61b6115603d3e242974ade5b7550cd7
MD5 | d76d3b629371200003c85494520adede
BLAKE2b-256 | ad8c6fb64be6fd09da5654991d61d4752d472f9842f4480ff6a6abb274070ca5
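
The SHA256 digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib`; the helper names (`sha256_of`, `matches_expected`) and the local file path are assumptions made for illustration, not part of the toolkit.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for the 0.0.1b3 source distribution (.tar.gz).
EXPECTED_SHA256 = "4a46c31f1df6e6727a40e0aa738259fe6a535bed5d5975e9062810f248277a44"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with an expected hex digest (case-insensitive)."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local path: assumes the archive was downloaded to the
    # current working directory under its published file name.
    archive = Path("vertex_ai_huggingface_inference_toolkit-0.0.1b3.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

To check the wheel instead, pass its SHA256 digest from the second table as `expected`.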