🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 725769dc8455c8a5e27f1c3ed56b1f050d7a800b34c72b8e34fff9e59be01c61 |
|
MD5 | ca565a850751a94d7028a986d23cdbb9 |
|
BLAKE2b-256 | ed64f41d7e7bf07fa3e1432e344cb74a4471942c2e8b00fea2197e9c36eb228a |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72168ae5833edfc17290f8c24b1c94f0708d0381c83678608d8800c716a36bae |
|
MD5 | d01ec557d98537c7b95a387f1a06e0d0 |
|
BLAKE2b-256 | 8b4ffbecca1aaf9bd28b8ec38c79b32250c2745150684b274838d475e6da5656 |