🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b5.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 320a8ac7bb7d3073c23b42a43fe3109b8a8fe66d2d76768edac87c8caa95ed2a |
|
MD5 | b4b070d4c3484a87ad69c78a31afdcb4 |
|
BLAKE2b-256 | 5cd40dd0e811ca22f1935a4c7ffb7ab9bc53a1fcc30875cd1c5ec60333d4d65d |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ba60e4076a006fd2e2a2ac40fa63f72649ed9200a09b152194e061d6596a5f9d |
|
MD5 | f655b9983bb8035a29c6bcbb85019ef1 |
|
BLAKE2b-256 | 68d0101a6c84248f0c86712a1d988937d5c0b621a8e179ff30ca8167df4f7a05 |