🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b11.tar.gz
Algorithm | Hash digest
---|---
SHA256 | bb7e26b75824a11a530949ac9eae33b0332deaf6a3104fd96eb0e03c19287b61
MD5 | ca4ae357f5179d97527c2c02ff653a62
BLAKE2b-256 | 359ecbe31946eb25258f790709d724aa1be4c3de6c579c683a5aa4abe71fe499
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b11-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | d65ed7c16602360ec97f33f009fa4a8c46c54c72f08d9f8575dc1fc324800e71
MD5 | c8085771ef45a714e22e270c55d84cbe
BLAKE2b-256 | e27a2a01bd3a33a1a1cdc25299000a8063edab9e1dd5a1f3c040a8ca83ac4576
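The digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib`; the helper names `file_digests` and `verify` are hypothetical and not part of this package. It assumes the BLAKE2b-256 row corresponds to BLAKE2b with a 32-byte digest, and it skips MD5 because it is not suitable for integrity guarantees.

```python
import hashlib


def file_digests(path, chunk_size=1 << 20):
    """Return SHA256 and BLAKE2b-256 hex digests of a file (hypothetical helper).

    The file is read in chunks so large archives do not need to fit in memory.
    """
    hashers = {
        "SHA256": hashlib.sha256(),
        # Assumption: "BLAKE2b-256" means BLAKE2b with a 32-byte (256-bit) digest.
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def verify(path, expected):
    """Compare a file against expected digests (hypothetical helper).

    `expected` maps an algorithm name ("SHA256" or "BLAKE2b-256") to a hex
    digest. Returns a dict mapping each name to True if the digest matches.
    """
    actual = file_digests(path)
    return {
        name: actual[name] == digest.lower()
        for name, digest in expected.items()
    }
```

For example, calling `verify` on a local copy of `vertex_ai_huggingface_inference_toolkit-0.0.1b11-py3-none-any.whl` with `{"SHA256": "d65ed7c16602360ec97f33f009fa4a8c46c54c72f08d9f8575dc1fc324800e71"}` should return `{"SHA256": True}` if the file is intact.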