🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b7.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | d99b11bae2b7c6e9e1808db277d68515573e919e60e36e10c5ca52924155c6d7 |
|
MD5 | 20acad6261b3bb76d9d732b828809e43 |
|
BLAKE2b-256 | 3acb8a9454b8607ff6db9f7ef03e7dd20b515603c2f0faf1448736a8046ecef9 |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a5ad5a1ba3b26f41c50265f1dc16156147f8c9bedba6546851669e2069b77f53 |
|
MD5 | 7236926a59455f6078f7ea0d3ccba92c |
|
BLAKE2b-256 | fa3cc686fc9fc12dfd7e189d0c7182b8cb29533bdd11d04f8b841faa3b1e26cf |