🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

# Model from the Hugging Face Hub, plus the framework and runtime versions to use
model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)

# Machine type and GPU accelerator for the deployment
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b2.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 02f02fada75f02273879b8653c393e631db5cdfbceeb01f2b2dd1cca67aa5914
MD5 | de88f352a8c746341aff84b0544b9eb3
BLAKE2b-256 | 11b7a084c36be1d4aa668c47caf5be28fe536b422414c3a559483906c775fcfc
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b2-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | b143d6b2090afdc1cca41a61af67f00e6f84ca9f1b20d5db7b0e773ec114189a
MD5 | f8057c590d5cd638efde021fc6145a57
BLAKE2b-256 | 6090ea59dff496938ca4e01e9da41d4e365004df03dcab0f7182d93d8d822626
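The published digests can be used to check that a downloaded file is intact. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical and are not part of the toolkit.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published one (case and whitespace ignored)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

To use it, pass the local path of the downloaded `.tar.gz` or `.whl` file together with the matching SHA256 value from the tables above; a result of `False` suggests the download is corrupted or is not the file that was published.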