🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b9.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d8bc60066394b4b3cc99d83d734aacc93c39cec7bfb7cc72b0fb2b1f0f8e172 |
|
MD5 | 45c2071dfd1123d088d3bd32fe1eb742 |
|
BLAKE2b-256 | 792ab40e054acbe50341501ed5a467454b41794ce75c6d53acc8db96932b1178 |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 946089c871a67e2dc0dc1b9a18984e07faf3ea290beeecad8aca0483d1891cc7 |
|
MD5 | f93c90036067d918e05552fc2cbf833d |
|
BLAKE2b-256 | 69f3ce765f6058451e662b2d4c3d349efecf87c9ae9335c197b114609584c2d4 |