🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b8.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | de8aeded526e450bb5e7146e55e63e0a71aa8179a2338b197589721de585c089 |
|
MD5 | 9ab93b2f0b27ab3f1a47e395eb07968e |
|
BLAKE2b-256 | 22c5c790c397421391ac2dbc838e97f1a315f6751c21be3ec7397f62c04c35bc |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3724624c3761bfbf1bcbe55b1647a51a92bedb19f4457afa1b775526fd559965 |
|
MD5 | e9f2503ed61ccf094953d793cab52e4a |
|
BLAKE2b-256 | 6cb1fb322a7f2a5e0d01c9606dc76ca2caacda5bcbc4d8340999944af574ec32 |