🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b12.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 43906366c4e21e2667c3c4f040554e918a62d584ded18510a9539fce98f50b72
MD5 | 37272524f4abdb854110f85284d9870f
BLAKE2b-256 | 03c49a6933d80dabd0480a66f2d48eb70ecfa8ce1d74e83c16608bef11a64590
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b12-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 03866db92ec367899140110c395a772e642ba310f26b234d928410f6310be4bc
MD5 | f11d3c8b5b4f510b396df1d1d7c65b6c
BLAKE2b-256 | 3d48b7bb4c8b3a7ecf194ef3861b0ab1a8d8726b818167496216fc1a35bf4c74
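
The SHA256 digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the script assumes the wheel was saved to the current directory under its published filename.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's SHA256 against a published digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumption: the wheel was downloaded into the current working directory.
    wheel = Path("vertex_ai_huggingface_inference_toolkit-0.0.1b12-py3-none-any.whl")
    # SHA256 digest copied from the table above.
    expected = "03866db92ec367899140110c395a772e642ba310f26b234d928410f6310be4bc"
    if wheel.exists():
        print("OK" if matches_published_hash(wheel, expected) else "MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```

The same check works for the source distribution by swapping in the `.tar.gz` filename and its SHA256 digest.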