🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

# Describe the model to serve and the runtime versions of its serving environment.
model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        # Task that the served model should run.
        "HF_TASK": "zero-shot-classification",
    },
)

# Deploy on a single NVIDIA T4 GPU attached to an n1-standard-4 machine.
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
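Once a model is deployed, a request body has to be sent to it. Vertex AI online prediction wraps inputs in an `instances` list; the fields inside each instance depend on the serving container. The sketch below only builds such a body with the standard library. The helper name `build_predict_request` is hypothetical, and the `sequences` and `candidate_labels` field names are an assumption borrowed from the Transformers zero-shot-classification pipeline arguments, not a confirmed schema of this toolkit.

```python
import json


def build_predict_request(text, candidate_labels):
    """Build a Vertex AI style JSON request body holding a single instance."""
    if not candidate_labels:
        raise ValueError("candidate_labels must not be empty")
    instance = {
        # Assumed field names, mirroring the Transformers pipeline arguments.
        "sequences": text,
        "candidate_labels": list(candidate_labels),
    }
    # Vertex AI online prediction expects inputs under the "instances" key.
    return json.dumps({"instances": [instance]})
```

For example, `build_predict_request("The new GPU is fast", ["technology", "sports"])` returns a JSON string that could be posted to the endpoint, provided the container accepts that instance layout.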
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b4.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 1c5cd248a15ed5dd6dd7f4d512eda39b97887c8aa758f03c9469c4151005e392
MD5 | 65e003120ea35d5eba786077ba122fbb
BLAKE2b-256 | 2343cd6c7322e6bb5ded26778562b4751f506d580f9f4a6f11ea656d42956717
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 80d107e79b23d819fd1d87dd3c0d3d44a06889745c0b9b6b20e4f8aeb2b27ed5
MD5 | 03c8b7f9f1337e4b951a5696ab23025b
BLAKE2b-256 | f69b29ef3935aeb6640ba12457d6b374b67dc0c499db7af6b4664af3bdb46128
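The published digests can be checked against a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib` and `hmac` modules; the helper names `file_sha256` and `matches_digest` are illustrative and not part of the toolkit.

```python
import hashlib
import hmac
from pathlib import Path


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large files are never loaded in full.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex):
    """Compare the file's SHA256 with an expected hex digest, ignoring case."""
    return hmac.compare_digest(file_sha256(path), expected_hex.strip().lower())
```

Assuming the wheel has been downloaded into the current directory, `matches_digest("vertex_ai_huggingface_inference_toolkit-0.0.1b4-py3-none-any.whl", "80d107e79b23d819fd1d87dd3c0d3d44a06889745c0b9b6b20e4f8aeb2b27ed5")` should return `True` for an unmodified file.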