🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)

model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
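
Once the endpoint is deployed, a client has to send it a request body. The sketch below only builds such a body for the zero-shot-classification task configured above. Vertex AI online prediction wraps inputs in an `instances` list, but how this toolkit maps each instance onto the pipeline is not stated here. The helper name `build_zero_shot_request` is hypothetical, and the field names `sequences` and `candidate_labels` are assumptions borrowed from the argument names of the `transformers` zero-shot pipeline.

```python
import json


def build_zero_shot_request(texts, candidate_labels):
    """Build a Vertex AI style request body of the form {"instances": [...]}.

    ASSUMPTION: each instance carries "sequences" and "candidate_labels",
    mirroring the transformers zero-shot pipeline arguments. The toolkit's
    actual expected schema may differ.
    """
    labels = list(candidate_labels)
    if not labels:
        raise ValueError("at least one candidate label is required")
    instances = [{"sequences": text, "candidate_labels": labels} for text in texts]
    return {"instances": instances}


body = build_zero_shot_request(
    ["I want to book a flight to Lisbon."],
    ["travel", "cooking", "finance"],
)
print(json.dumps(body, indent=2))
```

Keeping payload construction in a small pure function makes it easy to adjust the field names once the schema the deployed container expects is confirmed.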
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b10.tar.gz
Algorithm | Hash digest
---|---
SHA256 | bf95c51e54b8a59005d52184e9cffd5ead20223adf162e3ef3f36aa50bf8d8bd
MD5 | 88541d5ed001244e90e4980f3e1422a7
BLAKE2b-256 | 5896bf5074ccfafa5cba33a4e8a9170db111c41ac864b7e211edceff2d399c11
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b10-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | a140e87019a46128203b116aeb2039e1b2adbada4780baf35af87c1ac7f5af4b
MD5 | 196151076939d4dc54d3ec0a81dca08a
BLAKE2b-256 | a62529f84edb9a84d4b3547f86c3c8c126e7013b2f77d1ff691fd6c62dc8b390
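
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the expected value is the SHA256 digest listed above for the `.tar.gz` source distribution.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for the 0.0.1b10 source distribution (.tar.gz).
EXPECTED_SHA256 = "bf95c51e54b8a59005d52184e9cffd5ead20223adf162e3ef3f36aa50bf8d8bd"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("vertex_ai_huggingface_inference_toolkit-0.0.1b10.tar.gz")` should return `True` for an intact download in the current directory. To check the wheel instead, pass its SHA256 digest from the table above as `expected`.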