🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
Project description
🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI
from vertex_ai_huggingface_inference_toolkit import TransformersModel
model = TransformersModel(
model_name_or_path="facebook/bart-large-mnli",
framework="torch",
framework_version="2.2.0",
transformers_version="4.38.2",
python_version="3.10",
cuda_version="12.3.0",
environment_variables={
"HF_TASK": "zero-shot-classification",
},
)
model.deploy(
machine_type="n1-standard-4",
accelerator_type="NVIDIA_TESLA_T4",
accelerator_count=1,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 78fda71cd61519b8bc4c0f9d0fa03161b9a9489f6fd7784cbbab205fd1a0042e |
|
MD5 | dfba9e503a1c1e76b2b8d715dcc68221 |
|
BLAKE2b-256 | 1641cf24381ad6f7d98d1614530d7e38d9254de93ca270d84d4ad8efefe19d2e |
Close
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a295fbc99b24e44875535ab7ff1cf345ec1165fe6cb9615d67da310b1c0f3d8d |
|
MD5 | 46d78f2ae2186d3614f7501e6668ca33 |
|
BLAKE2b-256 | 0607e3faaa565f8119951c7d244f4a342013c83bd03ea8dd160138969901c011 |