🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but unofficial)
from vertex_ai_huggingface_inference_toolkit import TransformersModel

# Describe the model to serve and the runtime versions to serve it with.
model = TransformersModel(
    model_name_or_path="facebook/bart-large-mnli",
    framework="torch",
    framework_version="2.2.0",
    transformers_version="4.38.2",
    python_version="3.10",
    cuda_version="12.3.0",
    environment_variables={
        "HF_TASK": "zero-shot-classification",
    },
)

# Deploy the model on a machine with a single NVIDIA T4 GPU.
model.deploy(
    machine_type="n1-standard-4",
    accelerator_type="NVIDIA_TESLA_T4",
    accelerator_count=1,
)
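To get a feel for what the `HF_TASK` value above selects, the same model and task can be tried locally with the `transformers` pipeline API before deploying. This is only a sketch: `classify_locally` is a hypothetical helper name, and it is an assumption (not confirmed by this page) that the deployed endpoint wraps an equivalent pipeline.

```python
def classify_locally(text, candidate_labels, model_name="facebook/bart-large-mnli"):
    """Run zero-shot classification locally (hypothetical helper, not part of the toolkit)."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import pipeline

    # "zero-shot-classification" is the same task name passed via HF_TASK above.
    classifier = pipeline("zero-shot-classification", model=model_name)

    # The pipeline returns a dict with "sequence", "labels" and "scores",
    # where labels are sorted from most to least likely.
    return classifier(text, candidate_labels=candidate_labels)
```

Calling `classify_locally("I was charged twice this month", ["billing", "sales", "support"])` downloads the model weights on first use, so it needs network access and a few gigabytes of disk space.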
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b6.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 0737c360f43d4b1f6fb269e0fe409df925c887530f6c73889d216d390f115596
MD5 | f600488d48e8cef05168fad7e953b909
BLAKE2b-256 | f31dc263a0153da6161a4220c140b5570dfee27379cee112679d15d015bf44b5
Hashes for vertex_ai_huggingface_inference_toolkit-0.0.1b6-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 5be8cae3fcfe35f56aadbc4a98f32c43dea49f57c5a541ba465e609a00228bbb
MD5 | 46022b20a90cc3c9c3f1768689f21c6b
BLAKE2b-256 | bc212df67533cf5f96c2afdab4af63d6ae57c2cb6d1598b4334fd4004501d026
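
The digests listed for the two files can be used to check a manually downloaded copy. The following is a minimal sketch using only the standard library; the helper names `sha256_of` and `verify` are hypothetical, and the expected values are copied from the SHA256 rows for each file.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the hash listings for release 0.0.1b6.
EXPECTED_SHA256 = {
    "vertex_ai_huggingface_inference_toolkit-0.0.1b6.tar.gz":
        "0737c360f43d4b1f6fb269e0fe409df925c887530f6c73889d216d390f115596",
    "vertex_ai_huggingface_inference_toolkit-0.0.1b6-py3-none-any.whl":
        "5be8cae3fcfe35f56aadbc4a98f32c43dea49f57c5a541ba465e609a00228bbb",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's SHA256 matches the digest listed for its name."""
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no known digest for {path.name}")
    return sha256_of(path) == expected
```

`verify("vertex_ai_huggingface_inference_toolkit-0.0.1b6.tar.gz")` returns `True` only when the local file matches the listed digest. For installs driven by a requirements file, pip's hash-checking mode (`--require-hashes`) performs the same comparison automatically.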