Last released Apr 2, 2024
Accelerate transformer inference by 6-8.5x. Native to Huggingface and PyTorch.
Last released Jan 5, 2024
Deploy, scale, and monitor your ML models all with one click. Native to AWS.
Supported by