5 projects
optimum-neuron
Optimum Neuron serves as the bridge between Hugging Face libraries, such as Transformers, Diffusers, and PEFT, and AWS Trainium and Inferentia accelerators. It provides a set of tools enabling easy model loading, training, and inference on both single and multiple Neuron core configurations, across a wide range of downstream tasks.
optimum
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.
optimum-quanto
A pytorch quantization backend for optimum.
optimum-pipelines
An inference pipelines framework for optimum.
quanto
A quantization toolkit for pytorch.