5 projects
optimum-neuron
Optimum Neuron is the interface between the Hugging Face Transformers and Diffusers libraries and AWS Trainium and Inferentia accelerators. It provides tools for loading, training, and running inference on models in single- and multi-NeuronCore configurations across a range of downstream tasks.
optimum
Optimum is an extension of the Hugging Face Transformers library that provides a framework for integrating third-party libraries from hardware partners and interfacing with their hardware-specific functionality.
optimum-quanto
A PyTorch quantization backend for Optimum.
optimum-pipelines
An inference pipeline framework for Optimum.
quanto
A quantization toolkit for PyTorch.