Skip to main content

Templates for HuggingFace Transformers, supporting automatic-speech-recognition, text-to-speech, translation and summarization templates

Project description

[![sp](https://img.shields.io/badge/lang-sp-red.svg)](https://github.com/Sinapsis-AI/sinapsis-huggingface/blob/main/README.es.md)




Sinapsis Hugging Face Transformers

Templates for seamless integration with Transformers models

🐍 Installation📂 Features▶️ Example usage📦 Documentation🔍 License

🐍 Installation

Install using your package manager of choice. We encourage the use of uv

Example with uv:

  uv pip install sinapsis-huggingface-transformers --extra-index-url https://pypi.sinapsis.tech

or with raw pip:

  pip install sinapsis-huggingface-transformers --extra-index-url https://pypi.sinapsis.tech

[!IMPORTANT] Templates may require extra optional dependencies. For development, we recommend installing the package with all the optional dependencies:

with uv:

  uv pip install sinapsis-huggingface-transformers[all] --extra-index-url https://pypi.sinapsis.tech

or with raw pip:

  pip install sinapsis-huggingface-transformers[all] --extra-index-url https://pypi.sinapsis.tech

📦 Features

Sinapsis Hugging Face Transformers provides customizable inference templates for a variety of tasks, including image captioning, object detection, instance segmentation, speech-to-text, and text-to-speech.

Templates:

  • ImageToTextTransformers: Generates textual descriptions from input images using Hugging Face image-to-text models.
  • PaliGemmaInference: Generate captions for images.
  • PaliGemmaDetection: Detect specific objects in images.
  • SpeechToTextTransformers: Converts spoken audio into text using automatic speech recognition (ASR) models.
  • SummarizationTransformers: Summarizes long text into concise summaries using Hugging Face summarization models.
  • TextToSpeechTransformers: Converts text into lifelike audio using text-to-speech (TTS) models.
  • TranslationTransformers: Translates text from a source language to a target language using Hugging Face translation models.

▶️ Example Usage

Below is an example YAML configuration for text-to-speech (TTS) conversion using the Suno Bark model.

Config
agent:
  name: test_agent

templates:
  - template_name: InputTemplate
    class_name: InputTemplate
    attributes: {}

  - template_name: TextInput
    class_name: TextInput
    attributes:
      text: Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as playing tic tac toe.

  - template_name: TextToSpeechTransformers
    class_name: TextToSpeechTransformers
    template_input: TextInput
    attributes:
      model_path: 'suno/bark'
      device: "cuda"
      torch_dtype: float32
      seed: 7
      use_embeddings: false
      n_words: 30
      inference_kwargs:
        generate_kwargs:
          do_sample: true
          temperature: 0.7

  - template_name: AudioWriterSoundfile
    class_name: AudioWriterSoundfile
    template_input: TextToSpeechTransformers
    attributes:
      root_dir: ./test
      save_dir: audios

[!IMPORTANT] The TextInput and AudioWriterSoundfile templates correspond to the sinapsis-data-readers and sinapsis-data-writers packages respectively. If you want to use the example, please make sure you install these packages.

To run the config, use the CLI:

sinapsis run name_of_config.yml

📙 Documentation

Documentation is available on the sinapsis website

Tutorials for different projects within sinapsis are available at sinapsis tutorials page

🔍 License

This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.

For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sinapsis_huggingface_transformers-0.2.15.tar.gz (31.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file sinapsis_huggingface_transformers-0.2.15.tar.gz.

File metadata

File hashes

Hashes for sinapsis_huggingface_transformers-0.2.15.tar.gz
Algorithm Hash digest
SHA256 f4198dd8d5ef7c987245d7513e868059723100289bc3e6b304ee47330297910b
MD5 b038e722063871b910ec7073f3bffdf6
BLAKE2b-256 7cd2c9a248553ab94aa799a9bdaf6896aef81ec79c4657d961f12c58b8ca41ea

See more details on using hashes here.

File details

Details for the file sinapsis_huggingface_transformers-0.2.15-py3-none-any.whl.

File metadata

File hashes

Hashes for sinapsis_huggingface_transformers-0.2.15-py3-none-any.whl
Algorithm Hash digest
SHA256 0f887381ba6b2dea30ff5ce55ea7a169d8302b011009a4d424d8650ca9809cea
MD5 182c97486d75c0be6ca565ecff9fd6ef
BLAKE2b-256 22a3dd505a3fcdba477b30b0e80245335db01eff278a5670650a5e6d7fc44203

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page