Paper - Pytorch

AudioFlamingo

A PyTorch implementation of the AudioFlamingo model from the paper "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities".

Install

pip3 install audio-flamingo

Usage

import torch
from audio_flamingo.model import AudioFlamingo

# Generate random inputs: token IDs in [0, 256) and a random audio tensor
text = torch.randint(0, 256, (1, 1024))
audio = torch.randn(1, 16000)

# Initialize AudioFlamingo model
model = AudioFlamingo(
    dim=512,
    num_tokens=256,
    max_seq_len=1024,
    heads=8,
    depth=6,
    dim_head=64,
    dropout=0.1,
    context_dim=512,
)

# Pass the text tokens and the audio tensor through the model
output = model(text, audio)  # (1, 1024, 256)

# Print the output shape
print(output.shape)

License

MIT

Download files

Source Distribution

audio_flamingo-0.0.3.tar.gz (6.2 kB)

Uploaded Source

Built Distribution

audio_flamingo-0.0.3-py3-none-any.whl (6.2 kB)

Uploaded Python 3

File details

Details for the file audio_flamingo-0.0.3.tar.gz.

File metadata

  • Download URL: audio_flamingo-0.0.3.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for audio_flamingo-0.0.3.tar.gz
Algorithm    Hash digest
SHA256       ce8f94ed6a4edaa05631a5e42ce47645a98792aedc54c22d9ac309f055faecd6
MD5          600c77034cb28e6e76f9e6315f9b6d52
BLAKE2b-256  810ced19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54

See more details on using hashes here.

File details

Details for the file audio_flamingo-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: audio_flamingo-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for audio_flamingo-0.0.3-py3-none-any.whl
Algorithm    Hash digest
SHA256       c3bfda9840a88e7a59da33c2d66d5b51815e5b67dc6aba2bfcfcc181fb7d8409
MD5          41932f5d7cc5f9bd234d9e2fc9cee5cd
BLAKE2b-256  0ab8f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61
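
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library. The SHA256 values are copied from the hash tables above. The helper names `sha256_of` and `matches_published` are illustrative and not part of the audio-flamingo package.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables for version 0.0.3.
PUBLISHED_SHA256 = {
    "audio_flamingo-0.0.3.tar.gz":
        "ce8f94ed6a4edaa05631a5e42ce47645a98792aedc54c22d9ac309f055faecd6",
    "audio_flamingo-0.0.3-py3-none-any.whl":
        "c3bfda9840a88e7a59da33c2d66d5b51815e5b67dc6aba2bfcfcc181fb7d8409",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path):
    """Return True if the file's SHA256 equals the published digest
    for its file name; raise KeyError for an unknown file name."""
    name = Path(path).name
    expected = PUBLISHED_SHA256.get(name)
    if expected is None:
        raise KeyError(f"no published digest for {name}")
    return hmac.compare_digest(sha256_of(path), expected)
```

For example, `matches_published("audio_flamingo-0.0.3.tar.gz")` would return `True` only for a file whose contents hash to the SHA256 value listed above. The file is looked up by name, so a renamed download raises `KeyError` rather than being compared against the wrong digest.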
