Paper - Pytorch
Project description
AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities". PAPER LINK
Install
pip3 install audio-flamingo
Usage
import torch
from audio_flamingo.model import AudioFlamingo
# Generate a random input sequence
text = torch.randint(0, 256, (1, 1024))
audio = torch.randn(1, 16000)
# Initialize AudioFlamingo model
model = AudioFlamingo(
dim=512,
num_tokens=256,
max_seq_len=1024,
heads=8,
depth=6,
dim_head=64,
dropout=0.1,
context_dim=512,
)
# Pass the input sequence through the model
output = model(text, audio) # (1, 1024, 256)
# Print the output shape
print(output.shape)
# Path: audio_flamingo/model.py
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
audio_flamingo-0.0.3.tar.gz
(6.2 kB
view details)
Built Distribution
File details
Details for the file audio_flamingo-0.0.3.tar.gz
.
File metadata
- Download URL: audio_flamingo-0.0.3.tar.gz
- Upload date:
- Size: 6.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ce8f94ed6a4edaa05631a5e42ce47645a98792aedc54c22d9ac309f055faecd6 |
|
MD5 | 600c77034cb28e6e76f9e6315f9b6d52 |
|
BLAKE2b-256 | 810ced19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54 |
File details
Details for the file audio_flamingo-0.0.3-py3-none-any.whl
.
File metadata
- Download URL: audio_flamingo-0.0.3-py3-none-any.whl
- Upload date:
- Size: 6.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c3bfda9840a88e7a59da33c2d66d5b51815e5b67dc6aba2bfcfcc181fb7d8409 |
|
MD5 | 41932f5d7cc5f9bd234d9e2fc9cee5cd |
|
BLAKE2b-256 | 0ab8f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61 |