ast - Pytorch
Project description
AST
Implementation of AST from the paper: "AST: Audio Spectrogram Transformer' in PyTorch and Zeta. In this implementation we basically take an 2d input tensor representing audio -> then patchify it -> linear proj -> then position embeddings -> then attention and feedforward in a loop for layers. Please Join Agora and tag me if this could be improved in any capacity.
Install
pip3 install ast-torch
Usage
import torch
from ast_torch.model import ASTransformer
# Create dummy data
x = torch.randn(2, 16)
# Initialize model
model = ASTransformer(
dim=4, seqlen=16, dim_head=4, heads=4, depth=2, patch_size=4
)
# Run model and print output shape
print(model(x).shape)
Citation
@misc{gong2021ast,
title={AST: Audio Spectrogram Transformer},
author={Yuan Gong and Yu-An Chung and James Glass},
year={2021},
eprint={2104.01778},
archivePrefix={arXiv},
primaryClass={cs.SD}
}
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ast_torch-0.0.5.tar.gz
(6.9 kB
view hashes)
Built Distribution
Close
Hashes for ast_torch-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1920586c09473c403878e20c3e483194d0d8f4137ea644d8d9a24ae49bc36e55 |
|
MD5 | 9c9ac66dd4ba09e6696016e8c9b7dc15 |
|
BLAKE2b-256 | 5ea1c942462042444954e43c5476d5868289268ef9b8993b31e40c5233b5a7ed |