
aft-pytorch

Unofficial PyTorch implementation of the Attention Free Transformer by Zhai et al. [abs, pdf] from Apple Inc.
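For reference, AFT-full replaces dot-product attention with learned pairwise position biases: each output is Y_t = sigmoid(Q_t) * (sum over t' of exp(K_t' + w_{t,t'}) * V_t') / (sum over t' of exp(K_t' + w_{t,t'})). Below is a minimal, self-contained sketch of that operation; the class name AFTFullSketch and its layout are illustrative assumptions, not this package's API.

import torch
import torch.nn as nn

class AFTFullSketch(nn.Module):
    # Hypothetical sketch of the AFT-full operation from the paper;
    # not this package's API.
    def __init__(self, max_seqlen, dim, hidden_dim):
        super().__init__()
        self.to_q = nn.Linear(dim, hidden_dim)
        self.to_k = nn.Linear(dim, hidden_dim)
        self.to_v = nn.Linear(dim, hidden_dim)
        self.project = nn.Linear(hidden_dim, dim)
        # learned pairwise position biases w_{t,t'}
        self.wbias = nn.Parameter(torch.zeros(max_seqlen, max_seqlen))

    def forward(self, x):
        B, T, _ = x.shape
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)
        # exp(K_{t'} + w_{t,t'}) factors into exp(w_{t,t'}) * exp(K_{t'});
        # the paper also subtracts the max before exponentiating for stability,
        # which is omitted here for brevity
        exp_w = torch.exp(self.wbias[:T, :T]).unsqueeze(0)  # (1, T, T)
        exp_k = torch.exp(k)                                # (B, T, hidden)
        num = torch.matmul(exp_w, exp_k * v)  # weighted sum over t'
        den = torch.matmul(exp_w, exp_k)      # normalizer
        return self.project(torch.sigmoid(q) * num / den)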

Installation

You can install aft-pytorch via pip:

pip install aft-pytorch

Usage

You can import the Attention Free Transformer (AFT) from the package like so:

import torch
from aft_pytorch import AFTFullAttention

layer = AFTFullAttention(
    tsteps=10,
    dim=512,
    hidden_dim=64,
    heads=8
)

# a batch of 32 sequences, each with 10 timesteps of dimension 512
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
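Note: since AFT-full learns a position bias for every pair of timesteps, `tsteps` presumably needs to match (or upper-bound) the sequence length of your inputs.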

TODO

  • Add full AFT architecture

Contributing

If you like this repo, please leave a star! If you have any amendments or suggestions, feel free to raise a PR or an issue.

Credits

@misc{zhai2021an,
    title={An Attention Free Transformer},
    author={Shuangfei Zhai and Walter Talbott and Nitish Srivastava and Chen Huang and Hanlin Goh and Joshua M. Susskind},
    year={2021},
    url={https://openreview.net/forum?id=pW--cu2FCHY}
}

License

MIT
