
Fast kernel for triangle self attention.

Project description

Fused Triangle Self Attention kernel, written in Triton. Basically flash attention, but for triangle self attention. The implementation is heavily inspired by FlagAttention and the Triton fused attention tutorial.
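
For reference, here is a minimal pure-PyTorch sketch of the computation, assuming the AlphaFold2-style triangle attention formulation with per-head tensors of shape (h, n, n, d) and a pair bias of shape (h, n, n); the layout and function name are illustrative assumptions, not necessarily this kernel's exact interface:

    import torch

    def naive_triangle_attention(q, k, v, b):
        """Pure-PyTorch triangle self attention (starting-node form).

        q, k, v: (h, n, n, d) per-head projections of the pair representation
        b:       (h, n, n)    pair bias added to the attention logits
        """
        d = q.shape[-1]
        # logits[h, i, j, k] = <q[h, i, j], k[h, i, k]> / sqrt(d) + b[h, j, k]
        # This materializes an (h, n, n, n) tensor -- the n^3 memory term.
        logits = torch.einsum("hijd,hikd->hijk", q, k) / d**0.5
        logits = logits + b[:, None, :, :]
        attn = logits.softmax(dim=-1)
        # out[h, i, j] = sum_k attn[h, i, j, k] * v[h, i, k]
        return torch.einsum("hijk,hikd->hijd", attn, v)

A flash-attention-style fused kernel computes the same result tile by tile, so the full logits tensor never has to be materialized.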

  • n^2 memory complexity (vs n^3 for pure PyTorch; see the worked numbers after this list).
  • Faster (~2x) backward pass than the next fastest implementation I could find (the DS4S evoformer kernel).
  • Faster (~4x) forward pass than the next fastest implementation I could find (the DS4S evoformer kernel).
  • As far as I can tell, faster than the naive implementation.
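
To put the memory gap in concrete terms, a quick back-of-the-envelope (the sizes here are illustrative, not the benchmark settings):

    h, n = 4, 1024                 # illustrative head count and sequence length
    bytes_per_el = 2               # bfloat16

    naive_logits = h * n**3 * bytes_per_el  # (h, n, n, n) tensor the naive path materializes
    fused_state = h * n**2 * bytes_per_el   # O(n^2) per-head state a fused kernel keeps

    print(f"naive logits: {naive_logits / 2**30:.1f} GiB")  # 8.0 GiB
    print(f"fused state:  {fused_state / 2**20:.1f} MiB")   # 8.0 MiB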

Plots

All benchmarks were run on an RTX 3090 in bfloat16.

Forward

[Plots: TSA forward runtime; TSA forward memory]

Backward

[Plots: TSA backward runtime; TSA backward memory]

Todos:

  • [ ] Try to train a model with it.
  • [ ] Can we perform any of dq/db/dkv transposed?
  • [ ] Rewrite the autotuner.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
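
For typical use, the package can be installed straight from PyPI with pip install trifast, which will pick the right artifact for your platform; note that the Triton kernel itself will need a supported GPU at runtime.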

Source Distribution

trifast-0.1.2.tar.gz (15.9 MB)


File details

Details for the file trifast-0.1.2.tar.gz.

File metadata

  • Download URL: trifast-0.1.2.tar.gz
  • Upload date:
  • Size: 15.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.2

File hashes

Hashes for trifast-0.1.2.tar.gz

  • SHA256: 3bf6cc93c7d76d3fb273b8f7b7df49be5421379ace1b633f00ba963df88366ea
  • MD5: f5decc408f708869c5fd97d5516aaecd
  • BLAKE2b-256: 3b3730e40f71a544aabddeacc3a9144d75633a212f8f19d798573995f0876fcf

See the pip documentation for more details on using hashes.
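
For example, a downloaded file can be checked against the SHA256 digest above using only the standard library:

    import hashlib

    expected = "3bf6cc93c7d76d3fb273b8f7b7df49be5421379ace1b633f00ba963df88366ea"

    # Stream the file in chunks so the 15.9 MB sdist never has to sit in memory at once.
    sha256 = hashlib.sha256()
    with open("trifast-0.1.2.tar.gz", "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            sha256.update(chunk)

    assert sha256.hexdigest() == expected, "hash mismatch -- do not install this file"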
