Skip to main content

General-purpose Multimodal Transformer with Linear Complexity Attention Mechanism.

Project description

LinMulT

General-purpose Multimodal Transformer with Linear-Complexity Attention

CI Coverage Docs GitHub Release PyPI Python PyTorch Code style: ruff License


LinMulT is a modular Transformer library designed for multimodal sequence modelling. It handles variable-length inputs across any number of modalities, supports missing-modality scenarios, and offers six attention variants ranging from O(N²) softmax to O(N·s) gated linear attention; all behind a single config file.

Features

Multiple modalities 1–N input sequences with independent lengths and feature dims
Standard attention softmax — quadratic complexity for baselines and ablations
Efficient attention linear, performer, flash, bigbird — sub-quadratic complexity
TCN smoothing optional per-modality temporal convolution before cross-modal attention
Flexible heads sequence, aggregated, upsample, downsample — mix freely
Missing modalities zero-mask a modality; model handles it gracefully
Config-driven dict or YAML; no subclassing required

Installation

pip install linmult

For development:

git clone https://github.com/fodorad/linmult
cd linmult
uv sync --extra dev --extra docs
make check

Quick start

LinT — single-modality transformer

import torch
from linmult import LinT

x = torch.rand(8, 1500, 25)  # (batch, time, features)

model = LinT({
    'input_feature_dim': 25,
    'heads': [{'name': 'out', 'type': 'simple', 'output_dim': 5}],
    'time_dim_reducer': 'attentionpool',  # aggregate over time
})
result = model(x)
assert result['out'].shape == (8, 5)

LinMulT — multimodal transformer

import torch
from linmult import LinMulT

x1 = torch.rand(8, 1500, 25)  # (batch, time, features)
x2 = torch.rand(8,  450, 35)
x3 = torch.rand(8,  450, 256)

model = LinMulT({
    'input_feature_dim': [25, 35, 256],
    'heads': [{'name': 'sentiment', 'type': 'simple', 'output_dim': 3}],
    'time_dim_reducer': 'gap',
})
result = model([x1, x2, x3])
assert result['sentiment'].shape == (8, 3)

Switching attention type

model = LinT({
    'input_feature_dim': 64,
    'heads': [{'name': 'out', 'type': 'simple', 'output_dim': 10}],
    'attention_type': 'flash',        # linear, performer, flash, bigbird, softmax, mha
    'flash_query_key_dim': 32,        # flash (GAU) scoring dimension
})

Documentation

API reference

Config reference

Quick-start notebook

Attention benchmark

UR-Funny training example


Similar projects using LinMulT

BlinkLinMulT (2023)

LinMulT trained for blink presence detection and eye state recognition across 7 public benchmark databases.

PersonalityLinMulT (2022)

LinMulT trained for Big Five personality trait estimation and sentiment analysis (MOSI, MOSEI, First Impressions V2).


Citation

If you found this work helpful, please cite the relevant paper:

Eye blink detection (2023)

@article{blinklinmult-fodor23,
  title   = {BlinkLinMulT: Transformer-based Eye Blink Detection},
  author  = {Fodor, {\'A}d{\'a}m and Fenech, Kristian and L{\H{o}}rincz, Andr{\'a}s},
  journal = {Journal of Imaging},
  pages   = {1--19},
  year    = {2023}
}

Personality and sentiment estimation (2022)

@InProceedings{pmlr-v173-fodor22a,
  title     = {Multimodal Sentiment and Personality Perception Under Speech:
               A Comparison of Transformer-based Architectures},
  author    = {Fodor, {\'A}d{\'a}m and Saboundji, Rachid R. and
               Jacques Junior, Julio C. S. and Escalera, Sergio and
               Gallardo-Pujol, David and L{\H{o}}rincz, Andr{\'a}s},
  booktitle = {Understanding Social Behavior in Dyadic and Small Group Interactions},
  pages     = {218--241},
  year      = {2022},
  volume    = {173},
  series    = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
  url       = {https://proceedings.mlr.press/v173/fodor22a.html}
}

Contact

Ádám Fodoradamfodor.com · fodorad201@gmail.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

linmult-2.1.0.tar.gz (167.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

linmult-2.1.0-py3-none-any.whl (44.0 kB view details)

Uploaded Python 3

File details

Details for the file linmult-2.1.0.tar.gz.

File metadata

  • Download URL: linmult-2.1.0.tar.gz
  • Upload date:
  • Size: 167.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for linmult-2.1.0.tar.gz
Algorithm Hash digest
SHA256 e4cb83b572fe54da13e0a5ed32ef6079e62600c560f68708ce6e7be4befad060
MD5 0a500c1b2efdee1647efe97e9a5473db
BLAKE2b-256 f7a0cdde3234bf3fc0f595ed3c469e04cb79d30bb130751be1148d148e6faf0a

See more details on using hashes here.

Provenance

The following attestation bundles were made for linmult-2.1.0.tar.gz:

Publisher: cd.yml on fodorad/LinMulT

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file linmult-2.1.0-py3-none-any.whl.

File metadata

  • Download URL: linmult-2.1.0-py3-none-any.whl
  • Upload date:
  • Size: 44.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for linmult-2.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 383bbff7fc65e4efbe62a659ad6a8ed0df7fc55ee1d25277a5e157ac4e58dd48
MD5 3dcad80af02e855965cd88551bf78f3d
BLAKE2b-256 495881034660167213a0aa2a5af12d129bd05daf84bd6f2818e57b8bf6c723ba

See more details on using hashes here.

Provenance

The following attestation bundles were made for linmult-2.1.0-py3-none-any.whl:

Publisher: cd.yml on fodorad/LinMulT

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page