Skip to main content

Implementation of MultiScreen

Project description

MultiScreen

Implementation of Multiscreen proposed by Ken Nakanishi for Screening is Enough

Basically it is a non-softmax attention with ReLU squared activation, content similarity thresholding, and aggressive normalization of the values.

Install

$ pip install multiscreen

Usage

import torch
from multiscreen import MultiScreen

multi_screen = MultiScreen(
    num_tokens = 256,
    dim = 512,
    depth = 6,
    heads = 8,
    dim_keys = 16,     # paper says 16 or 32
    dim_values = 64    # paper says 64 or 128
)

token_ids = torch.randint(0, 256, (1, 1024))

logits = multi_screen(token_ids)
assert logits.shape == (1, 1024, 256)

Enwik8

$ uv run train.py

Citations

@misc{nakanishi2026screening,
    title   = {Screening Is Enough},
    author  = {Ken M. Nakanishi},
    year    = {2026},
    eprint  = {2604.01178},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG},
    url     = {https://arxiv.org/abs/2604.01178},
}
@misc{zhang2021sparse,
    title   = {Sparse Attention with Linear Units},
    author  = {Biao Zhang and Ivan Titov and Rico Sennrich},
    year    = {2021},
    eprint  = {2104.07012},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multiscreen-0.1.5.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multiscreen-0.1.5-py3-none-any.whl (7.3 kB view details)

Uploaded Python 3

File details

Details for the file multiscreen-0.1.5.tar.gz.

File metadata

  • Download URL: multiscreen-0.1.5.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for multiscreen-0.1.5.tar.gz
Algorithm Hash digest
SHA256 2164dfcbc6c1cf85d86630561a653c9bd5b8c2d7f8391a1ba95f7c3c69a7ba9f
MD5 cac3d8aaea9cf781b6017e6c1f6209b8
BLAKE2b-256 fb1ffa855d80e04f46370a11aa651b6b8271189d97b08f10dafe977b2f4b5885

See more details on using hashes here.

File details

Details for the file multiscreen-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for multiscreen-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7bcea505ffe89c74bbee9b84244dbcba711228ecef815ec3fcdca58bb27bdda1
MD5 3501c7e01276977b17bb3fcfbb982226
BLAKE2b-256 d67428eb46a2501ccbe14c2dde50221f01fe5de3fe8b4b76aac8763354ebc227

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page