A Tensorflow/Keras implementation of the Neuronal Attention Circuit (NAC) and variants.
Project description
keras-nac
This repository contains TensorFlow/Keras implementations of the following research papers:
- FLUID: Continuous-Time Hyperconnected Sparse Transformer for Sink-Free Learning
- Neuronal Attention Circuit (NAC) for Representation Learning
- Neuronal Stochastic Attention Circuit (NSAC) for Probabilistic Representation Learning
Installation
pip install keras-nac
Requirements
- Python >= 3.10
- TensorFlow >= 2.18.0
Usage Examples 
These layers can be used as drop-in components inside TensorFlow/Keras models.
1. Liquid Attention Network (LAN)
import tensorflow as tf
from keras_nac import layers
inputs = tf.keras.Input(shape=(1, 1))
attn = layers.LAN(
d_model=64, # Dimension of the model of LAN
num_heads=16, # Number of attention heads of LAN
topk=8, # Number of top-k attention interactions
euler_steps=6, # Number of Euler steps
activation="sigmoid", # Activation function
use_sink_gate=True, # Use Attention Sink Gate
return_sequences=False, # Return full sequences if True, else last output
return_attention=False # Return attention weights if True
)(inputs)
outputs = tf.keras.layers.Dense(2)(attn)
model = tf.keras.Model(inputs, outputs)
model.compile(
optimizer="adam",
loss="mse",
metrics=["mae"]
)
2. FLUID Transformer
import tensorflow as tf
from keras_nac import layers
inputs = tf.keras.Input(shape=(1, 1))
x = layers.FLUID(
d_model=64, # Dimension of the model of LAN
num_heads=16, # Number of attention heads of LAN
num_layers=1, # Number of stacked encoder/decoder layers
ff_dim=32, # Dimension of the feed-forward network
delta_t= 0.01, # Time-step for the Liquid Attention
euler_steps=5, # Number of Euler steps for Liquid Attention
topk=8, # Number of top-k attention interactions
expansion_rate=2, # Expansion factor for feed-forward layers
use_sink_gate=True, # Enable sink gate mechanism
use_pairwise=False, # disable top-k sparsity if True
enable_hc=True, # Enable hyper-connections if True, Otherwise -> Residual connections
dynamic_hc=True, # Enable Liquid hyper-connections if True, Otherwise -> Static
dropout=0.0, # Dropout rate
max_len=1000, # Maximum sequence length of positional encoder
return_attention=False, # Return attention weights if True
)(inputs)
x = tf.keras.layers.Activation("sigmoid")(x)
x = tf.keras.layers.Flatten()(x)
outputs = tf.keras.layers.Dense(1)(x)
model = tf.keras.Model(inputs, outputs)
model.compile(
optimizer="adam",
loss="mse",
metrics=["mae"]
)
3. Neuronal Attention Circuit (NAC)
import tensorflow as tf
from keras_nac import layers
inputs = tf.keras.Input(shape=(1, 1))
x = layers.NAC(
d_model=64, # Dimension of the model
num_heads=16, # Number of attention heads
mode='exact', # Computation mode: 'exact', 'euler', or 'steady'
topk=8, # Number of top-k pairwise interactions
delta_t=0.5, # Time step for Euler mode
sparsity=0.5, # Sparsity level for NCP wiring
euler_steps=6, # Number of Euler integration steps
dropout=0.0, # Dropout rate
tau_epsilon=1e-5, # Small positive value for temporal head
activation='sigmoid', # Activation function
use_riemann_sum=True # Use Reimann-sum integration if True, else standard weighted sum
return_sequences=False, # Return full sequences if True, else last output
return_attention=False, # Return attention weights if True
return_cell_state=False # Return cell-level state if True
)(inputs)
outputs = tf.keras.layers.Dense(1)(x)
model = tf.keras.Model(inputs, outputs)
model.compile(
optimizer="adam",
loss="mse",
metrics=["mae"]
)
4. Neuronal Stochastic Attention Circuit (NSAC)
import tensorflow as tf
from keras_nac import layers, models, losses
def stochastic_model_fn():
inputs = tf.keras.Input(shape=(1, 1))
mean, log_std = OUWrap(
NAC(d_model=64, num_heads=16, topk=8, sparsity=0.5),
output_dim=1, # Output dimension for regression
bn_mean=0.0, # Brownian mean
bn_std=0.1, # Brownian standard deviation
activation='sigmoid', # Activation function
return_sequences=False, # Return full sequences if True, else last output
return_attention=False, # Return attention weights if True
return_cell_state=False, # Return cell potentials if True
)(inputs)
return tf.keras.Model(inputs, [mean, log_std])
model = NSAC(stochastic_model_fn(),
mc_samples=5, # Monte-Carlo steps
ood_mean=0.0, # OOD generating noise mean
ood_std=5.0 # OOD generating noise standard deviation
)
model.compile(
optimizer=tf.keras.optimizers.AdamW(1e-3),
loss=NSACLoss(),
)
model.compile(
optimizer=tf.keras.optimizers.AdamW(1e-3),
loss=losses.NSACLoss(lambda_reg=0.5),
)
Citation
@article{razzaq2025neuronal,
title={Neuronal Attention Circuit (NAC) for Representation Learning},
author={Razzaq, Waleed and Kanjaraway, Izis and Zhao, Yun-Bo},
journal={arXiv preprint arXiv:2512.10282},
year={2025}
}
@article{razzaq2026fluid,
title={FLUID: Continuous-Time Hyperconnected Sparse Transformer for Sink-Free Learning},
author={Razzaq, Waleed and Zhao, Yun-Bo},
journal={arXiv preprint arXiv:2605.04421},
year={2026}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
keras_nac-0.0.3-py3-none-any.whl
(24.0 kB
view details)
File details
Details for the file keras_nac-0.0.3-py3-none-any.whl.
File metadata
- Download URL: keras_nac-0.0.3-py3-none-any.whl
- Upload date:
- Size: 24.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
060a1f2699345fbbef772f056e92760125368730a4b32efa24757db2ee93e0d0
|
|
| MD5 |
1c76917c17ffb817c27dc7dfecdee19e
|
|
| BLAKE2b-256 |
a1123bbbeabdb59d1c3328f20e58dd55bd60a1d4d000b3e36fd8b09d30d9c984
|