
Neural-Matter Network (NMN) - Advanced neural network layers with attention mechanisms

Project description

NMN Logo

โš›๏ธ NMN โ€” Neural Matter Networks

Not the neurons we want, but the neurons we need

Activation-free neural layers that learn non-linearity through geometric operations


📚 Documentation · 📄 Read the Paper · 📝 Read the Blog · 🐛 Report Bug · 🌐 Azetta.ai


🎯 TL;DR

NMN replaces traditional Linear + ReLU with a single geometric operation that learns non-linearity without activation functions:

# Traditional approach
y = relu(linear(x))  # dot product → activation

# NMN approach
y = yat(x)  # geometric operation with built-in non-linearity

The Yat-Product (ⵟ) balances similarity and distance to create inherently non-linear transformations; no activations needed.
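As a plain-Python sketch (a reference illustration, not the library's optimized implementation), the Yat-Product of a weight vector and an input vector looks like this:

```python
def yat(w, x, eps=1e-5):
    """Yat-Product: squared dot product over squared Euclidean distance."""
    dot = sum(wi * xi for wi, xi in zip(w, x))              # similarity term
    dist_sq = sum((wi - xi) ** 2 for wi, xi in zip(w, x))   # proximity term
    return dot * dot / (dist_sq + eps)

print(yat([1.0, 0.0], [1.0, 0.0]))  # identical vectors: distance 0, output ~1/eps
print(yat([1.0, 0.0], [0.0, 1.0]))  # orthogonal vectors: output 0.0
```

The output is large only when the input is both aligned with and close to the weights, which is where the built-in non-linearity comes from.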


✨ Key Features

| Feature | Description |
| --- | --- |
| 🔥 Activation-Free | Learn complex non-linear relationships without ReLU, sigmoid, or tanh |
| 🌐 Multi-Framework | PyTorch, TensorFlow, Keras, Flax (Linen & NNX) |
| 🧮 Geometric Foundation | Based on a distance-similarity tradeoff, not just correlations |
| ✅ Full Framework Parity | Dense, Conv, ConvTranspose, Attention, Embedding, and Squashers across all 5 frameworks |
| 🧠 Complete Layer Suite | Dense, Conv1D/2D/3D, ConvTranspose1D/2D/3D, Multi-Head Attention, Embeddings |
| ⚡ Production Ready | Comprehensive tests, CI/CD, high code coverage |

๐Ÿ“ The Mathematics

Yat-Product (โตŸ)

The core operation that powers NMN:

$$ โตŸ(\mathbf{w}, \mathbf{x}) = \frac{\langle \mathbf{w}, \mathbf{x} \rangle^2}{|\mathbf{w} - \mathbf{x}|^2 + \epsilon} $$

๐Ÿ” Geometric Interpretation (click to expand)

Rewriting in terms of norms and angles:

$$ ⵟ(\mathbf{w}, \mathbf{x}) = \frac{\|\mathbf{w}\|^2 \|\mathbf{x}\|^2 \cos^2\theta}{\|\mathbf{w}\|^2 - 2\langle\mathbf{w}, \mathbf{x}\rangle + \|\mathbf{x}\|^2 + \epsilon} $$

Output is maximized when:

  • ✅ Vectors are aligned (small θ → large cos²θ)
  • ✅ Vectors are close (small Euclidean distance)
  • ✅ Vectors have large magnitude (amplifies the signal)

This creates a fundamentally different learning dynamic:

| Traditional Neuron | Yat Neuron |
| --- | --- |
| Measures correlation only | Balances similarity AND proximity |
| Requires activation for non-linearity | Non-linearity is intrinsic |
| Can fire for distant but aligned vectors | Penalizes distance between w and x |
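The last row is easy to check numerically (an illustrative pure-Python sketch, not the library code):

```python
def yat(w, x, eps=1e-5):
    dot = sum(wi * xi for wi, xi in zip(w, x))
    dist_sq = sum((wi - xi) ** 2 for wi, xi in zip(w, x))
    return dot * dot / (dist_sq + eps)

w = [1.0, 0.0]
near = [1.0, 0.0]    # aligned and close to w
far  = [10.0, 0.0]   # perfectly aligned but far from w

dot_near = sum(a * b for a, b in zip(w, near))  # 1.0
dot_far  = sum(a * b for a, b in zip(w, far))   # 10.0 -> a linear neuron fires *harder*
print(yat(w, near))  # ~1e5 (distance ~0 dominates)
print(yat(w, far))   # 100 / (81 + eps) ~ 1.23 (distance penalized)
```

A correlation-based neuron rewards the distant vector; the Yat neuron suppresses it.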

Yat-Convolution (ⵟ*)

The same principle applied to local patches:

$$ ⵟ^*(\mathbf{W}, \mathbf{X}) = \frac{\left(\sum_{i,j} w_{ij} \cdot x_{ij}\right)^2}{\sum_{i,j}(w_{ij} - x_{ij})^2 + \epsilon} $$

Where W is the kernel and X is the input patch.
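For a single patch, the formula reduces to scalar arithmetic over the kernel entries (a minimal sketch, ignoring striding and padding):

```python
def yat_conv_patch(W, X, eps=1e-5):
    """Yat-Convolution response of kernel W on one same-sized patch X."""
    s = sum(w * x for wrow, xrow in zip(W, X) for w, x in zip(wrow, xrow))
    d = sum((w - x) ** 2 for wrow, xrow in zip(W, X) for w, x in zip(wrow, xrow))
    return s * s / (d + eps)

kernel = [[1.0, 0.0], [0.0, 1.0]]
match  = [[1.0, 0.0], [0.0, 1.0]]   # patch identical to the kernel
other  = [[0.0, 1.0], [1.0, 0.0]]   # non-overlapping pattern

print(yat_conv_patch(kernel, match))  # large: exact template match
print(yat_conv_patch(kernel, other))  # 0.0: zero overlap in the numerator
```

Sliding this over an image yields a feature map that peaks where the local patch is both correlated with and close to the kernel.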


🚀 Quick Start

Installation

pip install nmn

# Framework-specific installations
pip install "nmn[torch]"    # PyTorch
pip install "nmn[keras]"    # Keras/TensorFlow
pip install "nmn[nnx]"      # Flax NNX (JAX)
pip install "nmn[linen]"    # Flax Linen (JAX)
pip install "nmn[all]"      # Everything

Basic Usage

PyTorch

import torch
from nmn.torch import YatNMN

layer = YatNMN(
    in_features=128,
    out_features=64,
    epsilon=1e-5
)

x = torch.randn(32, 128)
y = layer(x)  # (32, 64), non-linear output!

Keras

import keras
from nmn.keras import YatNMN

layer = YatNMN(
    features=64,
    epsilon=1e-5
)

x = keras.ops.zeros((32, 128))
y = layer(x)  # (32, 64)

Flax NNX

import jax.numpy as jnp
from flax import nnx
from nmn.nnx import YatNMN

layer = YatNMN(
    in_features=128,
    out_features=64,
    rngs=nnx.Rngs(0)
)

x = jnp.zeros((32, 128))
y = layer(x)  # (32, 64)

TensorFlow

import tensorflow as tf
from nmn.tf import YatNMN

layer = YatNMN(features=64)

x = tf.zeros((32, 128))
y = layer(x)  # (32, 64)

📦 Layer Support Matrix

All layers are available across all 5 frameworks with verified numerical equivalence.

| Layer | PyTorch | TensorFlow | Keras | Flax NNX | Flax Linen |
| --- | --- | --- | --- | --- | --- |
| YatNMN (Dense) | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConv1D | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConv2D | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConv3D | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConvTranspose1D | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConvTranspose2D | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatConvTranspose3D | ✅ | ✅ | ✅ | ✅ | ✅ |
| MultiHeadAttention | ✅ | ✅ | ✅ | ✅ | ✅ |
| YatEmbed | ✅ | ✅ | ✅ | ✅ | ✅ |
| Squashers | ✅ | ✅ | ✅ | ✅ | ✅ |

Advanced Attention Variants (Flax NNX)

| Variant | Description | Complexity |
| --- | --- | --- |
| RotaryYatAttention | YAT + Rotary Position Embeddings (RoPE) | O(n²) |
| Spherical YAT-Performer | YAT + FAVOR+ random features | O(n) |

🔬 Cross-Framework Consistency

All implementations are verified to produce numerically equivalent outputs given identical inputs and weights:

| Framework Pair | Max Error | Status |
| --- | --- | --- |
| PyTorch ↔ TensorFlow | < 1e-6 | ✅ PASS |
| PyTorch ↔ Keras | < 1e-6 | ✅ PASS |
| PyTorch ↔ Flax NNX | < 1e-6 | ✅ PASS |
| PyTorch ↔ Flax Linen | < 1e-6 | ✅ PASS |
| TensorFlow ↔ Keras | < 1e-7 | ✅ PASS |
| Flax NNX ↔ Flax Linen | < 1e-7 | ✅ PASS |

โš™๏ธ Advanced Features

Attention Mechanisms

# PyTorch
from nmn.torch import MultiHeadYatAttention

attn = MultiHeadYatAttention(embed_dim=512, num_heads=8)
output = attn(query, key, value)

# Flax NNX: with Rotary Position Embeddings
from nmn.nnx import RotaryYatAttention
from flax import nnx

attn = RotaryYatAttention(
    num_heads=8,
    in_features=512,
    rngs=nnx.Rngs(0)
)
output = attn(x)

# Flax NNX: Spherical YAT-Performer (O(n) linear complexity)
from nmn.nnx import MultiHeadAttention

attn = MultiHeadAttention(
    num_heads=8,
    in_features=512,
    use_performer=True,
    rngs=nnx.Rngs(0)
)
output = attn(x)

Embeddings

# PyTorch
from nmn.torch import YatEmbed

embed = YatEmbed(num_embeddings=10000, embedding_dim=128)
output = embed(token_ids)

# Flax NNX
from nmn.nnx import Embed
from flax import nnx

embed = Embed(
    num_embeddings=10000,
    features=128,
    constant_alpha=True,
    rngs=nnx.Rngs(0)
)
output = embed(token_ids)
# YAT attend for attention-based retrieval
scores = embed.attend(query)
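Conceptually (an illustrative sketch, not the library's implementation), attend scores each embedding row against the query with the Yat-Product, so retrieval favors rows that are both aligned with and close to the query:

```python
def yat(w, x, eps=1e-5):
    dot = sum(wi * xi for wi, xi in zip(w, x))
    dist_sq = sum((wi - xi) ** 2 for wi, xi in zip(w, x))
    return dot * dot / (dist_sq + eps)

# Toy embedding table: 3 embeddings of dimension 2
table = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]

def attend_sketch(query):
    # One Yat score per embedding row (higher = more similar AND closer)
    return [yat(row, query) for row in table]

print(attend_sketch([1.0, 0.0]))  # first row dominates
```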

Squashing Functions

Alternatives to standard activation functions, available in all frameworks:

from nmn.nnx import softermax, softer_sigmoid, soft_tanh

y1 = softermax(x, n=2)              # Smoother softmax with power n
y2 = softer_sigmoid(x, sharpness=1) # Smooth sigmoid variant
y3 = soft_tanh(x)                   # Smooth tanh variant
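The exact squasher definitions live in the library source; as a rough, hypothetical illustration of the idea only (a power-based softmax alternative, NOT necessarily nmn's actual softermax formula):

```python
def softermax_sketch(xs, n=2, eps=1e-9):
    # Hypothetical power-based alternative to softmax: no exponentials,
    # so the tails are heavier for small n. This is an assumption for
    # illustration and may differ from nmn's softermax.
    powered = [max(x, 0.0) ** n for x in xs]
    total = sum(powered) + eps
    return [p / total for p in powered]

print(softermax_sketch([1.0, 2.0, 3.0], n=2))  # weights proportional to 1:4:9
```

Like softmax, the output is a probability-like vector, but the sharpness is controlled polynomially by n rather than exponentially by a temperature.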

See EXAMPLES.md for comprehensive usage guides including:

  • Framework-specific quick starts (PyTorch, Keras, TensorFlow, Flax)
  • Architecture examples (CNN, Transformer)
  • Advanced features (custom squashers, attention)

Quick run:

# PyTorch Examples
python src/nmn/torch/examples/quick_example.py         # Quick demo
python src/nmn/torch/examples/vision/resnet_training.py # ResNet training

# Flax NNX Examples
python src/nmn/nnx/examples/vision/aether_resnet50_tpu.py  # ResNet50 on TPU
python src/nmn/nnx/examples/language/m3za.py                # MiniBERT pre-training
python src/nmn/nnx/examples/language/m3za_perf.py           # Performance evaluation

🧪 Testing

Comprehensive test suite with cross-framework validation:

# Install test dependencies
pip install "nmn[test]"

# Run all tests
pytest tests/ -v

# Run specific framework tests
pytest tests/test_torch/ -v      # PyTorch
pytest tests/test_keras/ -v      # Keras
pytest tests/test_nnx/ -v        # Flax NNX

# Cross-framework consistency validation
pytest tests/integration/test_cross_framework_consistency.py -v

# With coverage report
pytest tests/ --cov=nmn --cov-report=html

📚 Theoretical Foundation

Based on the research papers:

Deep Learning 2.0: Artificial Neurons that Matter — Reject Correlation, Embrace Orthogonality

Deep Learning 2.1: Mind and Cosmos — Towards Cosmos-Inspired Interpretable Neural Networks

Why Yat-Product?

Traditional neurons compute: $y = \sigma(\mathbf{w}^\top \mathbf{x} + b)$

This has limitations:

  • Correlation-based: Only measures alignment, ignores proximity
  • Requires activation: Non-linearity is external
  • Spurious activations: Can fire strongly for distant but aligned vectors

The Yat-Product addresses these by combining:

  1. Squared dot product (similarity) in the numerator
  2. Squared distance (proximity) in the denominator
  3. Epsilon for numerical stability

The result is a neuron that responds geometrically: it activates when inputs are both similar AND close to the weights.
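The intrinsic non-linearity is also easy to verify numerically: scaling the input does not scale the output proportionally, so no external activation is needed to break linearity (pure-Python sketch):

```python
def yat(w, x, eps=1e-5):
    dot = sum(wi * xi for wi, xi in zip(w, x))
    dist_sq = sum((wi - xi) ** 2 for wi, xi in zip(w, x))
    return dot * dot / (dist_sq + eps)

w = [1.0, 2.0]
x = [0.5, 0.5]
y1 = yat(w, x)
y2 = yat(w, [2 * v for v in x])  # double the input
print(y1, y2, 2 * y1)            # y2 != 2 * y1: no homogeneity, hence non-linear
```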


๐Ÿค Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

# Development setup
git clone https://github.com/azettaai/nmn.git
cd nmn
pip install -e ".[dev,test]"

# Run tests
pytest tests/ -v

# Format code
black src/ tests/
isort src/ tests/

Areas for contribution:

  • ๐Ÿ› Bug fixes (open issues)
  • โœจ New layer types (normalization, graph, etc.)
  • ๐Ÿ“š Documentation and tutorials
  • โšก Performance optimizations
  • ๐ŸŽจ Example applications

📖 Quick API Reference

Common Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| in_features | int | Input dimension (Dense) or channels (Conv) |
| out_features | int | Output dimension or filters |
| kernel_size | int \| tuple | Convolution kernel size |
| epsilon | float | Numerical stability constant (default: 1e-5) |
| use_bias | bool | Include bias term (default: True) |
| constant_alpha | bool | Use fixed √2 scaling (default: varies) |
| spherical | bool | Enable spherical mode (default: False) |

Framework Imports

# PyTorch
from nmn.torch import YatNMN, YatConv2D, MultiHeadYatAttention, YatEmbed
from nmn.torch import softermax, softer_sigmoid, soft_tanh

# Keras
from nmn.keras import YatNMN, YatConv2D, MultiHeadYatAttention, YatEmbed
from nmn.keras import softermax, softer_sigmoid, soft_tanh

# TensorFlow
from nmn.tf import YatNMN, YatConv2D, MultiHeadYatAttention, YatEmbed
from nmn.tf import softermax, softer_sigmoid, soft_tanh

# Flax NNX (includes advanced attention variants)
from nmn.nnx import YatNMN, YatConv, MultiHeadAttention, Embed
from nmn.nnx import RotaryYatAttention, softermax

# Flax Linen
from nmn.linen import YatNMN, YatConv2D, MultiHeadAttention, YatEmbed
from nmn.linen import softermax, softer_sigmoid, soft_tanh

📋 Full reference → EXAMPLES.md


📄 Citation

If you use NMN in your research, please cite:

@software{nmn2024,
  author = {Bouhsine, Taha},
  title = {NMN: Neural Matter Networks},
  year = {2024},
  url = {https://github.com/azettaai/nmn}
}

@article{bouhsine2024dl2,
  author = {Bouhsine, Taha},
  title = {Deep Learning 2.0: Artificial Neurons that Matter --- Reject Correlation, Embrace Orthogonality},
  year = {2024}
}

📬 Support & Community


📜 License

AGPL-3.0: Free for personal, academic, and commercial use with attribution.

If you modify and deploy on a network, you must share the source code.

For alternative licensing, contact us at taha@azetta.ai.


๐Ÿ™ Acknowledgments

This project was originally developed under the mlnomadpy organization and is now maintained by Azetta.ai.

The foundations of NMN were established through extensive research and community contributions. We're grateful to everyone who has contributed code, feedback, and ideas to make this project better.


Built with ❤️ by Azetta.ai · Originally created by ML Nomad

Project details


Download files

Download the file for your platform.

Source Distribution

nmn-0.2.24.tar.gz (543.0 kB)

Uploaded Source

Built Distribution


nmn-0.2.24-py3-none-any.whl (192.1 kB)

Uploaded Python 3

File details

Details for the file nmn-0.2.24.tar.gz.

File metadata

  • Download URL: nmn-0.2.24.tar.gz
  • Upload date:
  • Size: 543.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nmn-0.2.24.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | a22fb27ff1efa0b189606add787a083fa2e20780dadf7c9284f7f7d6ab2823db |
| MD5 | cb02780ba9336ff2b66dd548e0d1ed7d |
| BLAKE2b-256 | 353ac89075f5d17b9c23185368a7807f99077c67af84695a4d8966c3538a8084 |


Provenance

The following attestation bundles were made for nmn-0.2.24.tar.gz:

Publisher: publish.yml on azettaai/nmn

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nmn-0.2.24-py3-none-any.whl.

File metadata

  • Download URL: nmn-0.2.24-py3-none-any.whl
  • Upload date:
  • Size: 192.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nmn-0.2.24-py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | f72a2deb670acedf0995003220ac62ae5306171ad8d4a4b5852df8424e771c18 |
| MD5 | 2ff072c17502e3016729b4a699b6468d |
| BLAKE2b-256 | a548924158164a5fbaa01cd0cfa76de54fbed2dd4949da451d2d59a1fce2fcea |


Provenance

The following attestation bundles were made for nmn-0.2.24-py3-none-any.whl:

Publisher: publish.yml on azettaai/nmn

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
