Dual-timescale hybrid SSM+Attention architecture for efficient robotics and IoT language models

These details have not been verified by PyPI

Project links

Project description

ArcMind

Dual-timescale hybrid SSM+Attention architecture for efficient robotics and IoT.

⚠️ Alpha release — architecture is functional and tested, pretrained weights coming soon. API may change.

What is ArcMind?

ArcMind is a neural architecture purpose-built for robotics and IoT edge deployment. It combines a fast State Space Model (SSM) path for continuous sensor stream processing with a slow exact attention path for episodic memory recall — all at 245K to 10.3M parameters.

The problem

Current approaches to "LMs for robotics" fall into two camps:

Massive VLA models (RT-2, OpenVLA, Pi-Zero) — 7B–55B parameters, cloud-dependent, text-centric tokenization. Can't run on edge hardware.
Shoehorned text SLMs (quantized Phi, TinyLlama on Jetson) — general-purpose text models forced onto constrained devices. Not designed for sensor streams.

Neither is purpose-built for the sensor-stream → reasoning → action loop that robotics and IoT actually need.

The approach

ArcMind introduces a dual-timescale hybrid that mirrors how robotic control actually works:

Fast path (SSM): Processes every sensor frame at hardware rate (100–1000 Hz). O(n) time, O(1) decode memory, no KV cache. Produces smooth, physically plausible control signals.
Slow path (Attention): Tiny exact attention (1–2 layers, 2–4 heads) runs at decision rate (1–10 Hz) over an episodic memory buffer for precise spatial/temporal recall.
Sensor-native tokenization: Raw sensor frames projected directly into model dimension via learned linear layers. No vocabulary table — eliminates the ~40% parameter overhead of embedding tables in small text LMs.
Episodic memory: Fixed-size ring buffer of compressed environment state snapshots. Enables landmark recall and obstacle memory without a growing KV cache.

Installation

Requirements: Python ≥ 3.10, PyTorch ≥ 2.1

pip install arcmind

For development:

git clone https://github.com/jemsbhai/arcmind.git
cd arcmind
pip install -e ".[dev]"
pytest tests/ -v  # 44 tests, all passing

Quick Start

import torch
from arcmind import ArcMindConfig, ArcMindModel

# Create a model from a preset
config = ArcMindConfig.robotics_small()
model = ArcMindModel(config)

# Simulate a sensor stream (batch=1, 100 timesteps, 12 channels)
sensor_data = torch.randn(1, 100, config.num_sensor_channels)

# Forward pass
model.reset_memory(batch_size=1)
actions = model(sensor_data)
print(actions.shape)  # (1, 100, 6)

# Inspect parameter breakdown
for component, count in model.count_parameters().items():
    print(f"  {component}: {count:,}")

Custom Configuration

# Build a custom config for your specific robot/sensor setup
config = ArcMindConfig(
    num_sensor_channels=9,    # e.g., 3-axis accel + 3-axis gyro + 3-axis mag
    d_model=96,
    num_ssm_layers=6,
    ssm_state_dim=12,
    num_attn_layers=1,
    num_attn_heads=3,
    num_memory_slots=32,
    action_dim=4,             # e.g., 4 motor commands
    sensor_freq_hz=200.0,
    decision_freq_hz=20.0,
)
model = ArcMindModel(config)

Model Presets

All parameter counts independently verified via test suite.

Preset	Params	SSM	Attention	Tokenizer	Target Hardware
`iot_tiny`	245K	73.5%	20.3%	0.2%	Cortex-M7, ESP32-S3
`robotics_small`	1.7M	84.7%	11.7%	0.1%	Jetson Orin Nano, RPi 5
`robotics_medium`	10.3M	82.4%	15.3%	0.1%	Desktop GPU, Jetson AGX

Key property: the sensor tokenizer is consistently <1% of total parameters, confirming the embedding-table-free design eliminates the parameter overhead that dominates sub-100M text LMs.

Architecture

Sensor Stream → SensorTokenizer → SSMCore (fast, 100-1000 Hz)
                                      ↓ periodic snapshot
                                 EpisodicMemory (ring buffer)
                                      ↓ read
SSM output → SlowAttention (slow, 1-10 Hz) ← memory slots
                   ↓ gated fusion
              ActionHead → action output

Design rationale:

SSM:Attention parameter ratio of ~5:1 to ~8:1 extends the Granite 4 (9:1) and Jamba (7:1) ratios, justified by sensor streams being temporally smoother than text.
Only 2–4 attention heads for recall, based on retrieval-aware distillation research showing 2–3 heads suffice for recall in SSM hybrids.
Episodic ring buffer adapted from Expansion Span's reserved attention context for distant retrieval, applied here to compressed environment state snapshots.

Project Status

Core architecture (SSM + Attention + Memory + Gating)
Sensor-native tokenizer
Three validated presets (IoT, Robotics-S, Robotics-M)
Full test suite (44 tests)
PyPI package
Training pipeline
Benchmark evaluation (MuJoCo, sensor datasets)
Pretrained weights on HuggingFace
Research paper

Citation

Paper forthcoming. For now:

@software{arcmind2026,
  title={ArcMind: Dual-Timescale Hybrid SSM+Attention for Efficient Robotics and IoT},
  author={Syed, Muntaser},
  year={2026},
  url={https://github.com/jemsbhai/arcmind},
}

License

MIT License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

May 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arcmind-0.1.0.tar.gz (18.1 kB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

arcmind-0.1.0-py3-none-any.whl (15.0 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file arcmind-0.1.0.tar.gz.

File metadata

Download URL: arcmind-0.1.0.tar.gz
Upload date: May 12, 2026
Size: 18.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for arcmind-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`6737f57ace315cf179f03797f8e7e43a083117964ee60d3c1d7f1e49973fab41`
MD5	`19494e55f76e607956683e8b477ffc11`
BLAKE2b-256	`ff4cb46396d8338aef787be69805e4eed843d3b58edb911f0629fc675e68dbfa`

See more details on using hashes here.

File details

Details for the file arcmind-0.1.0-py3-none-any.whl.

File metadata

Download URL: arcmind-0.1.0-py3-none-any.whl
Upload date: May 12, 2026
Size: 15.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for arcmind-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`14d661530812ad6ddd39b7ab99423afe4f043bce392148d4e58fcea8c0d73c5a`
MD5	`e8d76747360136933a2abe865a1a109e`
BLAKE2b-256	`93a1d380ea6e3cd21b39441951c58ed1d2ed95ee31a6d524bd23cdf7f0bd1d0d`

See more details on using hashes here.

arcmind 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ArcMind

What is ArcMind?

The problem

The approach

Installation

Quick Start

Custom Configuration

Model Presets

Architecture

Project Status

Citation

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes