HSL (Holistic Signal Language): a non-learned, byte-level signal encoder for PyTorch — one modality-agnostic 27-D exact base embedding. Δ = per-byte POSITION from a symbolic anchor (byte-independent), Δ² = cross-byte FLOW/momentum, exact complex Fourier + phase; no tokenizer, losslessly invertible.
Project description
HSL — Holistic Signal Language
🇰🇷 이 프로젝트는 개인 시간에 독립적으로 연구·공개한 오픈 연구 산출물입니다. 🇬🇧 This is an independent, open research project — researched and released on personal time.
A non-learned, byte-level signal encoder for PyTorch. Instead of splitting text into tokens, it reads raw bytes holistically as signal: change-rate (Δ, XOR-delta), 2nd-order change (Δ²), 편미분-boundary, exact complex Fourier, and phase — 27 dimensions per byte (35 with raw bits), losslessly invertible. One modality-agnostic input layer for text, image, audio, video — any byte stream.
Everything is information — a fluctuation between 0 and 1. HSL doesn't ask what a token means; it measures how the signal changes, with exact formulas, so the same representation works under every modality. It's one base embedding applied to every modality; which channels help which modality is decided downstream by your model's adapters — nothing is prematurely thrown away at the base.
import hsl_embedding as hsl
feats, phase = hsl.embed(b"hello") # -> Tensor [L, 27], Tensor [L]
emb = hsl.Embedding() # an nn.Module, no parameters (like nn.Embedding)
feats = emb("강아지".encode()) # -> [L, 27]
assert hsl.decode(hsl.encode(b"hello")) == b"hello" # lossless, by construction
Import name: the package is installed as
hsl-embeddingbut imported asimport hsl_embedding(import hslwill fail).embed(data)returns a tuple(feats, phase)—featsis atorch.Tensor [L, 27](per-byte features;[L, 35]withinclude_bits=True) andphaseis atorch.Tensor [L](the raw phase angle θ). Unpack both:feats, phase = hsl.embed(...).
Install
pip install hsl_embedding # import as `import hsl_embedding as hsl` (pip treats - and _ the same)
# deps: numpy, torch
Why not just nn.Embedding?
They solve different problems — this is not a performance claim, it's a "when to use which".
torch.nn.Embedding |
hsl.Embedding |
|
|---|---|---|
| what it is | a learned lookup table (trainable params) | an exact formula (zero params, deterministic) |
| input | a token id (int) |
raw bytes |
| needs | a tokenizer + fixed vocab + training data | nothing — works on any bytes, day one |
| dimensions | opaque, learned | named & interpretable (Δ / Δ² / boundary / Fourier / phase) |
| modality | one tokenizer per modality (text ≠ image ≠ audio) | one substrate for all (byte-native) |
| invertible | no | yes (decode(encode(x)) == x) |
| new scripts / formats | breaks / out-of-vocab | just bytes — never breaks |
They compose. HSL is an input substrate, not a replacement for learned representations: nn.Embedding
learns what tokens mean; HSL gives exact structural signal for free. Stack learned layers on top of
HSL features.
Reach for HSL when you want: tokenizer-free input · one model across modalities · structure/change-aware features · exact reconstruction · small-data or from-scratch training · interpretable input channels.
What each channel captures (and where it's good)
HSL is built from exact formulas, each chosen to carry information a plain learned embedding tends to throw away. The default is the 27-D exact base — the pure change-rate substrate, every channel lossless:
| channel (dims) | exact formula | captures | especially good for |
|---|---|---|---|
Δ dxor 0–7 (8) |
per-byte XOR-delta from the symbolic anchor 0 (each byte from the anchor → byte-INDEPENDENT POSITION) | change / position — where the signal sits, measured from the 0-anchor | edges, topic/region shifts, the modality-shared "rate of change". Measured: shift-detection AUC 0.725 vs content 0.698. |
Δ² d2xor 0–7 (8) |
Δ[byteᵢ] ⊕ Δ[byteᵢ₋₁] (cross-byte) |
flow / momentum (2nd order) — how the per-byte position changes between bytes | sharp corners / onsets; segment cuts, audio attacks, image corners |
| boundary (1) | windowed mean of Δ + 0.5·Δ² (편미분 경계) |
transition-energy salience (1st+2nd derivative) | tokenizer-free segmentation — natural byte/word/chunk cuts without decoding (heuristic; not part of the codec) |
Fourier fft_re0–4, fft_im1–3 (8) |
exact complex 8-bit rFFT (real+imag) | frequency / texture / periodicity — and spectral phase | smooth vs busy, periodic vs random — audio timbre, image texture. Lossless/invertible (irfft → byte) |
| phase cos/sin (2) | exact phasor z = e^{iθ}, θ = 2π·byte/256 |
cyclic relation / angle — exact cos(θᵢ−θⱼ) |
affect / mood and relative/positional structure. Measured: phase-variation tracks the audio affect-line 0.912, better than loudness alone. (momentum_phase=True: z = r·e^{iθ} carries velocity in the magnitude too.) |
The point: a single learned vector blurs all of this together. HSL keeps change (Δ), curvature (Δ²), spectrum (exact Fourier), and phase as separate, exact, interpretable channels — and your model selects which ones each modality needs (no premature compression at the base).
Optional 35-D: include_bits=True prepends the 8 raw byte bits. They're redundant (the per-byte Δ
already encodes each byte losslessly) — an optional extra lens, not part of the base.
Lossless by construction
The features are grounded in a lossless codec, so the substrate is byte-exact:
frame = hsl.encode(b"any bytes \x00\xff")
hsl.decode(frame) == b"any bytes \x00\xff" # True
Δ is a per-byte XOR-delta from the symbolic anchor 0 — each byte is measured from the anchor (so bytes
are independent), and integrating each byte's Δ from 0 recovers it exactly. That's why the raw bits channel
is redundant and can be dropped.
27-D (default) vs 35-D (with raw bits)
hsl.embed(data) # 27-D (default exact base; change-rate + exact Fourier + phase)
hsl.embed(data, include_bits=True) # 35-D (also prepend the 8 raw bits — redundant optional lens)
hsl.embed(data, momentum_phase=True) # 27-D (phasor magnitude also carries |Δbyte| velocity)
hsl.Embedding(include_bits=True).out_dim # 35
Batch
emb = hsl.Embedding()
feats, phase, mask = emb.pack([b"a", b"abcdef"], max_len=8) # [B, L, D], [B, L], [B, L]
Examples
python examples/quickstart.py # bytes in, features out; named channels
python examples/roundtrip_all.py # text / image / audio / video -> embed -> EXACT reconstruction
python examples/vs_nn_embedding.py # nn.Embedding vs hsl.Embedding — when to use which
python examples/benchmark_vs_nn.py # honest capability + speed comparison
roundtrip_all.py — one modality-agnostic encoder, lossless by construction:
modality bytes feat shape reconstruction
----------------------------------------------------------------
text (utf-8) 98 (98, 27) EXACT ✓
image (RGB u8) 3072 (3072, 27) EXACT ✓
audio (PCM i16) 8000 (8000, 27) EXACT ✓
video (6 frames) 4608 (4608, 27) EXACT ✓
Scope (honest)
HSL is a non-learned input substrate — a possibility-proof from an independent, single-GPU project, not a benchmark-beating system. It gives exact structural signal; the meaning still comes from a model you stack on top. See the paper and live demo:
- 📄 Paper: A Feasibility Study of Change-Rate-Based Multimodal Unification (Zenodo)
- 🌐 Live demo: https://holo-demo-p5txmh4dda-as.a.run.app
- 💻 HoLo project: https://github.com/Woojiggun/holo-hsl
License & citation
MIT License — © 2026 Jinhyun Woo (ggunio5782@gmail.com). Free to use, modify, and distribute, including for commercial use — the only condition is that the copyright notice and attribution to Jinhyun Woo are kept. See LICENSE.
@software{woo_hsl_2026,
author = {Jinhyun Woo},
title = {HSL: a byte-native, modality-agnostic signal embedding},
year = {2026},
doi = {10.5281/zenodo.20581805},
url = {https://github.com/Woojiggun/holo-hsl}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hsl_embedding-0.3.0.tar.gz.
File metadata
- Download URL: hsl_embedding-0.3.0.tar.gz
- Upload date:
- Size: 15.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
72a9eea49ecfdf9dcb7bc5ab7451a95defe2c31c4fbad58d317e31c1e3ec26a4
|
|
| MD5 |
1503e9edb58d7424318a825f89c4c451
|
|
| BLAKE2b-256 |
9ce9c7e7a5dc19e75c6ace338cf1b37e6b4df81c26153a0e77d00cc3cf961135
|
File details
Details for the file hsl_embedding-0.3.0-py3-none-any.whl.
File metadata
- Download URL: hsl_embedding-0.3.0-py3-none-any.whl
- Upload date:
- Size: 11.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
54a4d12c1911090b04be4c2c7454779f81891600e5f523a9cb372a27d083c00d
|
|
| MD5 |
e29dcb36484a997c1bca5dea0de0e0cd
|
|
| BLAKE2b-256 |
cc5ffc28affc5a9b9d977df2d31344506e47021ad715ab40409c24cae3e2bebc
|