
Drop-in triadic projection head for any HuggingFace transformer. Adds interpretable prime-factor algebraic signatures at negligible language cost (+1.7% PPL).

Project description

triadic-head

Drop-in triadic projection head for any HuggingFace transformer. Adds interpretable prime-factor semantic signatures at negligible language cost (+1.7% PPL).

What it does

Adds a single linear layer (49K params for GPT-2) that maps hidden states to discrete prime-factor signatures. Each concept becomes a composite integer, e.g. Φ(king) = 2 × 3 × 5 × 7, where each prime represents an active semantic feature.

This enables exact algebraic operations that are impossible with cosine similarity (a plain-arithmetic sketch follows the table):

Operation                              Cosine Similarity   Prime Algebra
"Does A contain all features of B?"    Approximate         Φ(A) % Φ(B) == 0
"What features do A and B share?"      Not possible        GCD(Φ(A), Φ(B))
"Combine features of A and B"          Not possible        LCM(Φ(A), Φ(B))
"A is to B as C is to ?"               Approximate         Exact factor transfer

Install

pip install triadic-head

Quick start

from triadic_head import TriadicWrapper

# Wrap any HuggingFace model
model = TriadicWrapper("gpt2", n_bits=64, align_mode="infonce")

# Train (see examples/train_gpt2.py for full loop)
model.freeze_backbone()
# ... phase 1: train triadic head only ...
model.unfreeze_last_n(2)
# ... phase 2: joint optimization ...

# Encode concepts to prime signatures
sigs = model.encode(["king", "queen", "dog"])

# Compare
result = model.compare("king", "queen")
# {'similarity': 0.89, 'shared_factors': [2, 3, 5, ...], ...}

Training API

# Forward pass returns (logits, triadic_proj, lang_loss)
logits, triadic_proj, lang_loss = model(input_ids, labels=input_ids)

# Triadic loss (4 components: diversity + contrastive + entropy + alignment)
tri_loss = model.triadic_loss(
    triadic_proj,
    input_ids=input_ids,
    alpha=0.05,           # triadic weight (DO NOT exceed 0.10)
    entropy_weight=1.0,   # prevent dead bits
    align_weight=5.0,     # transfer semantic structure from embeddings
    align_mode="infonce", # "mse" | "rank" | "infonce"
)

total_loss = lang_loss + tri_loss
total_loss.backward()
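A minimal phase-1 step assembled from the calls above; the optimizer, learning rate, and the batches iterable are illustrative assumptions (TriadicWrapper is treated as a regular torch.nn.Module), and examples/train_gpt2.py remains the reference for the full two-phase loop:

import torch
from triadic_head import TriadicWrapper

model = TriadicWrapper("gpt2", n_bits=64, align_mode="infonce")
model.freeze_backbone()  # phase 1: only the triadic head receives gradients

# `batches` is any iterable yielding LongTensor token IDs (assumed to exist)
optimizer = torch.optim.AdamW(
    [p for p in model.parameters() if p.requires_grad], lr=1e-4
)

for input_ids in batches:
    logits, triadic_proj, lang_loss = model(input_ids, labels=input_ids)
    tri_loss = model.triadic_loss(
        triadic_proj,
        input_ids=input_ids,
        alpha=0.05,
        entropy_weight=1.0,
        align_weight=5.0,
        align_mode="infonce",
    )
    (lang_loss + tri_loss).backward()
    optimizer.step()
    optimizer.zero_grad()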

Alignment modes

Mode      Best for                                  Why
infonce   Pre-trained models (GPT-2, LLaMA, ...)    Mines positive/negative pairs from rich embeddings
mse       From-scratch training                     Dense local gradients work with weak embeddings
rank      Best analogy accuracy                     Preserves similarity ordering, not absolute values
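align_mode is passed both to the TriadicWrapper constructor and to triadic_loss (presumably the two should agree), so switching modes only means changing that string, e.g.:

model = TriadicWrapper("gpt2", n_bits=64, align_mode="rank")   # best analogy accuracy
# ... and pass align_mode="rank" to model.triadic_loss(...) during training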

Training guide — How long to train

The number of training steps directly determines result quality. Short runs are useful for smoke-testing the pipeline, but will NOT produce reliable semantic signatures. The triadic head needs enough steps to learn real word relationships beyond statistical noise.

Level            Steps      Time (GPT-2, 1 GPU)   What to expect
Smoke test       5,000      ~5 min                Pipeline works, results are mostly noise
Minimum viable   20,000     ~20 min               Basic semantic ordering emerges
Good quality     50,000     ~50 min               Reliable word relationships, gap well above random
Production       100,000+   ~2 hours              Publish-ready signatures

Important: Larger models (LLaMA, Mistral, etc.) need proportionally more steps. The validate() method includes a random baseline — it generates random bit patterns and measures what gap you'd get by pure chance. If your model's gap is close to the random baseline, you need more training.

# Quick smoke test (verify the pipeline works)
python examples/train_gpt2.py --data corpus.txt --phase1-steps 1000 --phase2-steps 4000

# Good quality training
python examples/train_gpt2.py --data corpus.txt --phase1-steps 10000 --phase2-steps 40000

# Production quality
python examples/train_gpt2.py --data corpus.txt --phase1-steps 20000 --phase2-steps 80000

Validation — Did training work?

# Automatic diagnostic: runs standard word groups and checks
# diversity, active bits, and semantic ordering
report = model.validate()

# Output:
# ============================================================
#   TRIADIC HEAD — VALIDATION REPORT
# ============================================================
#   [PASS] diversity: 16/16 unique signatures (100%)
#   [PASS] active_bits: 35.2/64 bits active on avg (55%)
#   [PASS] semantic_ordering: within-group 72% vs between-group 58% (gap +14%)
#   [PASS] random_baseline: model gap +14% vs random baseline +0.3% (signal +13.7%)
# ------------------------------------------------------------
#   RESULT: PASS — Triadic head is producing meaningful signatures.
#   Signal above random: +13.7%
# ============================================================

# Use your own domain-specific word groups:
report = model.validate(word_groups={
    "medical": ["heart", "lung", "brain", "kidney"],
    "legal": ["court", "judge", "law", "trial"],
})

Explore — Discover relationships

# See how any set of words relate to each other
model.explore(["king", "queen", "prince", "dog", "cat", "happy", "sad"])

# Output:
#   SIMILARITY MATRIX
#            king  queen prince    dog    cat  happy    sad
#   king      ---   78%   72%   45%   43%   38%   35%
#   queen    78%    ---   69%   41%   44%   42%   37%
#   ...
#
#   TOP 3 most similar:
#     king <-> queen: 78% (12 shared factors)
#     king <-> prince: 72% (10 shared factors)
#     dog <-> cat: 68% (9 shared factors)

Algebra API

from triadic_head import PrimeMapper, TriadicValidator

mapper = PrimeMapper(n_bits=64)
sig = mapper.encode(projection_values)  # -> composite integer

# Subsumption: does A contain all features of B?
TriadicValidator.subsumes(sig_a, sig_b)  # -> bool

# Composition: combine features
TriadicValidator.compose(sig_a, sig_b)  # -> LCM

# Gap analysis: exactly which features differ?
TriadicValidator.explain_gap(sig_a, sig_b)
# {'shared_factors': [2, 5], 'only_in_a_factors': [3], 'only_in_b_factors': [7]}

# Similarity: Jaccard over prime factor sets
TriadicValidator.similarity(sig_a, sig_b)  # -> float [0, 1]

# Analogy: A is to B as C is to ?
TriadicValidator.analogy(sig_a, sig_b, sig_c)  # -> target composite
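The analogy call builds on the "exact factor transfer" idea from the operations table; here is a plain-arithmetic sketch of that idea with made-up composites (not necessarily TriadicValidator.analogy's exact internals):

from math import gcd

a, b, c = 2*3*5*7, 2*3*5*11, 2*3*7*13    # Φ(A), Φ(B), Φ(C) -- toy values

only_in_a = a // gcd(a, b)                # 7: the factor A has but B lacks
only_in_b = b // gcd(a, b)                # 11: the factor B has but A lacks
target = c // gcd(c, only_in_a) * only_in_b
# target == 2*3*11*13: C with A-only factors removed and B-only factors added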

Supported models

Works with any HuggingFace AutoModelForCausalLM:

  • GPT-2 (all sizes)
  • LLaMA / LLaMA-2 / LLaMA-3
  • Mistral / Mixtral
  • Phi-2 / Phi-3
  • Qwen / Qwen2
  • GPT-Neo / GPT-J
  • OPT
  • Falcon
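Wrapping any of these looks the same as the GPT-2 quick start; the checkpoint name below is only an example, and larger backbones need proportionally more training steps (see the training guide above):

model = TriadicWrapper("meta-llama/Llama-2-7b-hf", n_bits=64, align_mode="infonce")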

Key findings from research

  • Negligible language cost: the triadic head adds roughly +1.7% perplexity and no other measurable degradation to language quality
  • 49K params: for GPT-2 (768D, 64 bits) — negligible overhead
  • Semantic ordering emerges at scale: related concepts become more similar than unrelated ones only above ~40M parameters
  • InfoNCE + GPT-2 closes 48% of the gap to post-hoc projection (Triadic-Neurosymbolic-Engine)
  • Sharp Pareto cliff at alpha > 0.05: do not exceed this value

Citation


@article{ornelas2026triadic,
  title={End-to-End Prime Factorization in a Generative Language Model:
         Learned Algebraic Encoding from Joint Training},
  author={Ornelas Brand, J. Arturo},
  year={2026},
  doi={10.5281/zenodo.19206545},
  publisher={Zenodo}
}

License

Business Source License 1.1 (BUSL-1.1). Free for individuals, academics, and non-profits. See LICENSE.

Download files


Source Distribution

triadic_head-0.1.1.tar.gz (25.4 kB)


Built Distribution


triadic_head-0.1.1-py3-none-any.whl (20.4 kB)


File details

Details for the file triadic_head-0.1.1.tar.gz.

File metadata

  • Download URL: triadic_head-0.1.1.tar.gz
  • Size: 25.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for triadic_head-0.1.1.tar.gz
Algorithm Hash digest
SHA256 de181d2ca4b8e43e4468296569f2da0cac2f3cc2a5453010b086e4403a46b184
MD5 f7525fae63acd29aaebacfe1a5a8006e
BLAKE2b-256 5ed1872eaca6d4e22a0ac48f72675074214c2edcc31766755ebc3e03f0c70a9b


Provenance

The following attestation bundles were made for triadic_head-0.1.1.tar.gz:

Publisher: publish.yml on arturoornelasb/triadic-microgpt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file triadic_head-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: triadic_head-0.1.1-py3-none-any.whl
  • Size: 20.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for triadic_head-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cc6c1d527f541df6667f4a920b6e8267937755715895820bd35b559087ab2df1
MD5 3b8e696f2e3c5cfbaef0c137438eb763
BLAKE2b-256 dfcc82b7c45eb67fa3fe56aa7bb50074a007717560859ec0c4d146db6d9917ec


Provenance

The following attestation bundles were made for triadic_head-0.1.1-py3-none-any.whl:

Publisher: publish.yml on arturoornelasb/triadic-microgpt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
