Composable neural network composennents for building models in PyTorch.

These details have not been verified by PyPI

Project links

Repository

Project description

composennent

Composable neural network composennents for building models in PyTorch.

Composennent provides modular, reusable building blocks for constructing transformer-based models. Train GPT, BERT, and other architectures with minimal code.

Features

🧩 Modular Components: Encoder, Decoder, Attention blocks that compose together
🚀 Built-in Training: Pre-training and fine-tuning with a single method call
📝 Multiple Architectures: GPT, BERT, Seq2Seq support out of the box
🔧 Tokenizer Support: WordPiece and SentencePiece tokenizers included
⚡ Mixed Precision: Automatic mixed precision (AMP) support
🎯 Instruction Tuning: Fine-tune models on instruction datasets (Alpaca format)

Installation

pip install composennent

For tokenizer support:

pip install composennent[tokenizers]

For development:

pip install composennent[dev]

Quick Start

Pre-train a GPT Model

import torch
from composennent.models import GPT
from composennent.nlp.tokenizers import SentencePieceTokenizer

# Create model
model = GPT(
    vocab_size=32000,
    latent_dim=512,
    num_heads=8,
    num_layers=6,
    max_seq_len=512,
)

# Load tokenizer
tokenizer = SentencePieceTokenizer.from_pretrained("tokenizer.model")

# Pre-train
texts = ["Your training data here...", ...]
model.pretrain(
    texts=texts,
    tokenizer=tokenizer,
    epochs=3,
    batch_size=16,
    device="cuda",
)

# Save
model.save("my_model.pt")

Fine-tune on Instructions

# Load pre-trained model
model = GPT.load("my_model.pt", device="cuda")

# Instruction data (Alpaca format)
instruction_data = [
    {
        "instruction": "What is the capital of France?",
        "input": "",
        "output": "The capital of France is Paris."
    },
    # ... more examples
]

# Fine-tune
model.fine_tune(
    data=instruction_data,
    tokenizer=tokenizer,
    epochs=2,
    lr=5e-5,
    mask_prompt=True,  # Only compute loss on outputs
)

Generate Text

prompt = tokenizer.encode("What is")
generated = model.generate(
    input_ids=prompt,
    max_length=100,
    temperature=0.8,
)
print(tokenizer.decode(generated[0].tolist()))

Documentation

Detailed documentation is available in the docs/ directory:

Models

GPT: Decoder-only architecture with self-modifying memory.
Encoder-Decoder: Flexible T5/BART-style architecture.

Modules

Memory: Differentiable Key-Value memory.
Retrieval (RAG): Scalable memory and document stores.
Experts (MoE): Mixture of Experts and LoRA adapters.

Training

Trainer Engine: Unified training loop.
Distillation: Knowledge distillation from Teacher models.
RLHF / DPO: Direct Preference Optimization.

Features

Quantization: INT8/INT4 compression.
Function Calling: Agentic tool use protocols.

Training API

For more control over training, use the trainer classes directly:

from composennent.trainer import CausalLMTrainer, train

# Option 1: Use the train() convenience function
train(model, texts, tokenizer, model_type="causal_lm", epochs=5)

# Option 2: Use trainer class directly
trainer = CausalLMTrainer(model, tokenizer, device="cuda")
trainer.train(texts, epochs=5, batch_size=16)
trainer.save_checkpoint("checkpoint.pt")

Available trainers:

CausalLMTrainer - GPT-style next-token prediction
MaskedLMTrainer - BERT-style masked language modeling
Seq2SeqTrainer - Encoder-decoder models
MultiTaskTrainer - Multi-task learning (MLM + NSP)
CustomTrainer - Custom loss functions

Requirements

Python >= 3.8
PyTorch >= 2.0.0

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Install dev dependencies (pip install -e ".[dev]")
Run tests (pytest)
Run formatters (black . && ruff check .)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

MIT License - see LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

This version

0.5.0

Jan 12, 2026

0.4.9

Jan 5, 2026

0.4.8

Jan 5, 2026

0.4.7

Jan 4, 2026

0.4.6

Jan 4, 2026

0.4.5

Dec 19, 2025

0.4.4

Dec 17, 2025

0.4.3

Dec 17, 2025

0.4.2

Dec 17, 2025

0.4.1

Dec 17, 2025

0.4.0

Dec 16, 2025

0.3.9

Dec 13, 2025

0.3.8

Dec 13, 2025

0.3.7

Dec 12, 2025

0.3.6

Dec 8, 2025

0.3.5

Dec 5, 2025

0.3.4

Dec 5, 2025

0.3.3

Dec 5, 2025

0.3.2

Dec 3, 2025

0.3.1

Dec 3, 2025

0.3.0

Dec 2, 2025

0.2.8

Dec 2, 2025

0.2.7

Dec 2, 2025

0.2.6

Dec 2, 2025

0.2.5

Nov 30, 2025

0.2.4

Nov 30, 2025

0.2.3

Nov 30, 2025

0.2.2

Nov 30, 2025

0.2.1

Nov 30, 2025

0.2.0

Nov 30, 2025

0.1.0

Nov 27, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

composennent-0.5.0.tar.gz (89.4 kB view details)

Uploaded Jan 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

composennent-0.5.0-py3-none-any.whl (120.6 kB view details)

Uploaded Jan 12, 2026 Python 3

File details

Details for the file composennent-0.5.0.tar.gz.

File metadata

Download URL: composennent-0.5.0.tar.gz
Upload date: Jan 12, 2026
Size: 89.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for composennent-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`802771307bbc952cffd402bb420658242cf6086fecf602c363b72faff6fb851f`
MD5	`2160d69a8ffff33bf856d9afcce6c86d`
BLAKE2b-256	`03f7be4db46fefe678370a680bcc054f296607e4c9317a781f2fa2181e8b7206`

See more details on using hashes here.

File details

Details for the file composennent-0.5.0-py3-none-any.whl.

File metadata

Download URL: composennent-0.5.0-py3-none-any.whl
Upload date: Jan 12, 2026
Size: 120.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for composennent-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`66c79265e87b2787b720ce85f0686529c570289771bcd79e95d6c808996be4e0`
MD5	`03f35f37a908a407d13a11be3782149c`
BLAKE2b-256	`c2bcfc9d88e28f4719cd6326493512048efa1eef642d37e687b6fb599d7c2989`

See more details on using hashes here.

composennent 0.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

composennent

Features

Installation

Quick Start

Pre-train a GPT Model

Fine-tune on Instructions

Generate Text

Documentation

Models

Modules

Training

Features

Training API

Requirements

Contributing

License

Links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes