MorphFormer: multilingual morphological reinflection via character-level Transformer
MorphFormer v2
Character-level Transformer for multilingual morphological reinflection.
Installation

```bash
pip install morphoformer
```

From source:

```bash
cd Morph_v2
pip install -e .
```

Requirements: Python >= 3.14, PyTorch >= 2.0.
Quick Start

```bash
# Download SIGMORPHON/UniMorph data for Russian, German, and French
morphoformer download --lang rus,deu,fra --merge

# Train the medium preset
morphoformer train --preset medium --data "data/collections/*_train.tsv" --device cuda

# Single inference from a checkpoint
morphoformer infer --checkpoint checkpoints/morphformer_epoch50.pt \
    --word "бежать" --morph "V;IND;PRS;3;SG" --lang rus

# Interactive REPL
morphoformer serve --checkpoint checkpoints/morphformer_epoch50.pt
```
Presets

| Preset | d_model | Encoder | Decoder | dim_ff | ~Params | VRAM |
|---|---|---|---|---|---|---|
| small | 384 | 4 layers | 3 layers | 1024 | ~7M | < 4 GB |
| medium | 512 | 8 layers | 6 layers | 1376 | ~45M | 4–8 GB |
| large | 768 | 10 layers | 8 layers | 2048 | ~120M | >= 8 GB |
```bash
morphoformer init-config --preset medium --out config.toml
```
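The ~Params column is dominated by the attention and feed-forward matrices. A back-of-envelope sketch for the medium preset, assuming the default gqa attention and swiglu feed-forward listed under Modules (embeddings, norms, and adapters are ignored, which is why it lands slightly under ~45M; this is hypothetical arithmetic, not the package's own accounting):

```python
d_model, n_heads, n_kv_heads = 512, 8, 2
d_ff, enc_layers, dec_layers = 1376, 8, 6

d_head = d_model // n_heads
attn = 2 * d_model * d_model               # Q and output projections
attn += 2 * d_model * d_head * n_kv_heads  # grouped K and V projections
ffn = 3 * d_model * d_ff                   # SwiGLU: gate, up, down matrices

total = (enc_layers + dec_layers) * (attn + ffn)  # self-attention blocks
total += dec_layers * attn                        # decoder cross-attention
print(f"~{total / 1e6:.0f}M parameters")          # ~43M
```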
CLI
| Command | Description |
|---|---|
| train | Train a model |
| infer | Single inference from a checkpoint |
| serve | Interactive REPL |
| download | Download SIGMORPHON/UniMorph data |
| modules | List registered modules |
| init-config | Generate a TOML config template |
```bash
morphoformer --help
morphoformer train --help
```
Data Format

Tab-separated values, one example per line: `lemma\tfeatures\tsurface_form\tlanguage`.

```
бежать  V;IND;PRS;3;SG  бежит  rus
gehen   V;IND;PRS;3;SG  geht   deu
aller   V;IND;PRS;3;SG  va     fra
```
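A minimal reader sketch using only the standard library (the file path is illustrative; the package's own loading lives in morphoformer/data/dataset.py):

```python
import csv

def read_reinflection_tsv(path):
    """Yield (lemma, feature_list, surface_form, language) tuples."""
    with open(path, encoding="utf-8", newline="") as f:
        for lemma, features, form, lang in csv.reader(f, delimiter="\t"):
            yield lemma, features.split(";"), form, lang

for lemma, feats, form, lang in read_reinflection_tsv("data/collections/rus_train.tsv"):
    print(lemma, feats, form, lang)
```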
Configuration

```toml
[model]
d_model = 512
num_heads = 8
num_kv_heads = 2
dim_ff = 1376
dropout = 0.15
num_languages = 50

[model.encoder]
num_layers = 8
conv = "local"
adapter = "language_conditioned"
adapter_bottleneck = 128

[model.decoder]
num_layers = 6

[model.morph_encoder]
type = "pooled"

[training]
epochs = 50
batch_size = 64
lr = 5e-4
warmup_steps = 4000
device = "auto"
```
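Since the config is plain TOML, it can be inspected with the standard library before training (a sketch; any validation the package adds lives in morphoformer/config/loader.py):

```python
import tomllib  # stdlib since Python 3.11; the package requires >= 3.14

with open("config.toml", "rb") as f:
    cfg = tomllib.load(f)

print(cfg["model"]["d_model"])        # 512
print(cfg["training"]["batch_size"])  # 64
```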
Python API

```python
import torch
from morphoformer import modules
from morphoformer.modules import MorphFormer
from morphoformer.data import CharVocab, FeatureVocab
from morphoformer.inference import greedy_decode

ckpt = torch.load("model.pt", map_location="cpu", weights_only=False)
char_vocab = CharVocab.from_dict(ckpt["char_vocab"])
feature_vocab = FeatureVocab.from_dict(ckpt["feature_vocab"])
# ... build model, load state_dict ...

result = greedy_decode(
    model, char_vocab, feature_vocab,
    "бежать", ["V", "IND", "PRS", "3", "SG"], lang_id=0,
    device=torch.device("cuda"), max_len=96, max_out=128,
)
```
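Once the model is built, the same call covers batch-style scripting. A usage sketch (the past-tense feature bundle and lang_id=0 are illustrative; check your checkpoint's language manifest):

```python
queries = [
    ("бежать", ["V", "IND", "PRS", "3", "SG"]),
    ("бежать", ["V", "PST", "SG", "MASC"]),
]
for word, feats in queries:
    out = greedy_decode(
        model, char_vocab, feature_vocab,
        word, feats, lang_id=0,
        device=torch.device("cuda"), max_len=96, max_out=128,
    )
    print(f"{word} + {';'.join(feats)} -> {out}")
```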
Modules

```
$ morphoformer modules
[attention] gqa, mha, cross
[feedforward] swiglu, gelu
[norm] rmsnorm, layernorm
[conv] local, none
[adapter] language_conditioned, bottleneck, none
[morph_encoder] pooled, attention
[position] rope
```
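The registry in morphoformer/modules/registry.py advertises `@register / get / list_modules`, so looking a component up by category and registered name presumably goes roughly like this (argument order and return values are assumptions):

```python
from morphoformer.modules import registry

ffn_cls = registry.get("feedforward", "swiglu")  # assumed signature
print(registry.list_modules())                   # assumed signature
```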
Custom module:

```python
import torch.nn as nn
from morphoformer.modules.registry import register

@register("feedforward", "my_ffn")
class MyFFN(nn.Module):
    ...
```
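Filled in, a minimal custom feed-forward might look like the following. The constructor signature (d_model, dim_ff, dropout) mirrors the config fields above but is an assumption; check how the registry actually instantiates modules:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from morphoformer.modules.registry import register

@register("feedforward", "my_gelu_ffn")
class MyGELUFFN(nn.Module):
    """Plain two-layer GELU feed-forward block (hypothetical signature)."""

    def __init__(self, d_model: int, dim_ff: int, dropout: float = 0.0):
        super().__init__()
        self.up = nn.Linear(d_model, dim_ff)
        self.down = nn.Linear(dim_ff, d_model)
        self.drop = nn.Dropout(dropout)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.down(self.drop(F.gelu(self.up(x))))
```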
Devices

```bash
morphoformer train --device auto  # CUDA -> XPU -> MPS -> CPU
morphoformer train --device cuda  # NVIDIA
morphoformer train --device rocm  # AMD
morphoformer train --device xpu   # Intel Arc
morphoformer train --device mps   # Apple Silicon
```
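A sketch of the auto fallback order using public PyTorch checks (morphoformer's actual resolution lives in its trainer; note that ROCm builds of PyTorch expose AMD GPUs through the cuda device type):

```python
import torch

def resolve_device() -> torch.device:
    if torch.cuda.is_available():                           # NVIDIA (and ROCm builds)
        return torch.device("cuda")
    if hasattr(torch, "xpu") and torch.xpu.is_available():  # Intel Arc
        return torch.device("xpu")
    if torch.backends.mps.is_available():                   # Apple Silicon
        return torch.device("mps")
    return torch.device("cpu")

print(resolve_device())
```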
Project Structure

```
morphoformer/
├── cli.py                  # CLI entry point
├── config/
│   ├── schema.py           # Dataclass config schema
│   ├── loader.py           # TOML load/save
│   └── presets.py          # small / medium / large
├── data/
│   ├── vocab.py            # CharVocab
│   ├── feature_vocab.py    # FeatureVocab
│   ├── dataset.py          # MorphDataset + CUDA stream prefetch
│   └── download.py         # SIGMORPHON downloader
├── modules/
│   ├── registry.py         # @register / get / list_modules
│   ├── transformer.py      # MorphFormer (main model)
│   ├── encoder.py          # Encoder + EncoderLayer
│   ├── decoder.py          # Decoder + DecoderLayer + KV cache
│   ├── attention.py        # GQA, MHA, CrossAttention
│   ├── feedforward.py      # SwiGLU, GeLU
│   ├── norm.py             # RMSNorm, LayerNorm
│   ├── position.py         # RoPE
│   ├── conv.py             # Depthwise conv (Conformer-style)
│   ├── adapter.py          # Language-conditioned adapters
│   └── morph_encoder.py    # Pooled / Attention morph encoder
├── inference/
│   ├── decode.py           # Greedy decode + KV cache
│   └── cache.py            # KVCache dataclass
└── training/
    ├── trainer.py          # Training loop
    ├── scheduler.py        # Cosine LR + warmup
    └── memory.py           # VRAM profiling + auto batch planning
```
License
Project details
Download files
Source Distribution

morphoformer-2.3.1.tar.gz (30.5 kB)
Built Distribution

morphoformer-2.3.1-py3-none-any.whl (38.5 kB)
File details
Details for the file morphoformer-2.3.1.tar.gz.
File metadata
- Download URL: morphoformer-2.3.1.tar.gz
- Upload date:
- Size: 30.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | 7d13b916495e5df81fcd6224ee39b9c26f5af88ea52553fe24918e986ed59151 |
| MD5 | bb9044e3582534dd3b79bc4b7e635b14 |
| BLAKE2b-256 | 92fdd490c0173f03348a98d3455a06373e7ba02dd10a39f2e7d74c3be552ee83 |
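To check a downloaded file against these digests, a standard-library sketch:

```python
import hashlib

expected = "7d13b916495e5df81fcd6224ee39b9c26f5af88ea52553fe24918e986ed59151"
with open("morphoformer-2.3.1.tar.gz", "rb") as f:
    actual = hashlib.sha256(f.read()).hexdigest()
assert actual == expected, "SHA256 mismatch"
```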
File details
Details for the file morphoformer-2.3.1-py3-none-any.whl.
File metadata
- Download URL: morphoformer-2.3.1-py3-none-any.whl
- Upload date:
- Size: 38.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | d89cfc3ad1452ee12f0cb8465ad5933a01fef075b8ccd96c888f22ef5de8dacb |
| MD5 | da6d9ee0f6392803be72cf11568cffb9 |
| BLAKE2b-256 | 161e77d9eb6f68d778da4c6f3dec154ee6dc5419eabec419fdc568f6ecdb5f7f |