A text generation model combining multiple neural network architectures

SENTIA

SENTIA is a PyTorch implementation of a text generation model that combines several neural network architectures: GRUs, Transformers, multi-head attention (MHA), and MEPA.

Installation

pip install sentia

Usage

import torch
from sentia import SENTIA

# Create model
model = SENTIA(vocab_size=10000, embedding_dim=512, num_heads=8, num_layers=6, hidden_dim=512)

# Forward pass on a random batch (batch size 1, sequence length 32)
input_ids = torch.randint(0, 10000, (1, 32))
outputs = model(input_ids)

# Generate text
generated = model.generate(input_ids, max_length=128)

Model Architecture

The SENTIA model consists of the following components:

Embedding layer
GRU layer
MEPA (Mutation Enhanced Plasticity Architecture) layers
Transformer decoder layers
Multi-head attention layer
Output head layers

These components are combined to leverage the strengths of multiple architectures for improved text generation capabilities.
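
As a rough illustration of how components like those listed above could be chained in PyTorch, here is a minimal sketch. It is not SENTIA's actual implementation: the class name HybridTextModel, the layer ordering, and the masking choices are assumptions, and the MEPA layers are left out because their internals are not described here.

```python
import torch
import torch.nn as nn


class HybridTextModel(nn.Module):
    """Hypothetical stand-in, NOT SENTIA's code. MEPA layers are omitted."""

    def __init__(self, vocab_size=10000, embedding_dim=512, num_heads=8,
                 num_layers=6, hidden_dim=512):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embedding_dim)
        self.gru = nn.GRU(embedding_dim, hidden_dim, batch_first=True)
        decoder_layer = nn.TransformerDecoderLayer(
            d_model=hidden_dim, nhead=num_heads, batch_first=True)
        self.decoder = nn.TransformerDecoder(decoder_layer, num_layers=num_layers)
        self.attention = nn.MultiheadAttention(
            hidden_dim, num_heads, batch_first=True)
        self.output_head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, input_ids):
        x = self.embedding(input_ids)      # (batch, seq, embedding_dim)
        x, _ = self.gru(x)                 # (batch, seq, hidden_dim)
        seq_len = input_ids.size(1)
        # Causal mask: True marks positions that may NOT be attended to.
        causal = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool,
                       device=input_ids.device),
            diagonal=1)
        # Assumption: the decoder attends to the GRU output as its own memory.
        x = self.decoder(tgt=x, memory=x, tgt_mask=causal, memory_mask=causal)
        x, _ = self.attention(x, x, x, attn_mask=causal)
        return self.output_head(x)         # (batch, seq, vocab_size)


# A small configuration keeps the example quick to run.
sketch = HybridTextModel(vocab_size=100, embedding_dim=32, num_heads=4,
                         num_layers=2, hidden_dim=32)
logits = sketch(torch.randint(0, 100, (2, 8)))
print(logits.shape)  # one score per vocabulary entry at each position
```

The sketch returns one logit vector per input position, which is the usual shape for next-token prediction; how SENTIA itself wires these layers together may differ.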

Training

The fit() method can be used to train the model on a dataset. It handles the training loop, gradient accumulation, and RL calculations. Currently, the scheduler parameter only supports StepLR.
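
For readers unfamiliar with the pieces mentioned above, the following is a minimal sketch of a training loop with gradient accumulation and a StepLR scheduler in plain PyTorch. It does not show fit() itself: the toy model, the random data, and values such as accumulation_steps are assumptions made for illustration, and the RL calculations are not covered.

```python
import torch
import torch.nn as nn
from torch.optim.lr_scheduler import StepLR

torch.manual_seed(0)

# Toy stand-in model and random token data (hypothetical, not SENTIA).
vocab_size, seq_len = 100, 8
toy_model = nn.Sequential(nn.Embedding(vocab_size, 16),
                          nn.Linear(16, vocab_size))
batches = [torch.randint(0, vocab_size, (4, seq_len + 1)) for _ in range(8)]

optimizer = torch.optim.AdamW(toy_model.parameters(), lr=1e-3)
scheduler = StepLR(optimizer, step_size=1, gamma=0.5)  # halve the LR each epoch
loss_fn = nn.CrossEntropyLoss()
accumulation_steps = 4  # assumed value: one optimizer step per 4 batches

for epoch in range(2):
    optimizer.zero_grad()
    for step, batch in enumerate(batches, start=1):
        inputs, targets = batch[:, :-1], batch[:, 1:]  # next-token prediction
        logits = toy_model(inputs)                     # (batch, seq, vocab)
        loss = loss_fn(logits.reshape(-1, vocab_size), targets.reshape(-1))
        # Scale the loss so accumulated gradients average over the batches.
        (loss / accumulation_steps).backward()
        if step % accumulation_steps == 0:
            optimizer.step()
            optimizer.zero_grad()
    scheduler.step()  # StepLR is stepped once per epoch

final_lr = scheduler.get_last_lr()[0]
print(final_lr)
```

Dividing the loss by accumulation_steps before backward() makes the accumulated gradient match what a single larger batch would produce, which is the usual reason for using accumulation when memory is limited.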

Release history

Version	Release date
1.17	Dec 5, 2023
1.15	Nov 3, 2023
1.1	Oct 11, 2023
1.0.5	Sep 10, 2023
1.0.3	Sep 2, 2023
1.0 (this version)	Aug 23, 2023
0.0.7	Aug 12, 2023
0.0.6	Aug 12, 2023
0.0.5	Aug 12, 2023
0.0.4	Aug 12, 2023
0.0.3	Aug 12, 2023
0.0.2	Aug 12, 2023
0.0.1	Aug 12, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sentia-1.0.tar.gz (25.0 kB)

Uploaded Aug 23, 2023 Source

Built Distribution

sentia-1.0-py3-none-any.whl (37.1 kB)

Uploaded Aug 23, 2023 Python 3

Hashes for sentia-1.0.tar.gz

Algorithm	Hash digest
SHA256	`a156910e771bb5d1c791cf6f1a138d0493e930cf29bb1c74fbb45404bd6c8721`
MD5	`db838dab9fb3a7340a2c423bce003e06`
BLAKE2b-256	`eed03e524616c98819d23b810f5b116e41a2a17a92bbd743137b27d03b9d5e51`

Hashes for sentia-1.0-py3-none-any.whl

Algorithm	Hash digest
SHA256	`fa3f5d5ce639bc03801be7751a8db7687148fde0e1677082f699d6adaf7ffbe1`
MD5	`035f5a5a4301f1cb6fbe2949f4c3ed39`
BLAKE2b-256	`d5a93a654acc9c0cf87d1c14e3006863edb6fcebbec0dc591a7e744c21b9e934`