nGPT-pytorch

nGPT

These details have not been verified by PyPI

Project links

Repository

Project description

nGPT (normalized GPT) - Pytorch

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI. The question is whether there is any loss of expressivity they swept under the rug, but I'll take it with good faith.

This type of network should also be studied in the context of continual learning and loss of plasticity

Adaptation to vision transformers is here

Install

$ pip install nGPT-pytorch

Usage

import torch
from nGPT_pytorch import nGPT

model = nGPT(
    num_tokens = 256,
    dim = 512,
    depth = 4,
    attn_norm_qk = True
)

x = torch.randint(0, 256, (2, 2048))

loss = model(x, return_loss = True)
loss.backward()

logits = model(x) # (2, 2048, 256)

Test

Enwik8

$ python train.py

Citations

@inproceedings{Loshchilov2024nGPTNT,
    title   = {nGPT: Normalized Transformer with Representation Learning on the Hypersphere},
    author  = {Ilya Loshchilov and Cheng-Ping Hsieh and Simeng Sun and Boris Ginsburg},
    year    = {2024},
    url     = {https://api.semanticscholar.org/CorpusID:273026160}
}

@article{Luo2017CosineNU,
    title     = {Cosine Normalization: Using Cosine Similarity Instead of Dot Product in Neural Networks},
    author    = {Chunjie Luo and Jianfeng Zhan and Lei Wang and Qiang Yang},
    journal   = {ArXiv},
    year      = {2017},
    volume    = {abs/1702.05870},
    url       = {https://api.semanticscholar.org/CorpusID:1505432}
}

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

0.2.6

Nov 3, 2024

0.2.5

Nov 2, 2024

0.2.4

Nov 2, 2024

0.2.3

Nov 2, 2024

0.2.2

Nov 2, 2024

0.2.1

Nov 2, 2024

0.2.0

Oct 31, 2024

0.1.24

Oct 31, 2024

0.1.23

Oct 31, 2024

0.1.22

Oct 31, 2024

0.1.21

Oct 30, 2024

0.1.20

Oct 29, 2024

0.1.19

Oct 28, 2024

0.1.17

Oct 28, 2024

0.1.16

Oct 28, 2024

0.1.15

Oct 28, 2024

This version

0.1.14

Oct 28, 2024

0.1.12

Oct 27, 2024

0.1.11

Oct 21, 2024

0.1.10

Oct 17, 2024

0.1.9

Oct 17, 2024

0.1.8

Oct 17, 2024

0.1.7

Oct 13, 2024

0.1.6

Oct 11, 2024

0.1.5

Oct 11, 2024

0.1.4

Oct 11, 2024

0.1.2

Oct 10, 2024

0.1.1

Oct 10, 2024

0.1.0

Oct 10, 2024

0.0.14

Oct 10, 2024

0.0.12

Oct 10, 2024

0.0.11

Oct 9, 2024

0.0.10

Oct 9, 2024

0.0.9

Oct 9, 2024

0.0.8

Oct 9, 2024

0.0.7

Oct 9, 2024

0.0.6

Oct 8, 2024

0.0.5

Oct 8, 2024

0.0.4

Oct 8, 2024

0.0.2

Oct 8, 2024

0.0.1

Oct 8, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngpt_pytorch-0.1.14.tar.gz (36.9 MB view hashes)

Uploaded Oct 28, 2024 Source

Built Distribution

ngpt_pytorch-0.1.14-py3-none-any.whl (14.4 kB view hashes)

Uploaded Oct 28, 2024 Python 3

Hashes for ngpt_pytorch-0.1.14.tar.gz

Hashes for ngpt_pytorch-0.1.14.tar.gz
Algorithm	Hash digest
SHA256	`248b40f9db25bbbe3cd57760ef14904545b7fe7b17d6d566f43097ce09b0e089`
MD5	`1776819f09860f1ed260396666377fad`
BLAKE2b-256	`9124d7dea2eb558d97e5d2a4f0bfcc8446d480d9e7894c9d993e6f1451078955`

Hashes for ngpt_pytorch-0.1.14-py3-none-any.whl

Hashes for ngpt_pytorch-0.1.14-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9cf4e57072fbf6e0eab2c81d09d1779aa0dec14d9af334a7b2835f0c82ae8d72`
MD5	`750db2bd9e06cc2f8f15f3ae041b02a2`
BLAKE2b-256	`799ce9e7f2a64a339b4a07e41e621532cf4c33141a2c5a347b6d91e10e46a204`