Skip to main content

PEER - Pytorch

Project description

PEER - Pytorch

Pytorch implementation of the PEER block from the Deepmind paper, Mixture of A Million Experts, by Xu Owen He.

Install

$ pip install PEER-pytorch

Usage

import torch
from PEER_pytorch import PEER

peer = PEER(
    dim = 512,
    heads = 8,                   # tested up to 32 - (hk = heads * num_experts_per_head (16))
    num_experts = 1_000_000,     # he chose 1 million
    num_experts_per_head = 16,   # he settled on 16, but was 32 in PKM paper
    dim_key = 128,
    pre_rmsnorm = True
).cuda()

x = torch.randn(2, 1024, 512).cuda()

out = peer(x) + x

assert x.shape == out.shape

Citations

@inproceedings{He2024MixtureOA,
    title   = {Mixture of A Million Experts},
    author  = {Xu Owen He},
    year    = {2024},
    url     = {https://api.semanticscholar.org/CorpusID:271038610}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

peer_pytorch-0.1.5.tar.gz (266.6 kB view details)

Uploaded Source

Built Distribution

peer_pytorch-0.1.5-py3-none-any.whl (7.9 kB view details)

Uploaded Python 3

File details

Details for the file peer_pytorch-0.1.5.tar.gz.

File metadata

  • Download URL: peer_pytorch-0.1.5.tar.gz
  • Upload date:
  • Size: 266.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for peer_pytorch-0.1.5.tar.gz
Algorithm Hash digest
SHA256 fa8c8dcb7378c7a482cab3add4cea8533e1bbc302b77f31886c33eba9ec7fb7d
MD5 ff5ac35ee4806210496900daccac6786
BLAKE2b-256 49623f0f3a2b44ffce66766d58aa8d1da6c9bbded8b93a976dc3fdea05adadf0

See more details on using hashes here.

File details

Details for the file peer_pytorch-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for peer_pytorch-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7f706c9686d438ecccb4d6f6d4a1a08af5a43263ba647c445ca82128ed0878fe
MD5 a4c9bda9b8884fb8a92ca3f5753e1e27
BLAKE2b-256 d9f4badc9fe63ff743bbc4bf8607a44d1788acc91cd7c922b4544b6b47ce84cf

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page