Unofficial codebase for the "Retentive Network: A Successor to Transformer for Large Language Models" paper [https://arxiv.org/pdf/2307.08621.pdf]
RetentiveNetwork
According to Microsoft, the official RetNet codebase should be made available around August 1st, 2023.
Hashes for retentive_network-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 0a9b6b80c4f8ef16dffba0adfa20c744d99f8f9d8de3b9f8c3f4e3c34e0b40c7
MD5 | 5f4a800c8d6924b95cf058104c9ba2e5
BLAKE2b-256 | 7ec709dba9ceb12d7e07af86f08b5b3533cdd01a4b4e5c2008db01f750a38378
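The digests above can be used to check that a downloaded wheel is intact. The following is a minimal sketch using only Python's standard `hashlib` and `hmac` modules. The helper names `sha256_of_file` and `matches_expected` and the local file path are assumptions made for illustration; they are not part of this package.

```python
import hashlib
import hmac
from pathlib import Path

# Digest copied from the SHA256 row of the hash table for
# retentive_network-0.0.1-py3-none-any.whl.
EXPECTED_SHA256 = "0a9b6b80c4f8ef16dffba0adfa20c744d99f8f9d8de3b9f8c3f4e3c34e0b40c7"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals `expected_hex` (case-insensitive)."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    # Hypothetical local path: adjust to wherever the wheel was downloaded.
    wheel = Path("retentive_network-0.0.1-py3-none-any.whl")
    if wheel.exists():
        print("hash OK" if matches_expected(wheel) else "hash MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```

pip can perform an equivalent check at install time through its hash-checking mode, where a requirements file pins each package with a `--hash=sha256:...` option.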