Transformers at zeta scales

Zeta - A Transgalactic Library for Scalable Transformations

MIT License

Zeta is a PyTorch-powered library, forged in the heart of the Halo array, that empowers researchers and developers to scale up Transformers efficiently and effectively. It leverages seminal research advancements to enhance the generality, capability, and stability of scaling Transformers while optimizing training efficiency.

Installation

To install:

pip install zetascale

To get hands-on and develop it locally:

git clone https://github.com/kyegomez/zeta.git
cd zeta
pip install -e .
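
A quick way to verify the install is to import the package from a Python session (this assumes the distribution exposes the `zeta` module used throughout the examples below):

>>> import zeta
>>> print("zeta imported successfully")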

Initiating Your Journey

Creating a model empowered with Zeta's breakthrough research features is a breeze. Here's how to quickly materialize a BERT-like encoder:

>>> from zeta import EncoderConfig
>>> from zeta import Encoder

>>> config = EncoderConfig(vocab_size=64000)
>>> model = Encoder(config)

>>> print(model)
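
As a rough sketch of how the encoder might be exercised once instantiated, here is a hypothetical forward pass on a batch of random token ids; the argument names and output format are assumptions rather than documented API, so consult the source for the exact signature:

>>> import torch

>>> # hypothetical batch: 2 sequences of 128 token ids drawn from the 64k vocabulary
>>> tokens = torch.randint(0, 64000, (2, 128))
>>> out = model(tokens)  # assumes the encoder accepts token ids directly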

Additionally, we support the Decoder and EncoderDecoder architectures:

# To create a decoder model
>>> from zeta import DecoderConfig
>>> from zeta import Decoder

>>> config = DecoderConfig(vocab_size=64000)
>>> decoder = Decoder(config)
>>> print(decoder)

# To create an encoder-decoder model
>>> from zeta import EncoderDecoderConfig
>>> from zeta import EncoderDecoder

>>> config = EncoderDecoderConfig(vocab_size=64000)
>>> encdec = EncoderDecoder(config)
>>> print(encdec)
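
In the same spirit, a hedged sketch of how the decoder and encoder-decoder might consume token ids; the call patterns below are assumptions for illustration, not the documented API:

>>> import torch

>>> # hypothetical inputs: source and previous-output token ids
>>> src_tokens = torch.randint(0, 64000, (2, 128))
>>> prev_tokens = torch.randint(0, 64000, (2, 64))

>>> decoder_out = decoder(prev_tokens)
>>> encdec_out = encdec(src_tokens, prev_tokens)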

Key Features

Most of Zeta's key features can be enabled simply by setting the corresponding parameters in the config:

>>> from zeta import EncoderConfig
>>> from zeta import Encoder

>>> config = EncoderConfig(vocab_size=64000, deepnorm=True, multiway=True)
>>> model = Encoder(config)

>>> print(model)
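
Assuming the configured model is a standard torch.nn.Module (as the PyTorch-powered description and the print(model) output suggest), the usual PyTorch utilities apply. A minimal sketch:

>>> # parameter count and device placement, as with any nn.Module
>>> num_params = sum(p.numel() for p in model.parameters())
>>> print(f"{num_params / 1e6:.1f}M parameters")
>>> model = model.cuda()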

For a complete overview of our key features, refer to our Feature Guide.

Examples

Discover how to wield Zeta in a multitude of scenarios and tasks.

We are working tirelessly to expand the collection of examples spanning various tasks (e.g., vision pretraining, speech recognition) and deep learning frameworks (e.g., DeepSpeed, Megatron-LM). Your comments, suggestions, and contributions are welcome!

Results

Check out our Results Page to witness Zeta's exceptional performance in Stability Evaluations and Scaling-up Experiments.

Acknowledgments

Zeta is a masterpiece inspired by elements of FairSeq and UniLM.

Citations

If our work here in Zeta has aided you in your journey, please consider acknowledging our efforts in your work. You can find relevant citation details in our Citations Document.

Contributing

We're always thrilled to welcome new ideas and improvements from the community. Please check our Contributor's Guide for more details about contributing.

  • Create a modular, omni-universal Attention class supporting flash multi-head attention, regular multi-head attention, or dilated attention, then integrate it into Decoder / DecoderConfig (a hypothetical sketch follows below).
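
As a purely hypothetical illustration of that roadmap item, a modular attention selector might look roughly like the following. None of these class, registry, or parameter names exist in Zeta today; the flash and dilated entries are placeholders for future implementations:

import torch.nn as nn

def _regular_attention(dim, heads):
    # Standard PyTorch multi-head attention as the default implementation.
    return nn.MultiheadAttention(dim, heads, batch_first=True)

# Hypothetical registry mapping a config string to an attention builder.
ATTENTION_REGISTRY = {
    "regular": _regular_attention,
    # "flash": ...    placeholder for a flash-attention implementation
    # "dilated": ...  placeholder for a dilated-attention implementation
}

class OmniAttention(nn.Module):
    """Hypothetical wrapper that selects an attention implementation by name."""

    def __init__(self, dim: int, heads: int, attn_type: str = "regular"):
        super().__init__()
        if attn_type not in ATTENTION_REGISTRY:
            raise ValueError(f"unknown attn_type: {attn_type}")
        self.attn = ATTENTION_REGISTRY[attn_type](dim, heads)

    def forward(self, x, attn_mask=None):
        # Self-attention over an input x of shape (batch, seq_len, dim).
        out, _ = self.attn(x, x, x, attn_mask=attn_mask)
        return out

A DecoderConfig field such as attn_type could then choose the implementation when the Decoder is assembled.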

Download files


Source Distribution

zetascale-0.0.2.tar.gz (60.6 kB)

Built Distribution

zetascale-0.0.2-py3-none-any.whl (81.3 kB)
