MaskBit
Project description
MaskBit - Pytorch (wip)
Implementation of the proposed MaskBit from Bytedance AI
This paper can be viewed as a modernized version of the architecture from Taming Transformers from Esser et al.
They use the binary scalar quantization proposed in MagVit2 in their autoencoder, and then non-autoregressive mask decoding, where the masking is setting the bit (-1 or +1) to 0, projected for the transformer without explicit embeddings for the trit
Usage
import torch
from maskbit_pytorch import BQVAE, MaskBit
images = torch.randn(1, 3, 64, 64)
# train vae
vae = BQVAE(
image_size = 64,
dim = 512
)
loss = vae(images, return_loss = True)
loss.backward()
# train maskbit
maskbit = MaskBit(
vae,
dim = 512,
bits_group_size = 512,
depth = 2
)
loss = maskbit(images)
loss.backward()
# after much training
sampled_image = maskbit.sample() # (1, 3, 64, 64)
Citations
@inproceedings{Weber2024MaskBitEI,
title = {MaskBit: Embedding-free Image Generation via Bit Tokens},
author = {Mark Weber and Lijun Yu and Qihang Yu and Xueqing Deng and Xiaohui Shen and Daniel Cremers and Liang-Chieh Chen},
year = {2024},
url = {https://api.semanticscholar.org/CorpusID:272832013}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file maskbit_pytorch-0.0.2.tar.gz.
File metadata
- Download URL: maskbit_pytorch-0.0.2.tar.gz
- Upload date:
- Size: 285.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8e4ff775594d0bf11822352ab423f9ad543eb0d5137ff59f91a7d654b5dd55bf
|
|
| MD5 |
84c9678ff54dce944e5d10fe16f2e979
|
|
| BLAKE2b-256 |
2270e71008d095a51c62623618e262bdc2ab170aad2b78d3c046a3f1f7891ac7
|
File details
Details for the file maskbit_pytorch-0.0.2-py3-none-any.whl.
File metadata
- Download URL: maskbit_pytorch-0.0.2-py3-none-any.whl
- Upload date:
- Size: 12.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9d5a48b99faaad5cf5d4d5096de782188c2adba2d156ea7175bb333261b3fc3d
|
|
| MD5 |
17e5987d582fa8086c1c48d5eec4824a
|
|
| BLAKE2b-256 |
9f4c9c2ed1c2440c76c902c71c83d52c4717010acee26f14ea5a9a05ff254710
|