kosmosg - Pytorch
Project description
KosmosG
My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
Installation
pip install kosmosg
Usage
import torch
from kosmosg.main import KosmosG
# usage
img = torch.randn(1, 3, 256, 256)
text = torch.randint(0, 20000, (1, 1024))
model = KosmosG()
output = model(img, text)
print(output)
Architecture
text, image => KosmosG => text tokens with multi modality understanding
License
MIT
Todo
- Create Aligner in pytorch
- Create Diffusion module
- Integrate these pieces
- Create a training script
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kosmosg-0.0.4.tar.gz
(4.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kosmosg-0.0.4.tar.gz.
File metadata
- Download URL: kosmosg-0.0.4.tar.gz
- Upload date:
- Size: 4.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2721a5d38684c47e2452717c44e5cdf039ea73de67a941da1ca2e13d861631b1
|
|
| MD5 |
5a9caa3691519dd5b381384ebd4aab1e
|
|
| BLAKE2b-256 |
267a7ef1e2efb021c0318b7a0c44eb9f65a1b22a684845885795022b537e4a1e
|
File details
Details for the file kosmosg-0.0.4-py3-none-any.whl.
File metadata
- Download URL: kosmosg-0.0.4-py3-none-any.whl
- Upload date:
- Size: 4.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae7206b315c6ab15f77fe63d94b26bb72d43216bdfef879f1187beca64731750
|
|
| MD5 |
99c09c2d402e87bc36e4e5b251a626ea
|
|
| BLAKE2b-256 |
8ede5461107f228385baa151c4309d95a7d625f3816c4644a7992ec9b47e4486
|