
kosmosg - PyTorch


KosmosG

My PyTorch implementation of KosmosG, the model from the paper "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models".

Installation

pip install kosmosg
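To confirm that the install worked, one option is to ask Python's standard library which version it sees. This is a minimal sketch: `installed_version` is a hypothetical helper name, not part of kosmosg, and it relies only on `importlib.metadata` from the standard library.

```python
from importlib import metadata


def installed_version(package: str):
    """Return the installed version string, or None if the package is absent.

    Hypothetical helper for illustration; not part of kosmosg.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints a version string if kosmosg is installed in this environment, else None.
print(installed_version("kosmosg"))
```

A `None` result usually means that pip installed the package into a different environment than the interpreter you are running.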

Usage

import torch
from kosmosg.main import KosmosG

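# Dummy inputs: a batch of one 3-channel 256x256 image and 1024 random token ids in [0, 20000)
img = torch.randn(1, 3, 256, 256)
text = torch.randint(0, 20000, (1, 1024))

# Build the model with its default settings and run a forward pass
model = KosmosG()
output = model(img, text)
print(output)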

Architecture

text, image => KosmosG => text tokens with multimodal understanding
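The data flow above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the kosmosg implementation: the class name `ToyMultimodalLM`, the patch size, the hidden width, and the single transformer layer are all hypothetical choices made for illustration. It turns the image into patch tokens, embeds the text ids, runs both through one shared transformer layer, and returns logits for the text positions only. Positional embeddings are left out for brevity.

```python
import torch
from torch import nn


class ToyMultimodalLM(nn.Module):
    """Hypothetical stand-in for illustration; NOT the kosmosg model."""

    def __init__(self, vocab_size=20000, dim=64, patch_size=32, heads=4):
        super().__init__()
        # Cut the image into non-overlapping patches and project each one to `dim`.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.token_embed = nn.Embedding(vocab_size, dim)
        self.block = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.to_logits = nn.Linear(dim, vocab_size)

    def forward(self, img, text):
        # (B, 3, H, W) -> (B, dim, H/p, W/p) -> (B, num_patches, dim)
        img_tokens = self.patch_embed(img).flatten(2).transpose(1, 2)
        txt_tokens = self.token_embed(text)  # (B, seq_len, dim)
        # Put image tokens first so that text positions can attend to them.
        x = torch.cat([img_tokens, txt_tokens], dim=1)
        x = self.block(x)
        # Keep only the text positions and map them to vocabulary logits.
        return self.to_logits(x[:, img_tokens.shape[1]:])


img = torch.randn(1, 3, 256, 256)
text = torch.randint(0, 20000, (1, 16))  # short sequence to keep the demo cheap
model = ToyMultimodalLM()
logits = model(img, text)
print(logits.shape)  # torch.Size([1, 16, 20000])
```

Prepending the image tokens is one common way to let a single sequence model see both modalities. The real package may combine them differently.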

License

MIT

Todo

  • Create Aligner in PyTorch
  • Create Diffusion module
  • Integrate these pieces
  • Create a training script


