Skip to main content

Swarms - Pytorch

Project description

Multi-Modality

BRAVE or Swarms of Vision Transformers

Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models". BRAVE achieves state-of-the-art performance on a broad range of captioning and VQA benchmarks and significantly reduces the aforementioned issues of VLMs, while requiring a smaller number of trainable parameters than existing methods and having a more compressed representation.

install

pip3 install brave-torch

usage

import torch
from brave_torch.main import SwarmOfViTs

# IMG Tensor
x = torch.randn(1, 3, 224, 224) 

# Model
model = SwarmOfViTs(
    image_size=224,
    patch_size=32,
    encoder_dim=512,
    encoder_depth=6,
    encoder_heads=8,
    num_of_vits=4
)

# Forward
out = model(x)
print(out)

Citations

Todo

  • Citation link
  • Citation Bibtex
  • Diagram photo
  • Implement Andromeda Base LLM architecture
  • Provide multi-modal tokenizer
  • Train and release the model

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

brave_torch-4.7.9.tar.gz (6.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

brave_torch-4.7.9-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file brave_torch-4.7.9.tar.gz.

File metadata

  • Download URL: brave_torch-4.7.9.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/23.3.0

File hashes

Hashes for brave_torch-4.7.9.tar.gz
Algorithm Hash digest
SHA256 ad2f4b63ef35520a08e8968016cc6b8870c3173b2008ba8a2655760ea0b310e6
MD5 f751266589f3c7e95a4c55a9a59bfab1
BLAKE2b-256 bb128b9ebe67ad9f7c4d74e31638e2fcbd5eb65f95c35c77a41bf3c42208dcb1

See more details on using hashes here.

File details

Details for the file brave_torch-4.7.9-py3-none-any.whl.

File metadata

  • Download URL: brave_torch-4.7.9-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/23.3.0

File hashes

Hashes for brave_torch-4.7.9-py3-none-any.whl
Algorithm Hash digest
SHA256 1ecd965fe9ef4ae554a31075cde450dc3f342bc29b44f0d4d4b591f388db7c83
MD5 74dc100e1d8e5ffd8ef63f25b7c07cca
BLAKE2b-256 eb7fe260d2829f894d9a9ab35c77dd76606566f9fc836a329393d3d0e3c3cba4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page