Namo is a nano-scale multi-modal training framework

Project description

Namo (纳摩): Nano Multi-Modal Training Framework

Introducing Namo, a dead-simple multi-modal training framework focused on training small MLLMs. More and more MLLMs are being open-sourced, yet small multi-modal LLMs with strong capabilities remain largely unexplored. We therefore built this framework for anyone who wants to train their own MLLM rather than finetune an existing one. Anyone can now train a base MLLM with ease. The more data you use when training the base model, the more capability you gain when training larger models.

Namo is not only a framework; it also captures our experience in training MLLMs. We make it easy to get an MLLM working on a small model first, and the same components (such as the ViT, the audio encoder, etc.) can then be adopted by larger LLMs, which greatly reduces overall training time and resources.

Our models not only perform well compared with other small VLMs but also support a wide range of downstream tasks. Their main advantages are:

  • dynamic input: Namo models use dynamic input and support input resolutions from 224 to 1080;
  • fewer tokens: Namo models need only 576 tokens even at an input resolution of 800, which makes them considerably more efficient than other VLMs;
  • flexible: unlike other VLMs, which are coupled to their LLMs, we use ViTs and LLMs that you can get from open source, so you can train your own version with any LLM;
  • audio: we support visual + audio + text input at the same time, unlike some other models that support only audio + text.
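To put the 576-token figure in perspective, here is a small arithmetic sketch. It counts the patch tokens a plain ViT would emit for a square image with no pooling or compression. The patch sizes 14 and 16 are common ViT choices and are assumptions for illustration; the description above does not state which ViT, patch size, or token-reduction method Namo uses, and `vit_patch_tokens` is a hypothetical helper, not part of the namo package.

```python
def vit_patch_tokens(resolution: int, patch_size: int) -> int:
    """Patch tokens a plain ViT emits for a square image (no pooling, no CLS token)."""
    side = resolution // patch_size  # patches along one edge
    return side * side


NAMO_TOKENS = 576  # token count stated in the list above (equals a 24 x 24 grid)

if __name__ == "__main__":
    # Patch sizes are assumed for illustration only.
    for patch in (14, 16):
        baseline = vit_patch_tokens(800, patch)
        ratio = baseline / NAMO_TOKENS
        print(f"patch={patch}: {baseline} tokens at 800 px, {ratio:.1f}x the 576-token budget")
```

Under these assumptions, an uncompressed ViT at 800 px would produce 3249 tokens (patch 14) or 2500 tokens (patch 16), so holding the count at 576 implies some form of token reduction on the vision side.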

Overall, Namo is not only a series of models but also an open training framework. We hope our work can push the field forward.

Download files

Source Distribution

namo-0.0.1.tar.gz (2.5 kB view details)

Uploaded Source

File details

Details for the file namo-0.0.1.tar.gz.

File metadata

  • Download URL: namo-0.0.1.tar.gz
  • Upload date:
  • Size: 2.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.9.13

File hashes

Hashes for namo-0.0.1.tar.gz
Algorithm    Hash digest
SHA256       bc123b4eb187cf236859fb81958dc52c25c6f00e822518a953b7257ed07c9089
MD5          61e89feaf577d298946b74c6121584b2
BLAKE2b-256  84cd2a279a7a0dcbcada331aed14d835dcf58538507f9b304063cd9550fa4366
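
The digests above can be used to check a downloaded archive before installing it. The following is a minimal sketch using only Python's standard `hashlib` module; the function names `sha256_of` and `matches_published_hash` are hypothetical helpers, and the local file path is whatever you pass on the command line.

```python
import hashlib
import sys

# SHA256 digest published above for namo-0.0.1.tar.gz
EXPECTED_SHA256 = "bc123b4eb187cf236859fb81958dc52c25c6f00e822518a953b7257ed07c9089"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """True if the file's SHA256 digest equals the expected hex digest."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Usage: python verify.py namo-0.0.1.tar.gz
    ok = matches_published_hash(sys.argv[1])
    print("hash OK" if ok else "hash MISMATCH")
    sys.exit(0 if ok else 1)
```

pip can perform the same check itself when a requirements file pins the digest with `--hash=sha256:...` and installation is run with `--require-hashes`.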
