Lightweight Python kit for easy multimodal data processing.
Project description
ezMM: Mini-Suite for Easy Multimodal Data Processing
This lightweight Python package aims to streamline and simplify the processing of multimodal data. The core philosophy of ezMM is to treat any data (whether strings, images, audios, tables, etc.) as a multimodal sequence.
Usage
Core is the MultimodalSequence class. Here is an example:
from ezmm import MultimodalSequence, Image
img1 = Image("in/roses.jpg")
img2 = Image("in/garden.jpg")
seq = MultimodalSequence("The image", img1, "shows two beautiful roses while",
img2, "shows a nice garden with many flowers.")
seq comprehensively aggregates the different modalities into one handy object. It also offers some useful features:
MultimodalSequence is stringifyable
print(seq)
will return
The image <image:1> shows two beautiful roses while <image:2> shows a nice garden with many flowers.
That is, non-string items in the MultimodalSequence get replaced by their unique reference when turned into strings.
MultimodalSequence understands references
Conversely, you can do
seq2 = MultimodalSequence("The image <image:1> shows two beautiful roses while <image:2> shows a nice garden with many flowers.")
which obeys seq == seq2. That is, MultimodalSequence resolves references within the input string and loads the corresponding items under the hood.
Access MultimodalSequence like a list
You can apply list comprehension to seq. For example,
seq[1] == img.
Easy modality checks
You can check for specific modalities like images quickly, e.g., with seq.has_images().
Feature Overview
- ✅ Image support
- ✅ Video support
- ✅ Saving and organizing media in a database along with their origin URL
- ✅ Rendering
MultimodalSequencein a web UI - ⏳ Duplication management: Identify and re-use duplicates
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ezmm-0.5.1.tar.gz.
File metadata
- Download URL: ezmm-0.5.1.tar.gz
- Upload date:
- Size: 19.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3efd53ec75a2dd6a0f5432a91b052d47ef380bfd86f8c59177148339985dbc18
|
|
| MD5 |
4504d8442ba2081dd1497d040e511d5c
|
|
| BLAKE2b-256 |
4b931510e7575bc496cc66592dd27ebda2a0f7be7be7946ec39d7f79ae0a9726
|
File details
Details for the file ezmm-0.5.1-py3-none-any.whl.
File metadata
- Download URL: ezmm-0.5.1-py3-none-any.whl
- Upload date:
- Size: 21.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4c4d158e25eaabdd730ef382fa8daf0d681c3707020a781011c0bffcebbc38fd
|
|
| MD5 |
cb3a28d7e569ad13488d38c5f7ea3f52
|
|
| BLAKE2b-256 |
f64a3682869450de46927eb5ec2c4c29e24432157ec333f41721bfec9bfe42bd
|