Paper - Pytorch
Project description
ClipQ (WIP)
An easy-to-use interface for experimenting with OpenAI's CLIP model by encoding image quadrants. By splitting images into quadrants and encoding each with CLIP, we can explore how the model perceives various parts of an image.
Appreciation
- Christopher in LAION for the idea
- Thanks to OpenAI for the CLIP model.
- Inspiration drawn from various CLIP-related projects in the community.
Table of Contents
Installation
Install the package via pip:
pip install clipq
Quickstart
Here's a brief example to get you started:
from clipq.main import CLIPQ
#init
test = CLIPQ(query_text="A photo of a cat")
#input, url => embed
vectors = test.run_from_url(url="https://picsum.photos/800", h_splits=3, v_splits=3)
#print
print(vectors)
Documentation
Contributing
- Fork the repository on GitHub.
- Clone the forked repository to your machine.
- Create a new branch with an appropriate name.
- Make your changes and commit with a meaningful commit message.
- Push your changes to your forked repository.
- Create a Pull Request against the original repository.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Todo
- Output captions of all 4 quadrants
- Make captions using any of the following: openclip G, OpenCLIP G or siglip L or EVA G
- Image Division: Ability to split an image into quadrants (2x2). Extended ability to split an image into 9 equal parts (3x3).
- Vector Representation: Generation of a CLIP vector for the entire image and individual CLIP vectors for each split part or quadrant.
- Sub-clip Concerns: Identification of hard chunking issues with standard quadrant splitting.
- Noise Reduction: Introduction of non-standard shapes (possibly polygons) for image parts to reduce noise. Aim to tackle interlacing issues during upscaling.
- Upscaling: Address potential tiling issues during the upscaling process.
- Flexibility in Sub-clipping: Configurable options to choose between 2x2 or 3x3 image division.
- Prior Training: Training mechanism to use the data of quadrant CLIP vectors.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
clipq-0.0.7.tar.gz
(5.2 kB
view details)
Built Distribution
clipq-0.0.7-py3-none-any.whl
(5.2 kB
view details)
File details
Details for the file clipq-0.0.7.tar.gz
.
File metadata
- Download URL: clipq-0.0.7.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ac72e69355c66f0c320f584903cc8f227f9122f372bb5a24d5fddcd0a262faa7 |
|
MD5 | a07decd1a033b428f70bd627ad78f78d |
|
BLAKE2b-256 | 9538e4e33e375a3e7021df81f3b173220109b13d45e9c33517d225b6d9fcbc9d |
File details
Details for the file clipq-0.0.7-py3-none-any.whl
.
File metadata
- Download URL: clipq-0.0.7-py3-none-any.whl
- Upload date:
- Size: 5.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a87e76453951e717151f8ce3989b7da8e2d5072903146c51f6e523c8543b7aeb |
|
MD5 | 55535f346fb1fd5d663e5005a2af92ac |
|
BLAKE2b-256 | 3dfe762a5c1d2c324997e2f9a405c820011f975eb25265a83b1888ce8cfb26f6 |