Lightweight Learned Image Compression
Project description
Lightweight Learned Image Compression (LLIC)
Installation
- Follow the installation instructions for torch and compressai
- Install LLIC via pip:
pip install LLIC
Pre-trained checkpoints
An imagenet-trained checkpoint for RGB images is available on huggingface: LLIC_rgb_v0.0.1.pth
Request access to other checkpoints (grayscale, hyperspectral, microscopy, etc)
Usage example
import torch
import zlib
import numpy as np
import compressai
from io import BytesIO
from IPython.display import display
from PIL import Image
from LLIC import LLIC
from torchvision.transforms import ToPILImage, PILToTensor
Load the model
checkpoint = torch.load("LLIC_rgb_v0.0.1.pth",map_location="cpu")
codec = LLIC.RateDistortionAutoEncoder()
codec.load_state_dict(checkpoint['model_state_dict'])
<All keys matched successfully>
Download example image
!wget https://r0k.us/graphics/kodak/kodak/kodim05.png
original_image = Image.open("kodim05.png")
original_image
The analysis and synthesis transforms expect dimensions to be multiples of of 16. Zero padding can be applied otherwise.
def pad(x, p=2**5):
h, w = x.size(2), x.size(3)
pad, _ = compressai.ops.compute_padding(h, w, min_div=p)
return torch.nn.functional.pad(x, pad, mode="constant", value=0)
def preprocess(pil_image):
tensor = PILToTensor()(pil_image)
tensor = tensor.unsqueeze(0)
tensor = tensor.to(torch.float)
tensor = tensor/255
tensor = tensor - 0.5
return pad(tensor)
Compress the image and save file
padded_image = preprocess(original_image)
original_size = padded_image.shape
compressed_image, compressed_shape = LLIC.compress(padded_image, codec)
with open("kodim05.llic", 'wb') as f:
f.write(compressed_image)
Decompress and view the image
def crop(x, size):
H, W = x.size(2), x.size(3)
h, w = size
_, unpad = compressai.ops.compute_padding(h, w, out_h=H, out_w=W)
return torch.nn.functional.pad(x, unpad, mode="constant", value=0)
def postprocess(tensor):
tensor = tensor[0] + 0.5
tensor = 255*tensor
tensor = tensor.clamp(0,255)
tensor = tensor.to(torch.uint8)
pil_image = ToPILImage()(tensor)
return pil_image
with open("kodim05.llic", 'rb') as f:
compressed_image = f.read()
tensor = LLIC.decompress(compressed_image, compressed_shape, codec)
recovered_image = postprocess(crop(tensor, (512,768)))
recovered_image
!jupyter nbconvert --to markdown README.ipynb
[NbConvertApp] Converting notebook README.ipynb to markdown
[NbConvertApp] Support files will be in README_files/
[NbConvertApp] Making directory README_files
[NbConvertApp] Writing 2873 bytes to README.md
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llic-0.1.13.tar.gz.
File metadata
- Download URL: llic-0.1.13.tar.gz
- Upload date:
- Size: 6.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
65536ba6d4fcb29fa301f7530ff1018c2fa8db5995cd77fc0b038d0404096b36
|
|
| MD5 |
a631b5099f6087cf383f6e1913ea8995
|
|
| BLAKE2b-256 |
38fa8a7e06bdbdd91351721a41f61bf5b886be8f99e5684e5835544bf3b813d6
|
File details
Details for the file LLIC-0.1.13-py3-none-any.whl.
File metadata
- Download URL: LLIC-0.1.13-py3-none-any.whl
- Upload date:
- Size: 5.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d473043c72dfe5f80b3b7686ff375e6dca1052311c174d1938a498274796fbd7
|
|
| MD5 |
3b101735bd33984c274b2fbf61e1ee7d
|
|
| BLAKE2b-256 |
1a823c3a9a6c16a51e8cf38ce93aa8c88886a5b9b6506a3a60c27f87ac4f7cc0
|