image processing and stuff
Project description
image2layout_computer_vision
An image processing module for some computer vision tasks (public module for image2layout)
Package Page: pypi
Features:
- Text Detection and Recognition (OCR)
- Color extraction (background and main foreground)
Installation
Install with python/conda [Linux]
- (Optional) Conda
curl https://repo.anaconda.com/archive/Anaconda3-2023.03-1-Linux-x86_64.sh -o ~/conda.sh
bash ~/conda.sh -b -f -p /opt/conda
rm ~/conda.sh
conda init --all --verbose
conda create -n cv python=3.10 -y
conda activate cv
- Python libraries (python>=3.8)
CPU
python -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
python -m pip install paddleocr paddlepaddle
python -m pip install datasets transformers scikit-learn Pillow numpy pandas chardet
python -m pip install --upgrade image2layout-computer-vision
GPU
# python -m pip install 'torch>=2.0' torchvision torchaudio
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia -y
python -m pip install paddleocr paddlepaddle-gpu
python -m pip install datasets transformers scikit-learn Pillow numpy pandas chardet
python -m pip install --upgrade image2layout-computer-vision
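After either install path, a quick check can confirm that the packages are visible to the active interpreter. The snippet below is a minimal sketch using only the standard library; `missing_modules` and `REQUIRED_MODULES` are hypothetical names introduced here, not part of this package. Note that some import names differ from their pip names (`paddlepaddle` imports as `paddle`, `scikit-learn` as `sklearn`, `Pillow` as `PIL`).

```python
from importlib.util import find_spec

# Import names (not pip names) for the packages installed above.
# Hypothetical list for illustration; trim it to what you actually installed.
REQUIRED_MODULES = [
    "torch", "torchvision", "paddleocr", "paddle", "datasets", "transformers",
    "sklearn", "PIL", "numpy", "pandas", "chardet", "image2layout_computer_vision",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

This only checks that each module can be located; it does not import them or verify GPU support.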
Install with docker
For running with CPU on Ubuntu
sudo docker build --tag cv -f Dockerfile_cpu .
sudo docker run -it -p 0.0.0.0:8000:8000 -p 0.0.0.0:8001:8001 -v $(pwd):/app cv bash
From inside the container
cd deployment
conda activate cv
python api_serve.py -n CV -p 8000
Install from git
python -m pip install git+https://github.com/felix-do-wizardry/image2layout-computer-vision.git
Usage
Note: An input image can be given as a file path, a PIL Image.Image object, or a numpy array; functions that take multiple images accept a list of these
- Run this python code to pre-download model weights
from image2layout_computer_vision import OCR
OCR._load()
- Recognize texts
import image2layout_computer_vision.ocr as OCR
# [A] no text, box only, 2 lists of dicts with keys [text (empty), box, score (empty)]
data_merged, data_raw = OCR.detect_text_data('path/to/image.png', recognition=False)
# [B] text + box from multiple images -> list of list of dicts with keys [text, box, score]
data_raw_multi = OCR.detect_text_elements(['path/to/image.png', 'path/to/image2.png'])
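Since each detected element is a dict with the keys text, box, and score, the results can be post-processed with plain Python. The following is a minimal sketch that keeps only confidently recognized text. It assumes score is a number where higher means more confident, and that "empty" values are falsy or non-numeric; neither is confirmed by this README. `confident_texts` is a hypothetical helper, and the sample data is hand-written for illustration, not real library output.

```python
def confident_texts(elements, min_score=0.5):
    """Return the non-empty text of elements whose score is at least min_score."""
    texts = []
    for element in elements:
        score = element.get("score")
        text = element.get("text")
        # Skip detection-only entries (empty text or non-numeric score).
        if not text or not isinstance(score, (int, float)):
            continue
        if score >= min_score:
            texts.append(text)
    return texts


# Hand-written placeholder data; the box format here is an assumption.
sample = [
    {"text": "Hello", "box": [0, 0, 50, 20], "score": 0.98},
    {"text": "w0rld", "box": [60, 0, 110, 20], "score": 0.31},
    {"text": "", "box": [0, 30, 50, 50], "score": None},
]
print(confident_texts(sample))  # ['Hello']
```

For the multi-image output of [B], the same helper could be applied per image, for example `[confident_texts(elements) for elements in data_raw_multi]`.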
- Extract colors
import image2layout_computer_vision as icv
# [A] list [ tuples [ 2 rgb-color tuples ] ] for background and foreground
# sample output: [((2, 2, 2), (4, 4, 4)), ((6, 6, 6), (8, 8, 8))]
colors_all = icv.extract_colors(['path/to/image.png', 'path/to/image2.png'])
# [B] 2 rgb-color tuples for background and foreground
# sample output: ((9, 9, 9), (6, 6, 6))
color_bg, color_fg = icv.extract_colors('path/to/image.png')
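The returned RGB tuples can be fed into ordinary color utilities. As a sketch, the helpers below format a color as a hex string and compute the WCAG 2.x contrast ratio between background and foreground. They assume each channel is an integer in the range 0 to 255, which matches the sample outputs above but is not stated explicitly; `rgb_to_hex`, `relative_luminance`, and `contrast_ratio` are hypothetical names, not functions of this package.

```python
def rgb_to_hex(color):
    """Format an (r, g, b) tuple of 0-255 integers as '#rrggbb'."""
    r, g, b = (int(channel) for channel in color)
    return "#{:02x}{:02x}{:02x}".format(r, g, b)


def relative_luminance(color):
    """WCAG 2.x relative luminance of an sRGB (r, g, b) tuple."""
    def linearize(channel):
        c = channel / 255
        return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linearize(channel) for channel in color)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(color_a, color_b):
    """WCAG contrast ratio between two colors, from 1 (identical) to about 21."""
    lum_a = relative_luminance(color_a)
    lum_b = relative_luminance(color_b)
    lighter, darker = max(lum_a, lum_b), min(lum_a, lum_b)
    return (lighter + 0.05) / (darker + 0.05)


# Placeholder colors standing in for (color_bg, color_fg).
color_bg, color_fg = (255, 255, 255), (0, 0, 0)
print(rgb_to_hex(color_bg), rgb_to_hex(color_fg))
print(round(contrast_ratio(color_bg, color_fg), 2))
```

A ratio close to 1 would suggest the extracted foreground and background are hard to tell apart.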
- Detect elements [work-in-progress]
import image2layout_computer_vision.yolov6 as Detection
# pd.DataFrame with columns [box, score, class_index, class_name]
df_element = Detection.detect_element('path/to/image.png')
Build
(for building and uploading this package)
python -m pip install --upgrade pip
python -m pip install --upgrade build twine "keyring<19.0"
rm -rf dist
python -m build
python -m twine upload dist/* --verbose
File details
Details for the file image2layout_computer_vision-0.1.10.tar.gz.
File metadata
- Download URL: image2layout_computer_vision-0.1.10.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0c2cbcf5ae1cd852472f919d009329263131fd8277c84b0ffb7399b6d44632d0 |
| MD5 | 758163cf299f36d3ca0de148a0d2f885 |
| BLAKE2b-256 | 5cb04100411dc2288e83518679bacb684565180eadd57af7f67ebbceef2c7744 |
File details
Details for the file image2layout_computer_vision-0.1.10-py3-none-any.whl.
File metadata
- Download URL: image2layout_computer_vision-0.1.10-py3-none-any.whl
- Upload date:
- Size: 19.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | de5daa1c53057daa6360ea539bcd0b79c6c0915c74ecbde526a3e3befff08fa2 |
| MD5 | 614876c086d63bb7325c1305ad80f507 |
| BLAKE2b-256 | 8a352f765b2a3b3d160645e276731317ecaa86e51ded28dc32e895beaaf76af1 |