이미지를 시각적 경계 기준으로 안전하게 섹션 분리 & OCR (Korean, Chinese)

These details have not been verified by PyPI

Project links

Project description

gandol2-ocr

이미지 섹션 분리와 OCR을 통합한 Python 유틸리티입니다. 세로로 긴 상세페이지 이미지를 시각적 경계 기준으로 안전하게 섹션 분리하고, 각 섹션에 대해 OCR을 수행할 수 있습니다.

✨ 주요 기능

🖼️ 이미지 섹션 분리 (Splitter)

세로로 긴 상세페이지 이미지를 시각적 경계 기준으로 자동 분리
OpenCV 기반의 고급 이미지 처리 알고리즘
시각적 경계 감지 및 안전한 섹션 분할
분리된 섹션 정보를 구조화된 데이터로 제공

🔍 OCR (Optical Character Recognition)

PaddleOCR 기반의 고성능 OCR 엔진
한국어(korean) 포함 다국어 지원
GPU/CPU 선택 가능
바운딩박스와 텍스트 결과를 JSON으로 저장

📦 설치

중요

# paddlepaddle-gpu==3.2.0 버전이 필수 입니다.
uv add paddlepaddle-gpu==3.2.0 -i https://www.paddlepaddle.org.cn/packages/stable/cu126/
# 이후 패키지를 설치해주세요

# uv 사용 (권장)
uv add gandol2-ocr

# pip 사용
pip install gandol2-ocr

또는 소스에서 설치:

# uv 사용
uv add git+https://github.com/gandol2/gandol2-ocr.git

# pip 사용
pip install git+https://github.com/gandol2/gandol2-ocr.git

🖥️ 요구 사항

Python 3.9-3.12 (pyproject.toml에 명시됨)
(선택) NVIDIA GPU + CUDA 12.6 환경 (PaddlePaddle GPU 지원)

🚀 사용법

1) 이미지 섹션 분리

from splitter import split_image_sections

# 이미지를 섹션으로 분리
sections = split_image_sections(
    image_path="./input/image.png",
    output_dir="./output"
)

print(f"총 {len(sections)}개 섹션 생성:")
for section in sections:
    print(f"  {section.order}. {section.top}-{section.bottom} ({section.height}px)")

2) OCR 실행

from image_ocr.ocr import run_ocr

# OCR 실행
results = run_ocr(
    input_image="./input/image.png",
    output_dir="./output",
    lang="korean",
    device="gpu"
)

# 결과 출력
for result in results:
    print(result["rec_texts"])

3) 통합 워크플로우

from splitter import split_image_sections
from image_ocr.ocr import run_ocr

# 1단계: 이미지 섹션 분리
sections = split_image_sections(
    image_path="./input/long_image.png",
    output_dir="./sections"
)

# 2단계: 각 섹션에 대해 OCR 수행
for section in sections:
    ocr_results = run_ocr(
        input_image=section.path,
        output_dir=f"./ocr_results/section_{section.order}",
        lang="korean",
        device="gpu"
    )
    print(f"섹션 {section.order}: {len(ocr_results)}개 텍스트 감지")

🧱 프로젝트 구조

gandol2-ocr/
├─ pyproject.toml
├─ README.md
├─ src/
│  ├─ splitter/           # 이미지 섹션 분리 모듈
│  │  ├─ __init__.py
│  │  └─ splitter.py
│  └─ image_ocr/          # OCR 모듈
│     ├─ __init__.py
│     └─ ocr.py
├─ examples/
│  ├─ demo_splitter.py    # 섹션 분리 예제
│  └─ demo_ocr.py         # OCR 예제
├─ input/                 # 입력 이미지
└─ output/                # 출력 결과

🔧 API 참조

Splitter API

`split_image_sections(image_path, output_dir, debug_verbose=True, save_diagnostic=True)`

이미지를 섹션으로 분리하는 메인 함수입니다.

매개변수:

image_path (str): 입력 이미지 경로
output_dir (str): 출력 디렉토리
debug_verbose (bool): 디버그 출력 여부 (기본: True)
save_diagnostic (bool): 진단 이미지 저장 여부 (기본: True)

반환값:

List[SectionInfo]: 섹션 정보 리스트

`SectionInfo` 클래스

@dataclass
class SectionInfo:
    order: int      # 섹션 순서
    top: int        # 상단 좌표
    bottom: int     # 하단 좌표
    height: int     # 섹션 높이
    path: str       # 저장된 파일 경로

OCR API

`run_ocr(input_image, output_dir, lang="korean", device="cpu")`

이미지에서 텍스트를 인식하는 함수입니다.

매개변수:

input_image (str): 입력 이미지 경로
output_dir (str): 출력 디렉토리
lang (str): OCR 언어 (기본: "korean")
device (str): "cpu" 또는 "gpu" (기본: "cpu")

반환값:

List[dict]: OCR 결과 리스트

🛠️ 개발 및 배포

로컬 빌드

uv build

TestPyPI 업로드

PyPI 업로드

twine upload dist/*

🔎 참고사항

의존성: OpenCV, PaddleOCR, PaddlePaddle, Matplotlib, Pandas, SciPy
GPU 지원: CUDA 12.6 환경에서 PaddlePaddle GPU 사용 가능
이미지 형식: PNG, JPG, JPEG 등 OpenCV 지원 형식
언어 지원: PaddleOCR이 지원하는 모든 언어 (한국어, 영어, 중국어 등)

📄 라이선스

MIT License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.4

Oct 21, 2025

0.0.3

Oct 21, 2025

This version

0.0.2

Oct 21, 2025

0.0.1

Oct 21, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gandol2_ocr-0.0.2.tar.gz (10.4 kB view details)

Uploaded Oct 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gandol2_ocr-0.0.2-py3-none-any.whl (11.5 kB view details)

Uploaded Oct 21, 2025 Python 3

File details

Details for the file gandol2_ocr-0.0.2.tar.gz.

File metadata

Download URL: gandol2_ocr-0.0.2.tar.gz
Upload date: Oct 21, 2025
Size: 10.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for gandol2_ocr-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`977e0cdbf197c0d3608799924618439b5d43439629908efff3a91a036e52adb4`
MD5	`79ab9c9ffeafeecf0f2739286c8fad64`
BLAKE2b-256	`0b94e595134f73623756bee1ec3b62dcd9ca64a72252d5447247557ee6178eb3`

See more details on using hashes here.

File details

Details for the file gandol2_ocr-0.0.2-py3-none-any.whl.

File metadata

Download URL: gandol2_ocr-0.0.2-py3-none-any.whl
Upload date: Oct 21, 2025
Size: 11.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for gandol2_ocr-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8949f47d1527660e125df3911f0a5660302ed4778e613e7955deceaf9ba46369`
MD5	`344a6f9e50bc18f43a22c2faef37e68e`
BLAKE2b-256	`60b7c8861fa302ba9590a15add8cd5ba6a980cc418369f9fd9ad1e810bec5e65`

See more details on using hashes here.

gandol2-ocr 0.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

gandol2-ocr

✨ 주요 기능

🖼️ 이미지 섹션 분리 (Splitter)

🔍 OCR (Optical Character Recognition)

📦 설치

중요

🖥️ 요구 사항

🚀 사용법

1) 이미지 섹션 분리

2) OCR 실행

3) 통합 워크플로우

🧱 프로젝트 구조

🔧 API 참조

Splitter API

split_image_sections(image_path, output_dir, debug_verbose=True, save_diagnostic=True)

SectionInfo 클래스

OCR API

run_ocr(input_image, output_dir, lang="korean", device="cpu")

🛠️ 개발 및 배포

로컬 빌드

TestPyPI 업로드

PyPI 업로드

🔎 참고사항

📄 라이선스

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`split_image_sections(image_path, output_dir, debug_verbose=True, save_diagnostic=True)`

`SectionInfo` 클래스

`run_ocr(input_image, output_dir, lang="korean", device="cpu")`