Thư viện xử lý ảnh thông minh

These details have not been verified by PyPI

Project links

Homepage

Project description

MagicImg - Thư viện xử lý ảnh thông minh

MagicImg là thư viện Python mạnh mẽ cho xử lý và tiền xử lý ảnh, được tối ưu cho OCR và computer vision.

🚀 Tính năng mới trong v1.0.8

✅ Logging Control: Hoàn toàn kiểm soát được logging levels (DEBUG/INFO/WARNING)
✅ Custom Logging: Hỗ trợ custom format và log to file
✅ Silent Mode: Chế độ WARNING level để tắt progress messages
✅ Debug Flexibility: Debug riêng từng module
✅ User Preference: Tôn trọng logging configuration của user

📦 Cài đặt

Cài đặt cơ bản

pip install magicimg

Cài đặt Tesseract OCR

Windows

# Sử dụng winget
winget install UB-Mannheim.TesseractOCR

# Thêm vào PATH
set PATH=%PATH%;C:\Program Files\Tesseract-OCR

Ubuntu/Debian

sudo apt-get update
sudo apt-get install tesseract-ocr tesseract-ocr-vie

macOS

brew install tesseract

🎯 Sử dụng nhanh

Basic Usage

import magicimg

# Xử lý ảnh cơ bản với preserve color
result = magicimg.process_image("input.jpg", "output.jpg", preserve_color=True)
print(f"Success: {result.success}")
print(f"Processing steps: {result.processing_steps}")

# Kiểm tra chất lượng ảnh
is_good, quality_info, enhanced = magicimg.check_image_quality("input.jpg")
print(f"Quality score: {quality_info['quality_score']}")

# Tăng cường ảnh
enhanced = magicimg.enhance_image("input.jpg", "enhanced.jpg")

# Preprocessing cho OCR (tối ưu cho text recognition)
result = magicimg.preprocess_for_ocr("input.jpg", "output.jpg", preserve_color=False)

API Mode (cho Google, Anthropic, Local API)

import magicimg

# Preprocessing cho Google Vision API
success, info, output_path = magicimg.preprocess_image_for_api(
    "input.jpg", 
    provider="google",
    output_dir="./output/",
    min_quality_score=0.3
)

print(f"Success: {success}")
print(f"Output: {output_path}")
print(f"Processing steps: {info['processing_steps']}")

📚 API Reference

Core Functions

`process_image(image_path, output_path=None, auto_rotate=True, preserve_color=True)`

Xử lý ảnh hoàn chỉnh với tất cả các bước tối ưu.

Parameters:

image_path (str | np.ndarray): Đường dẫn ảnh hoặc numpy array
output_path (str, optional): Đường dẫn lưu kết quả
auto_rotate (bool): Tự động xoay ảnh (default: True)
preserve_color (bool): Giữ nguyên màu sắc (default: True)

Returns:

ProcessingResult: Object chứa kết quả xử lý

`check_image_quality(image_path)`

Kiểm tra chất lượng ảnh và đánh giá các metrics.

Returns:

tuple(bool, dict, np.ndarray): (is_good, quality_info_dict, enhanced_image)

`preprocess_image_for_api(image_path, provider="google")`

Tiền xử lý ảnh tối ưu cho các API nhà cung cấp.

Parameters:

provider: "google", "anthropic", hoặc "local"

Returns:

tuple(bool, dict, str): (success, processing_info, output_path)

Quality Enhancement Functions

`enhance_image(image_path, output_path=None)`

Tăng cường chất lượng ảnh một cách nhẹ nhàng.

`enhance_image_for_ocr(image_path, output_path=None)`

Tối ưu ảnh đặc biệt cho OCR (chuyển binary).

Orientation & Skew Functions

`detect_skew(image_path)`

Phát hiện góc nghiêng của ảnh.

`correct_skew(image_path, angle=None)`

Sửa góc nghiêng của ảnh.

`rotate_image(image_path, angle)`

Xoay ảnh theo góc cho trước.

⚙️ Cấu hình nâng cao

Quality Thresholds

# Cấu hình ngưỡng chất lượng
config = {
    "min_blur_index": 100.0,      # Tăng yêu cầu độ nét
    "min_brightness": 200.0,      # Tăng yêu cầu độ sáng
    "min_contrast": 60.0,         # Tăng yêu cầu độ tương phản
    "min_resolution": (1200, 1600) # Tăng độ phân giải tối thiểu
}

processor = ImageProcessor(config=config)

Processing Modes

High Quality Mode (cho ảnh quan trọng)

config = {
    "min_blur_index": 120.0,
    "min_brightness": 220.0,
    "min_contrast": 70.0,
    "min_resolution": (1500, 2000)
}
result = magicimg.process_image("input.jpg", config=config, preserve_color=True)

Fast Processing Mode (cho batch processing)

config = {
    "min_blur_index": 60.0,
    "min_brightness": 160.0,
    "min_contrast": 40.0,
    "skip_rotation": True
}
result = magicimg.process_image("input.jpg", config=config)

OCR Optimization Mode

result = magicimg.preprocess_for_ocr("input.jpg", preserve_color=False)

🔍 Debug Mode

Bật chế độ DEBUG

import logging
import magicimg

# Cách 1: Bật debug trước khi import
logging.basicConfig(level=logging.DEBUG)
import magicimg  # Import sau khi set DEBUG

# Cách 2: Bật debug sau khi import
logging.getLogger().setLevel(logging.DEBUG)

Các level logging

import logging
import magicimg

# 1. DEBUG - Hiển thị tất cả thông tin
logging.basicConfig(level=logging.DEBUG)
# Sẽ hiển thị:
# - Chi tiết cấu hình
# - Thông số GPU
# - Các bước xử lý
# - Quality metrics
# - Line detection parameters
# - Skew detection details

# 2. INFO - Chỉ hiển thị tiến trình (mặc định)
logging.basicConfig(level=logging.INFO)
# Sẽ hiển thị:
# - 🔄 Đang xử lý: input.jpg
# - ✅ Hoàn thành: output.jpg

# 3. WARNING - Silent mode (chỉ warnings)
logging.basicConfig(level=logging.WARNING)
# Chỉ hiển thị warnings và errors

# 4. ERROR - Chỉ hiển thị errors
logging.basicConfig(level=logging.ERROR)

Custom Debug Output

import logging

# Log ra file
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(levelname)s - %(message)s',
    handlers=[logging.FileHandler('magicimg_debug.log')]
)

# Custom format
logging.basicConfig(
    level=logging.DEBUG,
    format='[%(levelname)s] %(message)s'
)

# Multiple handlers
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(message)s',
    handlers=[
        logging.FileHandler('debug.log'),
        logging.StreamHandler()  # Console output
    ]
)

Debug với GPU

import logging
logging.basicConfig(level=logging.DEBUG)

config = {
    "use_gpu": True,
    "gpu_id": 0,
    "gpu_memory_limit": 0.7
}

# Sẽ hiển thị chi tiết về GPU:
# - GPU name
# - Memory usage
# - CUDA version
# - Processing mode
processor = magicimg.ImageProcessor(config=config)

Debug Quality Metrics

import logging
logging.basicConfig(level=logging.DEBUG)

# Hiển thị chi tiết quality metrics
result = magicimg.process_image("input.jpg")
# DEBUG: Quality metrics:
#   - Blur index: 85.3
#   - Dark ratio: 0.15
#   - Brightness: 195.2
#   - Contrast: 55.8
#   - Resolution: (1200, 1600)

Debug Processing Steps

import logging
logging.basicConfig(level=logging.DEBUG)

# Hiển thị chi tiết các bước xử lý
result = magicimg.process_image("input.jpg")
# DEBUG: Processing steps:
#   1. Loading image
#   2. Checking quality
#   3. Detecting orientation
#   4. Correcting skew (2.5 degrees)
#   5. Enhancing contrast
#   6. Saving result

Tips khi debug

Set DEBUG trước khi import: Để capture tất cả log messages
Log to file: Khi cần analyze logs sau
Custom format: Thêm timestamps hoặc log levels
WARNING level: Khi chỉ quan tâm issues
DEBUG specific modules:

# Debug chỉ magicimg.core
logging.getLogger('magicimg.core').setLevel(logging.DEBUG)

# Debug chỉ GPU operations
logging.getLogger('magicimg.gpu').setLevel(logging.DEBUG)

🔧 Troubleshooting

Common Issues

1. GPU không được phát hiện

# Kiểm tra CUDA
python -c "import torch; print(torch.cuda.is_available())"

# Nếu False, cài đặt PyTorch với CUDA
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118

2. Tesseract error

# Windows: Thêm Tesseract vào PATH
set PATH=%PATH%;C:\Program Files\Tesseract-OCR

# Linux: Cài đặt Tesseract
sudo apt-get install tesseract-ocr

3. Memory errors với ảnh lớn

# Giảm batch_size và GPU memory limit
config = {
    "batch_size": 2, 
    "gpu_memory_limit": 0.5
}
processor = magicimg.ImageProcessor(config=config)

4. Debug logs không hiển thị

# Đảm bảo set DEBUG trước khi import
import logging
logging.basicConfig(level=logging.DEBUG)  # ĐẶT TRƯỚC import!

import magicimg  # Import sau

# Hoặc set level sau import
logging.getLogger().setLevel(logging.DEBUG)

System Information

import magicimg
magicimg.print_system_info()
# In thông tin hệ thống và dependencies

📊 Performance Tips

1. Batch Processing

from magicimg import ImageProcessor

processor = ImageProcessor(config={"batch_size": 8})

# Xử lý nhiều ảnh cùng lúc
image_paths = ["img1.jpg", "img2.jpg", "img3.jpg"]
for path in image_paths:
    result = processor.process_image(path)
    print(f"Processed {path}: {result.success}")

2. Memory Management

# Sử dụng context manager cho GPU
from magicimg.core import gpu_memory_manager

with gpu_memory_manager():
    result = magicimg.process_image("large_image.jpg")

3. Preserve Color vs OCR Mode

# Cho ảnh cần giữ chất lượng cao
result = magicimg.process_image("photo.jpg", preserve_color=True)

# Cho text recognition
result = magicimg.preprocess_for_ocr("document.jpg", preserve_color=False)

🧪 Testing

Run API Consistency Tests

# Test tất cả API functions
python -c "
import magicimg
print('Testing basic APIs...')
result1 = magicimg.check_image_quality('input.jpg')
result2 = magicimg.process_image('input.jpg')
result3 = magicimg.preprocess_image_for_api('input.jpg')
print('All APIs working correctly!')
"

Performance Benchmark

import time
import magicimg

start = time.time()
result = magicimg.process_image("test.jpg")
end = time.time()

print(f"Processing time: {end-start:.2f}s")
print(f"Success: {result.success}")

📝 Examples

Complete Processing Pipeline

import magicimg

def process_document(input_path, output_dir="./output/"):
    """Xử lý document hoàn chỉnh"""
    
    # 1. Kiểm tra chất lượng
    is_good, quality_info, enhanced = magicimg.check_image_quality(input_path)
    print(f"Quality score: {quality_info['quality_score']:.2f}")
    
    # 2. Xử lý với preserve color
    result = magicimg.process_image(
        input_path, 
        output_path=f"{output_dir}/processed.jpg",
        preserve_color=True
    )
    
    # 3. Tạo version cho OCR
    ocr_result = magicimg.preprocess_for_ocr(
        input_path,
        output_path=f"{output_dir}/ocr_ready.jpg", 
        preserve_color=False
    )
    
    # 4. API processing
    api_success, api_info, api_path = magicimg.preprocess_image_for_api(
        input_path,
        provider="google",
        output_dir=output_dir
    )
    
    return {
        "quality": quality_info,
        "processed": result.success,
        "ocr_ready": ocr_result.success,
        "api_ready": api_success,
        "steps": result.processing_steps
    }

# Sử dụng
results = process_document("document.jpg")
print(results)

🆕 Changelog v1.0.8

Fixed

✅ Sửa lỗi logging interference với user configuration
✅ Sửa lỗi DEBUG logs hiển thị khi không cần
✅ Cải thiện logging initialization
✅ Sửa lỗi logging handlers bị ghi đè

Improved

🚀 Tối ưu logging system với nhiều levels
🚀 Thêm custom logging format support
🚀 Cải thiện debug output readability
🚀 Silent mode cho production environment

Added

➕ Comprehensive logging documentation
➕ Debug mode examples
➕ Logging level control
➕ Module-specific debugging

🤝 Contributing

Fork repository
Tạo feature branch (git checkout -b feature/AmazingFeature)
Commit changes (git commit -m 'Add some AmazingFeature')
Push to branch (git push origin feature/AmazingFeature)
Tạo Pull Request

📄 License

Distributed under the MIT License. See LICENSE for more information.

🙋‍♂️ Support

📧 Email: shumi2011@gmail.com
🐛 Issues: GitHub Issues
📖 Documentation: Full Docs

Made with ❤️ by MagicImg Team

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.10

Jun 22, 2025

This version

1.0.9

Jun 22, 2025

1.0.8

Jun 22, 2025

1.0.7

Jun 22, 2025

1.0.6

Jun 22, 2025

1.0.5

Jun 22, 2025

1.0.4

Jun 22, 2025

1.0.3

Jun 22, 2025

1.0.2

Jun 22, 2025

1.0.1

Jun 22, 2025

1.0.0

Jun 22, 2025

0.1.3

Jun 14, 2025

0.1.2

Jun 14, 2025

0.1.1

Jun 14, 2025

0.1.0

Jun 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

magicimg-1.0.9.tar.gz (36.9 kB view details)

Uploaded Jun 22, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

magicimg-1.0.9-py3-none-any.whl (34.5 kB view details)

Uploaded Jun 22, 2025 Python 3

File details

Details for the file magicimg-1.0.9.tar.gz.

File metadata

Download URL: magicimg-1.0.9.tar.gz
Upload date: Jun 22, 2025
Size: 36.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.7

File hashes

Hashes for magicimg-1.0.9.tar.gz
Algorithm	Hash digest
SHA256	`492ccc5cbf1ffbc72af5d48407b486332749edf30d22e0f18f5abee54557855a`
MD5	`4762f05eebf3f2e3df9d5fd9e062cf90`
BLAKE2b-256	`cf5f962c3245c45419f4d7f1d09dda0a56664a062c6c794bd860e22f5ac6f81c`

See more details on using hashes here.

File details

Details for the file magicimg-1.0.9-py3-none-any.whl.

File metadata

Download URL: magicimg-1.0.9-py3-none-any.whl
Upload date: Jun 22, 2025
Size: 34.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.7

File hashes

Hashes for magicimg-1.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`74e8a638acd8e588570db51bb7cfa3459e17e724a6cdd17ff9f7f2772a0e04d1`
MD5	`ff9532f3786375ad83f25852b388690c`
BLAKE2b-256	`47d28b4e190f26d19ad7864bd6650f1d927c10a153831728871e037ca7698602`

See more details on using hashes here.

magicimg 1.0.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MagicImg - Thư viện xử lý ảnh thông minh

🚀 Tính năng mới trong v1.0.8

📦 Cài đặt

Cài đặt cơ bản

Cài đặt Tesseract OCR

Windows

Ubuntu/Debian

macOS

🎯 Sử dụng nhanh

Basic Usage

API Mode (cho Google, Anthropic, Local API)

📚 API Reference

Core Functions

process_image(image_path, output_path=None, auto_rotate=True, preserve_color=True)

check_image_quality(image_path)

preprocess_image_for_api(image_path, provider="google")

Quality Enhancement Functions

enhance_image(image_path, output_path=None)

enhance_image_for_ocr(image_path, output_path=None)

Orientation & Skew Functions

detect_skew(image_path)

correct_skew(image_path, angle=None)

rotate_image(image_path, angle)

⚙️ Cấu hình nâng cao

Quality Thresholds

Processing Modes

High Quality Mode (cho ảnh quan trọng)

Fast Processing Mode (cho batch processing)

OCR Optimization Mode

🔍 Debug Mode

Bật chế độ DEBUG

Các level logging

Custom Debug Output

Debug với GPU

Debug Quality Metrics

Debug Processing Steps

Tips khi debug

🔧 Troubleshooting

Common Issues

1. GPU không được phát hiện

2. Tesseract error

3. Memory errors với ảnh lớn

4. Debug logs không hiển thị

System Information

📊 Performance Tips

1. Batch Processing

2. Memory Management

3. Preserve Color vs OCR Mode

🧪 Testing

Run API Consistency Tests

Performance Benchmark

📝 Examples

Complete Processing Pipeline

🆕 Changelog v1.0.8

Fixed

Improved

Added

🤝 Contributing

📄 License

🙋‍♂️ Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

`process_image(image_path, output_path=None, auto_rotate=True, preserve_color=True)`

`check_image_quality(image_path)`

`preprocess_image_for_api(image_path, provider="google")`

`enhance_image(image_path, output_path=None)`

`enhance_image_for_ocr(image_path, output_path=None)`

`detect_skew(image_path)`

`correct_skew(image_path, angle=None)`

`rotate_image(image_path, angle)`