Skip to main content

Get text from HWP/HWPX file

Project description

Description

HWP 파일 내 텍스트를 문자열 형태로 추출해주는 파이썬 라이브러리입니다. (HWPX 내 문자열 치환도 지원)

Developed by

  • Suh Seungwan(Yumeta lab)

Thanks to

  • Faith6

사용법

HWP 파일

import gethwp
hwp = gethwp.read_hwp('test.hwp')
print(hwp)

HWPX 파일

import gethwp
hwp = gethwp.read_hwpx('test.hwpx')
print(hwp)

HWPX 파일 내 텍스트 치환

HWPX 파일만 가능합니다.

import gethwp
gethwp.change_word('test.hwpx','test_output.hwpx','Find Text','Replace Text')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gethwp-1.1.1.tar.gz (3.8 kB view hashes)

Uploaded Source

Built Distribution

gethwp-1.1.1-py3-none-any.whl (3.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page