Skip to main content

Convert Korean HWP (Hangul Word Processor) files to Markdown

Project description

hwp-to-md

Convert Korean HWP (Hangul Word Processor) files to Markdown format.

Installation

pip install hwp-to-md

Usage

Command Line

# Convert single file
hwp-to-md convert document.hwp

# Convert with custom output path
hwp-to-md convert document.hwp -o output.md

# Convert all HWP files in directory
hwp-to-md convert ./documents/

# Check dependencies
hwp-to-md check

# Install dependencies
hwp-to-md install

Python API

from hwp_to_md import convert_hwp, batch_convert

# Single file
success, message = convert_hwp("document.hwp")

# Batch convert
results = batch_convert("./documents/")
for file_path, success, message in results:
    print(f"{file_path}: {'✓' if success else '✗'} {message}")

Requirements

  • Python 3.8+
  • pyhwp (hwp5html)
  • html2text

How it works

  1. HWP → HTML: Uses hwp5html from pyhwp to extract HTML
  2. HTML → Markdown: Uses html2text to convert to Markdown

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hwp_to_md-0.1.0.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hwp_to_md-0.1.0-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file hwp_to_md-0.1.0.tar.gz.

File metadata

  • Download URL: hwp_to_md-0.1.0.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for hwp_to_md-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5597f6d2729219278a4af4e9a9cf498d6a3f23aed4c7faf0ed51ce723dca5fb5
MD5 a3641b847709ec3e76cb956b0b89d914
BLAKE2b-256 48e0146dd82a8d8d1b6ce017345de9822b3a5071fd82809f6d38e50a6bb68d99

See more details on using hashes here.

File details

Details for the file hwp_to_md-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: hwp_to_md-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for hwp_to_md-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2fdef508138dd9337a87570ab6254b6ad2e200a2f87b06642352bb891d66c75e
MD5 ceee55f7e6955a3bb036841b10a01b95
BLAKE2b-256 b3e3725c0d94048a9c28727c9ff96a0d3c9882c8761f8427ca71b0c99709eadb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page