MarkItDown plugin for HWP documents.
HWP is a document format used by Hancom Office, the most widely used word processor in South Korea — commonly found in government, legal, and academic documents.
Powered by docpler.
pip install markitdown-hwpmarkitdown and docpler are installed automatically.
from markitdown import MarkItDown
md = MarkItDown(enable_plugins=True)
result = md.convert("document.hwp")
print(result.text_content)No extra configuration needed — just enable_plugins=True and HWP files are recognized automatically.
HWP(한글 워드프로세서) 파일을 지원하는 MarkItDown 플러그인입니다. 내부적으로 docpler를 사용합니다.
pip install markitdown-hwpmarkitdown과 docpler가 자동으로 설치됩니다.
from markitdown import MarkItDown
md = MarkItDown(enable_plugins=True)
result = md.convert("document.hwp")
print(result.text_content)Business Source License 1.1 (BSL 1.1) — Free to use, cannot be provided as a managed service. Converts to Apache 2.0 on 2031-04-05.
This product was developed with reference to the HWP document file (.hwp) specification published by Hancom.