Skip to content

docpler/markitdown-hwp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

markitdown-hwp

MarkItDown plugin for HWP documents.

HWP is a document format used by Hancom Office, the most widely used word processor in South Korea — commonly found in government, legal, and academic documents.

Powered by docpler.

Installation

pip install markitdown-hwp

markitdown and docpler are installed automatically.

Usage

from markitdown import MarkItDown

md = MarkItDown(enable_plugins=True)
result = md.convert("document.hwp")
print(result.text_content)

No extra configuration needed — just enable_plugins=True and HWP files are recognized automatically.


한국어

HWP(한글 워드프로세서) 파일을 지원하는 MarkItDown 플러그인입니다. 내부적으로 docpler를 사용합니다.

설치

pip install markitdown-hwp

markitdowndocpler가 자동으로 설치됩니다.

사용법

from markitdown import MarkItDown

md = MarkItDown(enable_plugins=True)
result = md.convert("document.hwp")
print(result.text_content)

License

Business Source License 1.1 (BSL 1.1) — Free to use, cannot be provided as a managed service. Converts to Apache 2.0 on 2031-04-05.

HWP Format Notice

This product was developed with reference to the HWP document file (.hwp) specification published by Hancom.

About

MarkItDown plugin for HWP (Korean document format). Powered by Docpler.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages