← all repositories

xberg-io/html-to-markdown

High-performance HTML to Markdown converter with multi-language bindings, part of a document intelligence platform that includes OCR.

784 stars HTML Data ToolingOther AI
html-to-markdown
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

This library converts HTML to Markdown format following CommonMark specifications. It is maintained as part of the Kreuzberg project, a polyglot document intelligence engine with a Rust core. The engine can extract structured data from over 56 document formats using streaming parsers and built-in optical character recognition. The tool is available across multiple programming language ecosystems (Rust, Python, Node.js, Java, Go, C#, PHP, Ruby) and is tagged for use in RAG pipelines.

Frequently asked

What is xberg-io/html-to-markdown?
High-performance HTML to Markdown converter with multi-language bindings, part of a document intelligence platform that includes OCR.
Is html-to-markdown open source?
Yes — xberg-io/html-to-markdown is open source, released under the MIT license.
What language is html-to-markdown written in?
xberg-io/html-to-markdown is primarily written in HTML.
How popular is html-to-markdown?
xberg-io/html-to-markdown has 784 stars on GitHub and is currently holding steady.
Where can I find html-to-markdown?
xberg-io/html-to-markdown is on GitHub at https://github.com/xberg-io/html-to-markdown.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.