xberg-io/html-to-markdown
High-performance HTML to Markdown converter with multi-language bindings, part of a document intelligence platform that includes OCR.

This library converts HTML to Markdown format following CommonMark specifications. It is maintained as part of the Kreuzberg project, a polyglot document intelligence engine with a Rust core. The engine can extract structured data from over 56 document formats using streaming parsers and built-in optical character recognition. The tool is available across multiple programming language ecosystems (Rust, Python, Node.js, Java, Go, C#, PHP, Ruby) and is tagged for use in RAG pipelines.
Frequently asked
- What is xberg-io/html-to-markdown?
- High-performance HTML to Markdown converter with multi-language bindings, part of a document intelligence platform that includes OCR.
- Is html-to-markdown open source?
- Yes — xberg-io/html-to-markdown is open source, released under the MIT license.
- What language is html-to-markdown written in?
- xberg-io/html-to-markdown is primarily written in HTML.
- How popular is html-to-markdown?
- xberg-io/html-to-markdown has 784 stars on GitHub and is currently holding steady.
- Where can I find html-to-markdown?
- xberg-io/html-to-markdown is on GitHub at https://github.com/xberg-io/html-to-markdown.