Docling
Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.
About
Docling is IBM's open-source document conversion library that transforms complex documents into clean, structured formats. It handles PDFs, Word, PowerPoint, Excel, images, and HTML, outputting Markdown or JSON suitable for LLM ingestion.