Docling

IBM

Docling simplifies document processing by parsing diverse formats, including advanced PDF understanding, and by integrating with the generative AI ecosystem.

About

Docling is IBM's open-source document conversion library that transforms complex documents into clean, structured formats. It handles PDFs, Word, PowerPoint, Excel, images, and HTML, outputting Markdown or JSON suitable for LLM ingestion.
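As a rough illustration of the conversion flow described above, the following minimal sketch uses Docling's Python package to turn a document into Markdown or JSON. It assumes `docling` is installed (`pip install docling`). The helper names `output_path_for` and `convert`, the `SUPPORTED_OUTPUTS` table, and the file name `report.pdf` are hypothetical and only serve the example.

```python
import json
from pathlib import Path

# Hypothetical mapping from output format to file extension (not part of Docling).
SUPPORTED_OUTPUTS = {"markdown": ".md", "json": ".json"}


def output_path_for(source: str, fmt: str) -> Path:
    """Derive an output file name from the source file name (illustrative helper)."""
    if fmt not in SUPPORTED_OUTPUTS:
        raise ValueError(f"unsupported output format: {fmt}")
    return Path(Path(source).name).with_suffix(SUPPORTED_OUTPUTS[fmt])


def convert(source: str, fmt: str = "markdown") -> str:
    """Convert a document with Docling and return Markdown or JSON text."""
    if fmt not in SUPPORTED_OUTPUTS:
        raise ValueError(f"unsupported output format: {fmt}")

    # Imported lazily so the helper above stays usable without docling installed.
    from docling.document_converter import DocumentConverter

    result = DocumentConverter().convert(source)
    if fmt == "markdown":
        return result.document.export_to_markdown()
    # export_to_dict() returns a plain dictionary, serialized here as JSON.
    return json.dumps(result.document.export_to_dict(), indent=2)


if __name__ == "__main__":
    source = "report.pdf"  # assumed local file; replace with a real document
    for fmt in SUPPORTED_OUTPUTS:
        target = output_path_for(source, fmt)
        target.write_text(convert(source, fmt), encoding="utf-8")
        print(f"wrote {target}")
```

The Markdown output is usually the more convenient form for passing text to an LLM, while the JSON output keeps the document structure for downstream processing.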

Deployment Options

1 stack
