Docling parses PDF, DOCX, PPTX, HTML, and other formats into a rich unified representation including document layout, tables etc., making them ready for generative AI workflows like RAG.
This integration provides Docling’s capabilities via the DoclingLoader
document loader.
langchain-docling
from your package manager, e.g. pip:
DoclingLoader
class in langchain-docling
seamlessly integrates Docling into
LangChain, enabling you to: