Docling
Open-source toolkit for parsing complex documents (PDF, DOCX, HTML) into structured JSON/Markdown — purpose-built for Generative AI and RAG ingestion pipelines.
Category: ETL under Data & Analytics