🗂️ Parsing of multiple document formats incl. PDF, DOCX, PPTX, XLSX, HTML, WAV, MP3, WebVTT, images (PNG, TIFF, JPEG, ...), LaTeX, plain text, and more 📑 Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more 🧬 Unified, expressive DoclingDocument representation format ↪️ Various export formats and options, including Markdown,

