PyMuPDF4LLM
Native PDF Structure Intelligence for LLMs
PyMuPDF4LLM is a lightweight extension for PyMuPDF that converts PDFs and other supported documents into clean, structured data for LLM and RAG workflows, with Markdown, JSON and text extraction, layout analysis, OCR support, and LlamaIndex/LangChain integrations.
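As a rough illustration of the Markdown extraction path in a RAG workflow, the sketch below wraps the library's `pymupdf4llm.to_markdown` call and then splits the result into heading-delimited chunks. The helper names `pdf_to_markdown` and `split_by_heading` are hypothetical, chosen for this example, and the heading-based split is a naive assumption about how one might chunk the output, not something the library prescribes.

```python
def pdf_to_markdown(pdf_path: str) -> str:
    """Convert a document to Markdown text (hypothetical wrapper)."""
    # Third-party dependency, imported lazily: pip install pymupdf4llm
    import pymupdf4llm

    return pymupdf4llm.to_markdown(pdf_path)


def split_by_heading(markdown: str) -> list[str]:
    """Split Markdown into chunks, starting a new chunk at each heading line.

    Naive by design: any line starting with '#' is treated as a heading,
    so '#' lines inside fenced code blocks would also trigger a split.
    """
    chunks: list[str] = []
    current: list[str] = []
    for line in markdown.splitlines():
        if line.startswith("#") and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    # Drop chunks that were only whitespace.
    return [chunk for chunk in chunks if chunk]
```

With a local file (the name `report.pdf` is a placeholder), `split_by_heading(pdf_to_markdown("report.pdf"))` would yield one text chunk per heading, which could then be passed to an embedding or indexing step.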