PyMuPDF

Fast, feature-rich PDF toolkit for text/HTML extraction, images, metadata, and page rendering with coordinates.

document-parsing-frameworksRecently releasednpm: pdf-lib

Hero Score

Popularity

Performance

Ecosystem

Maturity

Dev Experience

⭐ 10,204 stars⬇ 22.4M downloads/wkFirst release: Apr 2016Last release: Jun 2026

Async Support: NoPlugin Extensions: MediumSpeed: FastDoc Quality: HighLearning Curve: Medium

• Fastest PDF library in benchmarks with low memory usage
• Precise text coordinates and excellent structure preservation for layout-aware parsing
• Multi-format support (PDF, XPS, EPUB) with access to images, fonts, and annotations