Check for page.find_tables returning None

Oege_Dijk · December 3, 2025, 7:52am

Hi there,

I am running into an issue where we modify the pdf (deleting text) before running extraction, and this seems to corrupt the pdf in a way that crashes pymupdf4llm, because page.find_tables() returns None instead of raising, but then the None is not handled in the next line:

pymupdf_rag.py:1031-1032

tabs = page.find_tables(clip=parms.clip,
strategy=table_strategy)
for t in tabs.tables: # No None check!

The fix would be:
tabs = page.find_tables(clip=parms.clip,
strategy=table_strategy)
if tabs is not None:
for t in tabs.tables:

or maybe there should be a flag to either raise or ignore the tables?

Jamie_Lemon · December 3, 2025, 9:38pm

Thanks @Oege_Dijk can you confirm which version of pymupdf_rag.py you are using? The latest one here: pymupdf4llm/pymupdf4llm/pymupdf4llm/helpers/pymupdf_rag.py at main · pymupdf/pymupdf4llm · GitHub seems to be different? Perhaps the latest version doesn’t exhibit this problem?

Topic		Replies	Views
Pymupdf layout table detection issue PyMuPDF	14	141	February 24, 2026
Bug: `ValueError: min() iterable argument is empty` in `table.bbox` when calling `to_markdown() PyMuPDF	3	31	May 5, 2026
Import pymupdf4llm silently activates pymupdf.layout and changes find_tables() results PyMuPDF font , watermarking	1	31	May 24, 2026
Bug: pymupdf4llm: mis-interpreted layout and IndexError on specific pages (insurance policy PDF) PyMuPDF	5	49	January 6, 2026
fitz.Page.find_tables() misses last column when using horizontal_strategy="text"` How To	10	117	July 21, 2025

Check for page.find_tables returning None

Related topics