Astra
April 21, 2026, 8:56am
1
Hello, I'm getting the error ValueError: invalid literal for int() with base 10: '3,585' when running:
page_list = pymupdf4llm.to_text(
    file_path,
    page_chunks=True,
    use_ocr=False,
    ignore_images=True,
    ignore_graphics=True,
    write_images=False,
    embed_images=False,
    header=False,
    footer=False,
    pages=[206],
    show_progress=True,
)
on this pdf: https://static.elisa.com/v2/image/2tqybbhjs47b/4X1mA7vtwlpAw693BjgPdy/Elisa_AnnualReport_2025.pdf
Can you confirm what versions of PyMuPDF & PyMuPDF4LLM you are using? I am using version 1.27.2.2 for both and the file parses well for me with no error.
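In case it helps anyone comparing setups, here is a minimal sketch for printing the relevant installed versions in one go. It relies only on the standard library's importlib.metadata. The helper name installed_versions is made up for illustration, and the distribution names are assumed to match what pip lists.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names):
    """Return {distribution name: version string, or None if not installed}."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # Distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    # Distribution names as they would appear in `pip list` (assumed).
    report = installed_versions(["pymupdf", "pymupdf4llm", "tabulate"])
    for name, ver in report.items():
        print(f"{name}: {ver or 'not installed'}")
```

Including indirect dependencies such as tabulate in the report can matter, since a difference there may not show up when only the two main packages are compared.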
Astra
April 21, 2026, 1:37pm
3
pymupdf4llm.__version__
'1.27.2.2'
pymupdf.__version__
'1.27.2.2'
tabulate.__version__
'0.10.0'
I am using Python version 3.14.3
Aha! Interesting. I was on tabulate 0.9.0 and everything worked; after upgrading to 0.10.0 I see the error too. I need to investigate further!
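For reference, the message itself is what Python's built-in int() raises when handed a number with a thousands separator. The sketch below only reproduces that message in isolation. It is not taken from the tabulate or PyMuPDF4LLM source, and where exactly the conversion happens is still unknown. The helper parse_grouped_int is hypothetical, shown only to illustrate what a comma-tolerant conversion could look like.

```python
def parse_grouped_int(text: str) -> int:
    """Hypothetical helper: parse an int that may use ',' as a thousands separator."""
    return int(text.replace(",", ""))


# Plain int() rejects the comma-grouped string seen in the traceback.
try:
    int("3,585")
except ValueError as exc:
    message = str(exc)

# Stripping the separators first lets the conversion succeed.
value = parse_grouped_int("3,585")

print(message)
print(value)
```

Until the cause is pinned down, tabulate 0.9.0 is the only configuration reported to work in this thread.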