The chars given by page.get_texttrace() includes font glyph id and its font name. However, provided that fonts in pdf usually come in multiple subsets, does pymupdf include any information about which subset a font glyph may be referring to (assuming there are multiple subsets of the same font name)?
Hi @Vallery ![]()
There is a global option you could try pymupdf.TOOLS.set_subset_fontnames(True).
The default is 0 (False) which omits the subset identifier.