Error in extracting the electronic signature from the PDF #4190
Comments
We cannot deal with issues that we cannot reproduce. Please provide the respective file.
Please be aware that we will close issues that we cannot reproduce within at most five days.
Sorry, my files contain confidential information and cannot be disclosed. I tried to create a non-confidential PDF with an electronic signature on my own (it does not match the confidential files). I found that d = page.get_text("dict")
Method
Please close the issue if our above discussion explains your observations.
Thank you for your answer. Is there a situation where Page.get_text("dict") returns an image, but Page.get_image_info() reports xref = 0 for the same image?
Yes, this is very possible:
Description of the bug
Some electronic signatures are missed during extraction.
How to reproduce the bug
Why can't this function extract the electronic signature on the first page, when it can extract the signatures on the other pages?
The signature on the first page:
The contents of d:
The signature on the other pages (image information is present):
Moreover, the image information for the electronic signature returned by ima_info = page.get_image_info() is incorrect. The ima_info doesn't have image
This field contains no binary image stream, and the cross-reference number (xref) is 0, so the image cannot be extracted.
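For reference, here is a minimal sketch of how image data can be pulled from the output of page.get_text("dict") instead of relying on an xref. It assumes the documented layout of that dictionary: a "blocks" list in which image blocks have "type" equal to 1 and carry the binary content under "image", along with "ext" and "bbox". The helper name extract_images and the file name in the usage comment are hypothetical and are not part of PyMuPDF. As far as the PyMuPDF documentation describes it, the entries returned by page.get_image_info() never contain the image bytes, and an xref of 0 (reported when xrefs=True is passed) indicates an inline or otherwise undetectable image.

```python
def extract_images(text_dict):
    """Collect (bbox, ext, data) for every image block in a get_text("dict") result.

    text_dict is assumed to be the dictionary returned by Page.get_text("dict").
    Blocks with type 1 are image blocks; text blocks (type 0) are skipped.
    """
    images = []
    for block in text_dict.get("blocks", []):
        if block.get("type") != 1:
            continue  # not an image block
        data = block.get("image")
        if not data:
            continue  # image block without binary content
        images.append((tuple(block.get("bbox", ())), block.get("ext", "bin"), data))
    return images


# Usage with PyMuPDF ("input.pdf" is a placeholder file name):
#
#   import pymupdf
#   doc = pymupdf.open("input.pdf")
#   for page in doc:
#       for n, (bbox, ext, data) in enumerate(extract_images(page.get_text("dict"))):
#           with open(f"page{page.number}-img{n}.{ext}", "wb") as fh:
#               fh.write(data)
```

Because the bytes come directly from the text dictionary, this approach does not depend on the image having an xref. It only finds images that actually appear as image blocks on the page, so it would not explain why a given signature is absent from d.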
PyMuPDF version
1.24.11
Operating system
Linux
Python version
3.9