You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when doing a simple test on my devcontainer (python 3.11)
with open('test.pptx', 'rb') as f:
kind = filetype.guess(f.read())
print(kind.mime, kind.extension)
it returns application/zip zip
converter.convert('test.pptx')
Input document test.pptx does not match any allowed format.
File format not allowed: test.pptx
it works from a remote location because there the content type is specified, but not from a local import
ps pptx is essentially a zip file
latest versions i'm using docling==2.8.3
Thanks
ps2 your documentation on the installation for Tesseract needs some update, that doesn't work, i found the way to do it (much later) in the custom conversion example
The text was updated successfully, but these errors were encountered:
I have the same problem, probably it's related to this: h2non/filetype.py#171
the 'file' utility correctly detect the application/vnd.openxmlformats-officedocument.presentationml.presentation mimetype on the same document
when doing a simple test on my devcontainer (python 3.11)
it returns application/zip zip
Input document test.pptx does not match any allowed format.
File format not allowed: test.pptx
it works from a remote location because there the content type is specified, but not from a local import
ps pptx is essentially a zip file
latest versions i'm using docling==2.8.3
Thanks
ps2 your documentation on the installation for Tesseract needs some update, that doesn't work, i found the way to do it (much later) in the custom conversion example
The text was updated successfully, but these errors were encountered: