Describe the bug
Installing the package pypdf_table_extraction does not appear to install two dependencies it needs (cv2 and pdfium), which prevents the package from working "out of the box".
I read the installation instructions and the associated page on installing the dependencies, which indicate that pdfium has replaced ghostscript as the default image conversion backend, so I assumed that a simple pip install of pypdf_table_extraction would be sufficient. When trying to replicate the README example I ran into two errors: the first because cv2 was not installed, and the second an image conversion failure with the 'ghostscript' backend, which I do not have installed. Installing opencv-python and pypdfium2 with pip resolved my issue.
Since both packages are pip installable, it would make sense to declare them as dependencies that are installed together with pypdf_table_extraction, so that the package works "out of the box".
Steps to reproduce the bug
create new venv
pip install pypdf_table_extraction
run README example
observe error ModuleNotFoundError: No module named 'cv2' from core.py line 16
pip install opencv-python
run README example
observe error camelot.backends.image_conversion.ImageConversionError: Image conversion failed with image conversion backend 'ghostscript' from image_conversion.py line 134
pip install pypdfium2
run README example (it now works)
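For anyone hitting the same errors, the check below is a minimal sketch (not part of pypdf_table_extraction) that reports which of the two packages from the steps above is missing in the current environment. The helper name `missing_packages` and the `REQUIRED` mapping are made up for this illustration; the import names `cv2` and `pypdfium2` and the pip names `opencv-python` and `pypdfium2` are the ones used in the steps.

```python
import importlib.util

# Import name -> pip distribution name, taken from the reproduction steps.
# This mapping is illustrative, not the package's own dependency list.
REQUIRED = {"cv2": "opencv-python", "pypdfium2": "pypdfium2"}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for module, pip_name in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing; install with: pip install " + " ".join(missing))
    else:
        print("cv2 and pypdfium2 are both importable.")
```

Running this in a fresh venv right after `pip install pypdf_table_extraction` should show whether the two packages were pulled in automatically.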
Expected behavior
create new venv
pip install pypdf_table_extraction
run README example (it works out of the box)
Code
from README example
PDF
from README example
Screenshots
N/A
Environment
OS: Windows 11
Python version: 3.12.0
Numpy version: 2.2.3
OpenCV version: 4.11.0.86
Ghostscript version: N/A
pypdf_table_extraction version: 1.0.1
Additional context
N/A