Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dependencies not installed #348

Open
DoomedJupiter opened this issue Feb 27, 2025 · 0 comments
Open

Dependencies not installed #348

DoomedJupiter opened this issue Feb 27, 2025 · 0 comments
Labels
bug Something isn't working

Comments

@DoomedJupiter
Copy link

Describe the bug
It appears the install of the package pypdf_table_extraction does not install two needed dependencies (cv2 and pdfium) which prevents the package from working "out of the box".

I did read the installation instructions and the associated installing the dependencies, which indicates that pdfium has replaced ghostscript as the default image conversion backend, so I assumed that a simple pip install of the package pypdf_table_extraction would be sufficient. When trying to replicate the README example I ran into two errors, the first related to not having cv2 installed, and the second related to not having ghostscript installed. A pip install of cv2 and pdfium resolved my issue.

Since these two packages are also pip installable, it would make sense to include these as dependencies that are installed at the same time as pypdf_table_extraction so that the package works "out of the box".

Steps to reproduce the bug

  1. create new venv
  2. pip install pypdf_table_extraction
  3. run README example
  4. observe error ModuleNotFoundError: No module named 'cv2' from core.py line 16
  5. pip install opencv-python
  6. run README example
  7. observe error camelot.backends.image_conversion.ImageConversionError: Image conversion failed with image conversion backend 'ghostscript' from image_conversion.py line 134
  8. pip install pypdfium2
  9. run README example (it now works)

Expected behavior

  1. create new venv
  2. pip install pypdf_table_extraction
  3. run README example (it works out of the box)

Code

from README example

PDF

from README example

Screenshots

N/A

Environment

  • OS: Windows 11
  • Python version: 3.120
  • Numpy version: 2.2.3
  • OpenCV version: 4.11.0.86
  • Ghostscript version: N/A
  • pypdf_table_extraction version: 1.0.1

Additional context

N/A

@DoomedJupiter DoomedJupiter added the bug Something isn't working label Feb 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant