Petastorm not working due to PyArrow version hell #806

Open
kiranzo opened this issue Aug 27, 2024 · 2 comments

Comments

@kiranzo

kiranzo commented Aug 27, 2024

I was trying to build something from the petastorm dataset minimal example.
When I tried to launch my code, I got this error:

    from pyarrow.filesystem import LocalFileSystem
ModuleNotFoundError: No module named 'pyarrow.filesystem'

Because apparently it's called pyarrow.fs now?
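For anyone hitting the same import error, here is a minimal sketch for checking which of the two module names the installed pyarrow actually exposes. The helper name `probe_pyarrow_filesystems` is made up for illustration; it only tries the imports and reports what happened, and it does not patch petastorm.

```python
import importlib


def probe_pyarrow_filesystems():
    """Report which pyarrow filesystem modules can be imported in this environment.

    Hypothetical helper for diagnosis only; not part of petastorm or pyarrow.
    """
    report = {}
    for name in ("pyarrow", "pyarrow.filesystem", "pyarrow.fs"):
        try:
            importlib.import_module(name)
            report[name] = "importable"
        except Exception as exc:
            # Broad on purpose: a broken install can raise more than ImportError.
            report[name] = f"failed: {type(exc).__name__}: {exc}"
    return report


if __name__ == "__main__":
    for module_name, status in probe_pyarrow_filesystems().items():
        print(f"{module_name}: {status}")
```

If `pyarrow.filesystem` fails while `pyarrow.fs` imports, the installed pyarrow no longer provides the legacy module that the failing import expects.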

After downgrading my pyarrow version to 13.0, then to 12.0 as suggested here, I got a different error:

Traceback (most recent call last):
  File "/.myenv/lib/python3.8/site-packages/petastorm/utils.py", line 21, in <module>
    import pyarrow
  File "/.myenv/lib/python3.8/site-packages/pyarrow/__init__.py", line 65, in <module>
    import pyarrow.lib as _lib
  File "pyarrow/compat.pxi", line 43, in init pyarrow.lib
  File "/.myenv/lib/python3.8/site-packages/cloudpickle/__init__.py", line 1, in <module>
    from . import cloudpickle
  File "/.myenv/lib/python3.8/site-packages/cloudpickle/cloudpickle.py", line 73, in <module>
    import uuid
  File "/usr/lib/python3.8/uuid.py", line 60, in <module>
    _platform_system = platform.system()
  File "/usr/lib/python3.8/platform.py", line 892, in system
    return uname().system
  File "/usr/lib/python3.8/platform.py", line 858, in uname
    processor = _syscmd_uname('-p', '')
  File "/usr/lib/python3.8/platform.py", line 612, in _syscmd_uname
    import subprocess
  File "/usr/lib/python3.8/subprocess.py", line 218, in <module>
    _PopenSelector = selectors.SelectSelector
AttributeError: module 'selectors' has no attribute 'SelectSelector'
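One possible reading of this traceback (an assumption, nothing in this thread confirms it): an error like `module 'selectors' has no attribute 'SelectSelector'` usually means the standard-library `selectors` module is being shadowed by another `selectors.py` that comes earlier on `sys.path`, for example when a script is run from inside a directory that contains its own `selectors.py`. The sketch below lists standard-library modules that resolve to a file outside the stdlib directory. `find_shadowed_stdlib` is a hypothetical helper name.

```python
import importlib.util
import sysconfig
from pathlib import Path


def find_shadowed_stdlib(names=("selectors", "subprocess", "platform", "uuid")):
    """Return {module_name: path} for modules that resolve outside the stdlib directory.

    Hypothetical diagnostic helper; the default names are the stdlib modules
    that appear in the traceback above.
    """
    stdlib_dir = Path(sysconfig.get_paths()["stdlib"]).resolve()
    shadowed = {}
    for name in names:
        spec = importlib.util.find_spec(name)
        origin = getattr(spec, "origin", None)
        # Built-in and frozen modules have no file on disk to compare against.
        if origin in (None, "built-in", "frozen"):
            continue
        path = Path(origin).resolve()
        if stdlib_dir not in path.parents:
            shadowed[name] = str(path)
    return shadowed


if __name__ == "__main__":
    hits = find_shadowed_stdlib()
    if not hits:
        print("no shadowed stdlib modules found among the checked names")
    for module_name, location in hits.items():
        print(f"{module_name} resolves to {location}")
```

Run it from the same working directory and with the same interpreter as the failing script, since the result depends on `sys.path`. If `selectors` shows up in the output, changing the pyarrow version alone is unlikely to make this particular error go away.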

What version of pyarrow is it supposed to use??

@westfly

westfly commented Sep 11, 2024

@kiranzo as described in the issue "Validate_schema keyword not supported yet":

 pip install pyarrow==10.0.1
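After pinning, it can help to confirm that the interpreter running the script really sees that version, since a different virtualenv or a stale install is easy to miss. A small sketch using only the standard library (`check_pin` is a hypothetical helper name, and `10.0.1` is simply the pin suggested in this comment, not a verified compatibility guarantee):

```python
from importlib import metadata  # available in the standard library since Python 3.8


def check_pin(package="pyarrow", expected="10.0.1"):
    """Return a short message comparing the installed version with the expected pin."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return f"{package} is not installed"
    if installed == expected:
        return f"{package}=={installed} matches the pin"
    return f"{package}=={installed} installed, expected {expected}"


if __name__ == "__main__":
    print(check_pin())
```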

@kiranzo

kiranzo commented Sep 16, 2024

> @kiranzo as described in the issue "Validate_schema keyword not supported yet":
>
>     pip install pyarrow==10.0.1

Gratz! We've already settled for Pandas with parquet format for now though 😅
