Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integer Overflow Error on String Field #64

Open
nickvalin1 opened this issue Jul 28, 2021 · 0 comments
Open

Integer Overflow Error on String Field #64

nickvalin1 opened this issue Jul 28, 2021 · 0 comments

Comments

@nickvalin1
Copy link

nickvalin1 commented Jul 28, 2021

If you have a dataset in Domo with a field of type string that contains numeric data with values larger than your system's signed integer maximum (i.e. some sort of 12+ digit numeric id value), the ds_get function will throw and integer overflow error.

This occurs because the Domo API exports this field in the csv with quotation marks to denote that it is a string, not a number, causing the pandas read_csv function to create the field with a dtype of "object". The ds_get function then (on line 195) attempts to interpret all object type fields as dates. Then, down the stack, the Pandas datetime parser tries to read the entire string as an integer. Because this happens in Cython, it throws an overflow error.

Here's the stack trace (minus my code):

[ERROR] OverflowError: signed integer is greater than maximum
Traceback (most recent call last):
File "/var/task/pydomo/init.py", line 195, in ds_get
df[col] = to_datetime(df[col])
File "/var/task/pandas/core/tools/datetimes.py", line 805, in to_datetime
values = convert_listlike(arg._values, format)
File "/var/task/pandas/core/tools/datetimes.py", line 472, in _convert_listlike_datetimes
allow_object=True,
File "/var/task/pandas/core/arrays/datetimes.py", line 2081, in objects_to_datetime64ns
require_iso8601=require_iso8601,
File "pandas/_libs/tslib.pyx", line 364, in pandas._libs.tslib.array_to_datetime
File "pandas/_libs/tslib.pyx", line 591, in pandas._libs.tslib.array_to_datetime
File "pandas/_libs/tslib.pyx", line 726, in pandas._libs.tslib.array_to_datetime_object
File "pandas/_libs/tslib.pyx", line 717, in pandas._libs.tslib.array_to_datetime_object
File "pandas/_libs/tslibs/parsing.pyx", line 243, in pandas._libs.tslibs.parsing.parse_datetime_string
File "/var/task/dateutil/parser/_parser.py", line 1374, in parse
return DEFAULTPARSER.parse(timestr, **kwargs)
File "/var/task/dateutil/parser/_parser.py", line 655, in parse
ret = self._build_naive(res, default)
File "/var/task/dateutil/parser/_parser.py", line 1241, in _build_naive
naive = default.replace(**repl)

chris-brundage added a commit to chris-brundage/domo-python-sdk that referenced this issue Jul 26, 2022
…vert Dataset fields with large integers represented as strings into dates.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant