signal SIGABRT in testing #33
The branch is https://github.com/brtnfld/hdf5/tree/ASYNC_F
To run the test:
|
I'm also seeing periodic hangs with 8 ranks, but that is probably a separate issue:
|
@brtnfld, can you add your full test code file here?
It is also here: line 252 is the issue.
Got it. Is there a C version of this test code?
No, only Fortran.
@brtnfld I'm able to reproduce the error. The issue comes from HDF5 checking whether an attribute is already open in H5Oattribute.c; that check does not seem to handle the future IDs used by the async VOL when some attributes are already created/opened and some are not. I found two workarounds that avoid this error:
I forget whether it was Neil or Jordan who looked at this issue before; can you check with them to see if there is a better solution? Also, the test code seems to always segfault at the end:
|
Thanks, I'll ask Jordan and Neil. I've not seen that segmentation fault before, though I've only run it on a local desktop.
BTW, even if I add an ESwait after the last exists, it still fails.
@brtnfld does setting the environment variable work for you?
Yes, HDF5_ASYNC_EXE_FCLOSE fixes the issue.
@houjun could you share more details of your debugging? Looking through the future ID code, I'm having trouble understanding how this could happen.
Hi @fortnern, I have tried two things in my debugging that seem to fix this issue. The first is to comment out the code in the HDF5 library (lines 473-479 and 512 of H5Oattribute.c) so that HDF5 doesn't check whether an attribute is already opened. The second, in vol-async, is to delay the execution of all attribute operations until a later time (e.g. file close time).
It passes most of the time, but running it over and over, I can sometimes get it to fail with:
|
@brtnfld I think this is probably the same opened-attribute issue I mentioned earlier. Did you set "export HDF5_ASYNC_EXE_FCLOSE=1"?
That was my mistake. It got removed in my editing of the run script. Using that, all the tests pass.
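Since the failure is intermittent, a small loop can help confirm whether the workaround holds across many runs. The following is only a sketch: the helper name run_repeat, the run count, and the commented-out mpiexec invocation with the binary name ./async_test are assumptions for illustration, not taken from the actual run script. The only thing taken from this thread is the HDF5_ASYNC_EXE_FCLOSE=1 setting.

```shell
#!/bin/sh
# run_repeat (hypothetical helper): run a command N times and print
# how many of those runs exited with a non-zero status.
# Usage: run_repeat N cmd [args...]
run_repeat() {
    runs=$1
    shift
    failures=0
    i=1
    while [ "$i" -le "$runs" ]; do
        # Discard the command's output; only the exit status matters here.
        if ! "$@" >/dev/null 2>&1; then
            failures=$((failures + 1))
        fi
        i=$((i + 1))
    done
    echo "$failures"
}

# Workaround discussed in this thread: defer async execution to file close.
export HDF5_ASYNC_EXE_FCLOSE=1

# Assumed invocation; substitute the real launcher and test binary:
# run_repeat 50 mpiexec -n 6 ./async_test
```

A result of 0 means no run failed. Unsetting the variable and repeating the loop gives a rough failure rate for comparison. A hung run will block the loop, so the periodic hang mentioned earlier would need a separate timeout.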
As I'm developing the Fortran async tests in HDF5, I'm seeing an issue with H5Aopen_async_f (backtrace below).
Sometimes the test fails and sometimes it does not. I'm running on 6 ranks.
It is basically doing: