Bug: Jacobian runs not completing with MERRA2 meteorology in v2.0 #281
Comments
Hi @sabourbaray. I was able to run the IMI with MERRA-2 using the latest state of Can you try to run the IMI with the settings in my configuration file and see if you get the same errors? I noticed you made several changes to your config file (domain, buffer cells, etc.), so verifying whether you can at least get the out-of-the-box example run going will help us narrow down what the issue is. If you're still having the issue, can you try turning on verbose in both geoschem_config.rc and HEMCO_Config.rc for your jacobian runs? It may be easiest to do this in the template_run files, then regenerate your jacobian_run directories. That should provide additional printout so we can see exactly where the model is seg faulting.
Thanks @msulprizio. I'm running into the same bug but I have a lot of extra information to help narrow it down a bit. See below:
But when it is processing the container:
Similarly, I am seeing incorrect values when HEMCO is processing
Investigating both of these .nc files, I don't see any obvious indication of spurious values, although I did notice StateVector.nc had one NaN value in this run directory. But when I swapped it out for a different state vector file, produced by an earlier 1.2 version of the IMI and without the NaN value, the same result occurred. I also don't notice anything unusual about the Perturbations_000x.txt files.
So in summary, the issue appears to be related to how HEMCO is processing the StateVector.nc and HEMCO_sa_diagnostics.$YYYY$MM$DD0000.nc files at this grid resolution, which results in unusual -9.99E+30 fill values that cause the model to crash when it begins time stepping. It's strange that this doesn't happen on your machines at Harvard or on our machines when running at 0.25° resolution. I am hoping there is something simple on my end that I've overlooked. Open to any suggestions for what to try next.
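For anyone trying to repeat the file check described above, here is a minimal sketch of how the values could be screened for NaN entries and for fill-like values such as the -9.99E+30 reported in this comment. It is only an illustration: `scan_values` and `FILL_THRESHOLD` are hypothetical names, the threshold is an assumed cutoff rather than a HEMCO constant, and the function works on a plain list of floats so it does not depend on any particular NetCDF reader.

```python
import math

# Assumed cutoff for "fill-like" magnitudes, chosen to catch the -9.99E+30
# values mentioned above. This is an illustrative guess, not a HEMCO constant.
FILL_THRESHOLD = 1.0e30


def scan_values(values):
    """Count NaN, infinite, and fill-like entries in a flat sequence of floats."""
    counts = {"total": 0, "nan": 0, "inf": 0, "fill_like": 0}
    for v in values:
        counts["total"] += 1
        if math.isnan(v):
            counts["nan"] += 1
        elif math.isinf(v):
            counts["inf"] += 1
        elif abs(v) >= FILL_THRESHOLD:
            counts["fill_like"] += 1
    return counts


# Made-up sample values, standing in for a flattened variable from a file.
sample = [1.0, 2.0, float("nan"), -9.99e30, 3.0]
print(scan_values(sample))
```

The values of a variable in StateVector.nc or in the HEMCO diagnostics file could be flattened into a list with whichever NetCDF reader is available and passed to `scan_values`. A nonzero `nan` or `fill_like` count would point to the grid cells worth inspecting.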
Thanks @sabourbaray. Can you confirm that you're using the latest HEMCO version in your runs? I assume when you did the out-of-the-box run you cloned the correct version of GCClassic pegged to that version of the IMI and did
Hi @msulprizio. The version of GCClassic is 14.4.1 and HEMCO is 3.9.1.
OK. The fix I was referring to is geoschem/HEMCO#229 and went into HEMCO 3.7.1, so that should not impact you.
Name: Sabour Baray
Institution: Environment and Climate Change Canada
Version: I am using a forked version of the IMI v2.0 localized for compatibility with the ECCC HPC system (operational-eccc/v2.0a) and GCClassic v14.4.1.
Description: The IMI runs normally with GEOS-FP meteorology at 0.25° resolution. When running with MERRA2 meteorology at 0.5° resolution, the inversion stops at the Jacobian component. Jacobian 0000 completes successfully, indicating no issue with the MERRA2 met files, but each of the subsequent simulations halts immediately after "beginning time stepping" (see below). There is no error message produced by GEOS-Chem, but I do get a "stack smashing detected" warning before a segmentation fault. This error also persists when setting NumJacobianTracers to 1.
Rollback and input files testing: Note that this issue begins with IMI versions after the update to Jacobian tracers. When rolling back to IMI v1.2 (operational-eccc/v1.2) and GCClassic v12.0.1, I am able to run with MERRA2 meteorology and 0.5° resolution for any time period between 2018–2023 normally.
Follow-up questions: Are any other users able to reproduce this issue on their machines? I would like to know if this may be due to an error on my end, or if there's a bug in the way the tracers are programmed for this grid resolution in the new Jacobian runs.