Machine Learning based vertical diffusivity in EPBL mixing scheme used for ocean surface boundary layer #737

aakashsane · 2024-10-09T02:15:16Z

Requesting to add a few subroutines to the EPBL vertical mixing module.  These changes enhances the existing vertical diffusivity used in EPBL with machine learning, and makes it constrained on second moment closure. Using symbolic regression and empirical fitting, a shape function ( g(\sigma) ) has been formulated that responds to changes in the surface forcing (Latitude, wind stress, surface buoyancy flux, boundary layer depth). g(\sigma) goes from zero to 1 and its skewness changes as per surface forcing conditions. The velocity scale, v_0, is an approximation that depends on (Coriolis f, ustar, and surface buoyancy flux).
When v_0 is combined with g(\sigma) and multiplied by the energetics based boundary layer depth h, i.e \nu = . g(\sigma) X v_0 X h, we get a diffusivity which is constrained on a second moment closure.

The subroutines are activated by using the flags:

Equation_Discovery_shape = True
Equation_Discovery_velocity = True
Equation_Discovery_velocity_h = False
activates the new equation for shape function. 2. Activates velocity scale equations that approximate neural networks as described in Sane et al. 2023. 3. Is a new velocity scale that includes boundary layer depth as input and agrees with well known scaling.
Either 2. or 3. should be 'True', both cannot be True or False.

The new subroutines have been tested by running the OM4 configuration (Adcroft et al. 2019) and comparing the simulation against observations (SST from WOA and Mixed Layer Depth from ARGO). The equations are approximating the neural networks. The neural network based diffusivity has been published in Sane et al. 2023 ( https://doi.org/10.1029/2023MS003890 ) and the equations based diffusivity will soon be submitted for a publication.
The changes have also been tested using the scale test.

All the commits can be squashed into one as only the latest is relevant.

…ts for ePBL

updating ML_diffusivity to latest dev/gfdl

Ml diffusivity

replaced the sigma to z coord algorithm used to map neural network output with a simpler and better algorithm that finds the shape function on the hz vertical grid.

… hz grid.

some typo corrections

extra unwanted file. deleted.

f_lower is a lower cap on abs_f used inside equation for v_0. A cap is required to avoid singularity. Capping at any value below 1 deg is okay, solution is not sensitive. f_lower was tested for 1 deg and 0.1 deg. SST and MLD did not change.

Merging latest dev/gfdl

updating latest dev/gfdl with ML_diffusivity

updating

…iffusivity

deleted unwanted file

codecov · 2024-10-31T16:19:58Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 40.94%. Comparing base (f90b071) to head (992f266).
Report is 5 commits behind head on dev/gfdl.

Additional details and impacted files

@@             Coverage Diff              @@
##           dev/gfdl     #737      +/-   ##
============================================
+ Coverage     36.63%   40.94%   +4.30%     
============================================
  Files           274       42     -232     
  Lines         84153     5288   -78865     
  Branches      15834     1013   -14821     
============================================
- Hits          30829     2165   -28664     
+ Misses        47509     2938   -44571     
+ Partials       5815      185    -5630

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

adcroft

I can't (shouldn't) make the final approval on this since I'm involved in the research, but there are some obvious things that need to be changed before someone else reviews the PR. Once these are cleaned up, my take is it will be smooth sailing.

You added several files that I think should not be included:

.testing/tc4/Makefile is generated by autoconf using the Makefile.in template
.testing/gen_data.dSYM/Contents.Info.plist - looks like something leftover from your a mac build process
.testing/gen_grid.dSYM/Contents.Info.plist - ditto

Please remove these files. I believe the only file you meant to submit changes for is MOM_energetic_PBL.F90. There's also a real variable with missing unit documentation which I added a separate inline comment for.

adcroft · 2024-11-21T16:20:01Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

-
+
+  ! variables for ML based diffusivity
+  real :: v0_dummy ! Variable which get recycled, set equal to v_0


This real variable needs units in the comment (looks like it should be the same as CS%vo)

Thanks for pointing it out.
I have completed:

Units added to that variable.

All the files mentioned above have been deleted.

Added units to a missing variables (v0_dummy)

file deleted

adcroft · 2024-11-25T21:27:48Z

I've just been discussing with Bob, and we think we need to add a new version of get_param to solve a problem that has been highlighted by this PR, namely that we're adding 16 run-time parameters, for one scheme, when they could be a single line. We'll look into adding a vector default option and get back to this PR shortly.

…nput

adcroft · 2024-12-10T16:27:36Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

-          if (CS%MixLenExponent==2.0) then
-            MixLen_shape(K) = CS%transLay_scale + (1.0 - CS%transLay_scale) * &
-                 (max(0.0, (MLD_guess - dz_rsum)*I_MLD) )**2 ! CS%MixLenExponent
+


It looks like you are replacing some existing code, rather than adding a new option. I suspect you need to add elseif (CS%eqdisc_v0 .or.CS%eqdisc_v0h) then

The subroutines associated with CS%eqdisc_v0 and CS%eqdisc_v0h are different.
CS%eqdisc_v0 calls the subroutine 'get_eqdisc_v0(CS,absf,B_flux,u_star,v0_dummy)'
while
CS%eqdisc_v0h calls 'get_eqdisc_v0h(CS,B_flux,u_star,MLD_guess,v0_dummy)'.

Both of them formulate v0 in a different manner.
So it would not be ideal to call them in a single elseif statement.

Hallberg-NOAA · 2025-01-01T21:49:46Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+    call kappa_eqdisc(MixLen_shape, CS, GV, h, absf, B_flux, u_star, MLD_guess)
+  else
+  do K=2,nz+1
+    h_rsum = h_rsum + h(k-1)*GV%H_to_Z


I think that the line above is a copy from a much older version of the code and relocated from about line 987 of the code that is being replaced. Unfortunately, this line is reintroducing the Boussinesq reference density into non-Boussinesq configurations, something that we had put a great deal of effort into avoiding, and that was merged onto the dev/gfdl branch of MOM6 on Sept. 23, 2023. Please use dz instead of GV%H_to_z*h here, and if the subroutine getshapefunction() is (as it appears to be) largely intended to be a copy of the code that was displaced from about 981, please ensure that it is a true copy.

Hallberg-NOAA · 2025-01-01T21:57:11Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+  nz = SZK_(GV)+1
+  hz(1) = 0.0
+  do K=2,nz
+    hz(K) = hz(K-1) + h(K-1)*GV%H_to_Z


Please do not use GV%H_to_Z (which resolves to 1/Rho0, where Rho0 is the Boussinesq reference density) when in non-Boussinesq mode. You might find it convenient to pass the dz(:) array that is already being passed into ePBL_column down into this routine and work with that instead of h(:).

Hallberg-NOAA · 2025-01-01T21:58:52Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+  if (CS%eqdisc) then ! update Kd as per Machine Learning equation discovery
+    call kappa_eqdisc(MixLen_shape, CS, GV, h, absf, B_flux, u_star, MLD_guess)
+  else
+  do K=2,nz+1


Please revise this code to follow the MOM6 2-point indenting convention, as described in the MOM6 style guide at github.com/NOAA-GFDL/MOM6/wiki/Code-style-guide.

Hallberg-NOAA · 2025-01-01T22:01:24Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

          else
-            MixLen_shape(K) = CS%transLay_scale + (1.0 - CS%transLay_scale) * &


Please eliminate this empty else block or add at least a comment explaining why it is here. Also please follow the MOM6 2-point intending conventions in this block of code.

(I particularly dislike that the way that git Blame handles this code with the removed lines from this block, and how git Blame on the final code makes it look like I was the one who wrote the code with an empty else block and the one who is flouting the MOM6 indenting conventions, thereby making me look like a hypocrite!)

Hallberg-NOAA · 2025-01-01T22:05:39Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

@@ -1271,6 +1286,10 @@ subroutine ePBL_column(h, dz, u, v, T0, S0, dSV_dT, dSV_dS, SpV_dt, TKE_forcing,
                !  instead of redoing the computation will change answers...
                  Kd(K) = (h_dz_int(K)*vstar) * CS%vonKar *  ((dz_tt*hbs_here)*vstar) / &
                        ((CS%Ekman_scale_coef * absf) * (dz_tt*hbs_here) + vstar)
+
+                  elseif (CS%eqdisc .eqv. .true.)  then  ! ML-eqdisc line2/2 


In the line above, please replace elseif (CS%eqdisc .eqv. .true.) then with the much simpler, clearer and equivalent elseif (CS%eqdisc) then. Also, this is another case where the indenting convention is being violated. (This convention really does make it MUCH easier for a human to read and understand what the code is doing, even if F90 compilers ignore the white space indents!)

Hallberg-NOAA · 2025-01-01T22:11:02Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+    if (bflux_c >= 0.0) then ! surface heating and neutral conditions
+    ! Equation 16 in Sane et al. 2024:
+    ! \frac{v}{u_*} = \frac{-c_9}{p_1 + c_{10} + \frac{c_{11}^2}{p_1 - c_{11}} }
+      p1 =  -(1.0/ust_c) * sqrt(abs(bflux_c)/absf_c) 


These expressions have lots of extra divides that are expensive but could be refactored out in mathematically equivalent expressions. A good compiler will do this, but only in optimized mode, meaning that these extra divides are likely to lead to unnecessary answer changes due to optimization. Please consider multiplying by reciprocals of common denominators and other steps to refactor these to eliminate the extra divides. If we don't do this now, we will just end up adding an answer date setting to do this later, but is is much more convenient for everyone if we do it the right way in the first place.

Hallberg-NOAA · 2025-01-01T22:12:17Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+    ust_c_I = 1.0 / ust_c
+
+    ! setting p1 here:
+    p1 =  abs(bflux_c * MLD_guess)**(1.0/3.0)


Please replace A**(1.0/3.0) with cuberoot(A) to avoid breaking dimensional consistency testing.

Hallberg-NOAA · 2025-01-01T22:15:05Z

src/parameterizations/vertical/MOM_energetic_PBL.F90

+
+  ! The coefficients used for machine learned diffusivity
+  ! c1 to c8 used for sigma_m,  
+  call get_param(param_file, mdl, "c1", CS%c1, &


Please replace all of these calls to get_param_real() for a long list of identically described nondimensional coefficients with a single call to get_param_array(), using the recently added defaults= argument to set the differing defaults for all of these coefficients.

aakashsane and others added 30 commits July 17, 2024 22:37

added the Machine Learned + empirically fitted diffusivity coefficien…

a6f1bc3

…ts for ePBL

Merge pull request #1 from aakashsane/dev/gfdl

60dd73c

updating ML_diffusivity to latest dev/gfdl

Merge branch 'NOAA-GFDL:dev/gfdl' into ML_diffusivity

6328056

Merge pull request #2 from aakashsane/ML_diffusivity

c77845b

Ml diffusivity

Update MOM_energetic_PBL.F90

3483057

replaced the sigma to z coord algorithm used to map neural network output with a simpler and better algorithm that finds the shape function on the hz vertical grid.

updated the algorithm used in interpolating between neural network to…

67e3474

… hz grid.

Update MOM_energetic_PBL.F90

5d79f16

some typo corrections

added type for hbl

c367e28

fixed typo

c025605

Delete src/parameterizations/vertical/Modified_MOM_energetic_PBL.F90

fcc96f3

extra unwanted file. deleted.

fixed another typo

43c3113

some typos

fa720d3

some other typo

0e91642

f_lower value changed to 0.1 deg.

e3eb879

f_lower is a lower cap on abs_f used inside equation for v_0. A cap is required to avoid singularity. Capping at any value below 1 deg is okay, solution is not sensitive. f_lower was tested for 1 deg and 0.1 deg. SST and MLD did not change.

Merge branch 'NOAA-GFDL:dev/gfdl' into dev/gfdl

57af2a7

Merge pull request #3 from aakashsane/dev/gfdl

01bb7c6

Merging latest dev/gfdl

Merge pull request #4 from aakashsane/dev/gfdl

f8da1ec

updating latest dev/gfdl with ML_diffusivity

fixed some typos

4fe384f

fixed some typos and removed unwanted variables

838304d

added comments to the code

9579165

deleted some unwanted files that were not part of original code

7fb2f80

changing the code to meet mom6 code style requirements

1f56fd5

Merge pull request #5 from aakashsane/dev/gfdl

4f7e3b7

updating

added the scales

c5a88e5

Merge branch 'ML_diffusivity' of github.com:aakashsane/MOM6 into ML_d…

3a35685

…iffusivity

corrected scales stuff

2e97964

fixed some issues pointed out by Raph

02ad656

Merge branch 'dev/gfdl' into ML_diffusivity

538cc5b

Delete src/parameterizations/vertical/.vscode/settings.json

f940b2e

deleted unwanted file

made some changes

13aec05

made some edits, changed the greater than and less than signs, etc.

992f266

aakashsane added 3 commits November 18, 2024 13:13

edited the comments in the subroutines

001273c

added comments in the subroutines

37089a3

Merge branch 'dev/gfdl' into ML_diffusivity

03a07b9

adcroft requested changes Nov 21, 2024

View reviewed changes

aakashsane added 4 commits November 21, 2024 11:50

Update MOM_energetic_PBL.F90

0c0309b

Added units to a missing variables (v0_dummy)

deleted .testing/tc4/Makefile

c11441f

Delete .testing/tc4/gen_data.dSYM/Contents/Info.plist

79414c5

file deleted

Delete .testing/tc4/gen_grid.dSYM/Contents/Info.plist

57a6e51

adcroft mentioned this pull request Nov 26, 2024

Adds a vector of default values to get_param_real_array() #760

Merged

aakashsane added 2 commits December 1, 2024 23:57

added a second subroutine for v_0. It takes boundary layer depth as i…

6d20d1f

…nput

made minor changes

821b83f

adcroft reviewed Dec 10, 2024

View reviewed changes

Hallberg-NOAA reviewed Jan 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Machine Learning based vertical diffusivity in EPBL mixing scheme used for ocean surface boundary layer #737

Machine Learning based vertical diffusivity in EPBL mixing scheme used for ocean surface boundary layer #737

aakashsane commented Oct 9, 2024 •

edited

Loading

codecov bot commented Oct 31, 2024

adcroft left a comment

adcroft Nov 21, 2024

aakashsane Nov 21, 2024 •

edited

Loading

adcroft commented Nov 25, 2024

adcroft Dec 10, 2024

aakashsane Dec 13, 2024

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025 •

edited

Loading

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025

Hallberg-NOAA Jan 1, 2025



		! variables for ML based diffusivity
		real :: v0_dummy ! Variable which get recycled, set equal to v_0

		else
		MixLen_shape(K) = CS%transLay_scale + (1.0 - CS%transLay_scale) * &

Machine Learning based vertical diffusivity in EPBL mixing scheme used for ocean surface boundary layer #737

Are you sure you want to change the base?

Machine Learning based vertical diffusivity in EPBL mixing scheme used for ocean surface boundary layer #737

Conversation

aakashsane commented Oct 9, 2024 • edited Loading

codecov bot commented Oct 31, 2024

Codecov Report

adcroft left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aakashsane Nov 21, 2024 • edited Loading

Choose a reason for hiding this comment

adcroft commented Nov 25, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Hallberg-NOAA Jan 1, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aakashsane commented Oct 9, 2024 •

edited

Loading

aakashsane Nov 21, 2024 •

edited

Loading

Hallberg-NOAA Jan 1, 2025 •

edited

Loading