Mpi v2 #330

Merged (28 commits from mpi_v2 into develop on Jan 3, 2025)

Conversation

ptrbortolotti (Collaborator)

Purpose

MPI calls to WEIS have always been fragile and have always required expert users for a successful setup.
This PR improves the usability of WEIS by simplifying the setup of MPI calls. The user no longer needs to count the number of design variables (for finite differencing) nor the number of OpenFAST calls. These two numbers still need to be estimated, but this is now done in a pre-call to WEIS that can be executed quickly right before the real call. Notably, the counting of the DVs is done by OpenMDAO and no longer by our own dedicated Python function (simpler maintenance for us). Once available, the two numbers are passed through the modeling options and are used by WEIS to allocate the processors for finite differencing (visible to OpenMDAO) and the processors for OpenFAST calls (hidden from OpenMDAO).
A new documentation page shows how to make the calls to WEIS. Nothing changes for users running WEIS on a single processor. New tests have been added.
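
A minimal sketch of the resulting two-step pattern is below. run_weis, the --preMPI flag, and the nFD modeling option all appear in this PR; the import path, file names, and the exact location of the nOFp key are assumptions for illustration only.

from weis.glue_code.runWEIS import run_weis  # assumed import path

fname_wt_input = "geometry.yaml"                  # placeholder geometry file
fname_modeling_options = "modeling_options.yaml"  # placeholder modeling options
fname_analysis_options = "analysis_options.yaml"  # placeholder analysis options

# Step 1: a cheap sizing call (in practice the driver script is run with the
# --preMPI flag). OpenMDAO counts the design-variable entries and WEIS counts
# the OpenFAST cases; no expensive analysis is executed.
wt_opt, modeling_options, opt_options = run_weis(
    fname_wt_input, fname_modeling_options, fname_analysis_options
)
ofc = modeling_options['General']['openfast_configuration']
print("FD points (nFD):", ofc['nFD'])
print("parallel OpenFAST runs (nOFp):", ofc['nOFp'])

# Step 2: the real optimization, launched under MPI with a processor count
# chosen from the two numbers above, e.g. mpiexec -np <N> python weis_driver.py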

Many thanks to @johnjasa @dzalkind @gbarter. This PR is a team effort

Type of change

What type of change is it?

  • Bugfix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (non-backwards-compatible fix or feature)
  • Code style update (formatting, renaming)
  • Refactoring (no functional changes, no API changes)
  • Documentation update
  • Maintenance update
  • Other (please describe)

Testing

GitHub Actions must be executed once NREL/ROSCO#406 is merged.

Checklist

  • I have run existing tests which pass locally with my changes
  • I have added new tests or examples that prove my fix is effective or that my feature works
  • I have added necessary documentation

@ptrbortolotti (Collaborator, Author)

ok, this is now ready for review!

Collaborator

@ptrbortolotti, this was added to test the model_only flag. Do you still want to get rid of it?

ptrbortolotti (Collaborator, Author)

oh I missed that... I guess we could reinstate it? I don't have strong opinions...

max_parallel_OF_runs = max([int(np.floor((max_cores - n_DV) / n_DV)), 1])
n_OF_runs_parallel = min([int(n_OF_runs), max_parallel_OF_runs])
if not prepMPI:
    nFD = modeling_options['General']['openfast_configuration']['nFD']
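
For concreteness, a quick worked example of the allocation above with made-up numbers (104 total MPI ranks, 8 finite-difference points, 24 OpenFAST cases):

import numpy as np

max_cores = 104  # hypothetical total number of MPI ranks requested
n_DV = 8         # hypothetical number of finite-difference points
n_OF_runs = 24   # hypothetical number of OpenFAST cases

# Ranks left after reserving one per FD point, split evenly across the FD points
max_parallel_OF_runs = max([int(np.floor((max_cores - n_DV) / n_DV)), 1])  # (104 - 8) / 8 -> 12
n_OF_runs_parallel = min([int(n_OF_runs), max_parallel_OF_runs])           # min(24, 12) -> 12

With these numbers, 8 ranks stay visible to OpenMDAO for finite differencing and up to 12 OpenFAST workers run under each of them (8 + 8 * 12 = 104 ranks in total), matching the visible/hidden split described in the PR summary.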
Collaborator

Does the user now need to enter nFD and nOFp into the modeling options?

ptrbortolotti (Collaborator, Author)

I believe this comment is now outdated. The users can provide those two inputs, but it's a lot safer to let WEIS compute them with the --preMPI flag.

ptrbortolotti (Collaborator, Author)

so the short answer is no, the user does not need to provide those two inputs

@dzalkind (Collaborator)

I tried to hide some of the argument and modeling option code we'll repeat in a new file here: 5a9bc96

Do we want to apply the same arguments to all of our examples eventually?


tt = time.time()
wt_opt, modeling_options, opt_options = run_weis(fname_wt_input, fname_modeling_options, fname_analysis_options)
maxnP = get_max_procs(args)

Collaborator

Do we want the double-run when MPI is used to be the default for all examples? Or should users do a dry-run and update the modeling options themselves? We're demonstrating both here, which is also fine.

ptrbortolotti (Collaborator, Author)

I found both use cases useful. Expert users might want to know the size of their problem, whereas newer users may just want to run.

@dzalkind (Collaborator)

Question for all: If we have a DV that's an array, should there be a finite differencing step for each index in the array?

This page seems to suggest not, but I don't think it's representative of the use case I suggest because the array value is a connected input/output of the whole group.

@gbarter (Member) commented Dec 30, 2024

Question for all: If we have a DV that's an array, should there be a finite differencing step for each index in the array?

This page seems to suggest not, but I don't think it's representative of the use case I suggest because the array value is a connected input/output of the whole group.

I thought there was. We have at least anecdotal observations that the computational workload in an optimization scales directly (and nonlinearly) with the number of points in a design vector (blade chord/twist, tower diameter/thickness). We had previously counted the DVs based on total vector length. I'd be surprised if there was one finite difference step for the entire variable vector in one shot.

@dzalkind (Collaborator)

Question for all: If we have a DV that's an array, should there be a finite differencing step for each index in the array?
This page seems to suggest not, but I don't think it's representative of the use case I suggest because the array value is a connected input/output of the whole group.

I thought there was. We have at least anecdotal observations that the computational workload in an optimization scales directly (and nonlinearly) with the number of points in a design vector (blade chord/twist, tower diameter/thickness). We had previously counted the DVs based on total vector length. I'd be surprised if there was one finite difference step for the entire variable vector in one shot.

This was my understanding, as well. I made this update to account for array DVs: 7aedb7c

@ptrbortolotti (Collaborator, Author)

I tried to hide some of the argument and modeling option code we'll repeat in a new file here: 5a9bc96

Do we want to apply the same arguments to all of our examples eventually?

sure, although the documentation page only talks about examples 02 and 03

@dzalkind (Collaborator) commented Jan 2, 2025

Thanks for this update @ptrbortolotti !!

I'm good to merge this when you are.

I don't think we necessarily need to apply the preMPI arguments to all the examples. I'm okay testing it out on some of our favorite examples and refining the interface from there.

@johnjasa (Contributor) commented Jan 2, 2025

Question for all: If we have a DV that's an array, should there be a finite differencing step for each index in the array?
This page seems to suggest not, but I don't think it's representative of the use case I suggest because the array value is a connected input/output of the whole group.

I thought there was. We have at least anecdotal observations that the computational workload in an optimization scales directly (and nonlinearly) with the number of points in a design vector (blade chord/twist, tower diameter/thickness). We had previously counted the DVs based on total vector length. I'd be surprised if there was one finite difference step for the entire variable vector in one shot.

This was my understanding, as well. I made this update to account for array DVs: 7aedb7c

Correct, there's an FD step for each entry in the DV array so Dan's change correctly accounts for that.

If you're using coloring, this number might be reduced: when there are independent DVs, the FD vector can be perturbed in multiple indices simultaneously (see here for more info). WEIS doesn't use coloring, and I don't think it's worth accounting for that here, as most of the DVs are not independent in WEIS design cases.
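
As a small, self-contained illustration of the per-entry convention (a generic OpenMDAO problem, not WEIS code; names and sizes are made up): an array design variable contributes one forward-difference point per entry, so the total is the summed length of all design vectors.

import numpy as np
import openmdao.api as om

prob = om.Problem()
prob.model.add_subsystem('comp', om.ExecComp('y = sum(chord) + t',
                                             chord=np.ones(8), t=1.0))
prob.model.add_design_var('comp.chord', lower=0.5, upper=2.0)  # array DV with 8 entries
prob.model.add_design_var('comp.t', lower=0.5, upper=2.0)      # scalar DV
prob.model.add_objective('comp.y')
prob.setup()
prob.run_model()

# One FD point per entry of each design vector: 8 (chord) + 1 (t) = 9
n_fd = sum(prob.get_val(name).size for name in ('comp.chord', 'comp.t'))
print(n_fd)  # 9

Coloring, as noted above, could shrink that count when DV entries are independent, but WEIS does not use it.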

@johnjasa (Contributor) commented Jan 2, 2025

I like this PR, @ptrbortolotti et al! I went through it and the workflow, examples, and docs all make sense to me. At a minimum, the addition of the run_in_parallel doc is very valuable, not to mention the rest of this PR.

ptrbortolotti merged commit 29fcb49 into develop on Jan 3, 2025
36 checks passed
ptrbortolotti deleted the mpi_v2 branch on January 3, 2025 at 19:26