Deterministic version of CUDA forces and stresses kernels #3684

mtaillefumier · 2024-04-18T16:06:37Z

This PR makes the calculation of the forces and stress deterministic on GPU. This alone does not make the DeePMD code deterministic by default, because TensorFlow must also be configured for determinism, either at runtime or during the initialization phase.

To obtain the same model parameters from run to run, export the following variables in the job script:

export TF_DETERMINISTIC_OPS=1
export TF_INTER_OP_PARALLELISM_THREADS=0
export TF_INTRA_OP_PARALLELISM_THREADS=0

Details of the changes:

- Remove the use of atomic operations in the forces and stress kernels.
- Use template programming to minimize code duplication, along with some minor refactoring.
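As a rough illustration of why removing atomic operations matters for determinism: floating-point addition is not associative, so the result of accumulating the same contributions depends on the order in which they are added. With atomic adds that order is decided by thread scheduling. The sketch below is plain CPU-side C++, not the kernel code from this PR, and `sum_in_order` is a hypothetical helper introduced only for this example.

```cpp
#include <cassert>
#include <vector>

// Hypothetical helper: add the contributions in the order given by `order`.
// With atomic adds on a GPU, this order depends on thread scheduling and can
// change from run to run.
float sum_in_order(const std::vector<float>& contrib,
                   const std::vector<int>& order) {
  assert(contrib.size() == order.size());
  float acc = 0.0f;
  for (int idx : order) {
    assert(idx >= 0 && idx < static_cast<int>(contrib.size()));
    acc += contrib[idx];  // rounding happens after every single addition
  }
  return acc;
}
```

Assuming IEEE-754 single precision and no fast-math flags, the contributions `{1e8f, 1.0f, -1e8f}` sum to `0` in the order `{0, 1, 2}` but to `1` in the order `{0, 2, 1}`, because `1e8f + 1.0f` rounds back to `1e8f`. A kernel that always accumulates in the same fixed order avoids this run-to-run variation.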

Authors :

(@mtaillefumier) M. Taillefumier (ETH Zurich / CSCS)
(@asedova) A. Sedova (ORNL)

njzjz · 2024-04-18T22:21:40Z

Please rebase the PR against the devel branch. We don't accept PRs on other branches.

wanghan-iapcm · 2024-04-19T02:22:50Z

source/lib/src/gpu/prod_force.cu

+    // search the index of the atom i in the local neighbor list of atom j
+    for (atom_id_position = 0; atom_id_position < nnei; atom_id_position++) {
+      if (nei_nei_list_[atom_id_position] == atom_id) {
+        break;
+      }
+    }


The index search has complexity of order N_nei, a cost that is not present in the implementation based on atomic operations. Does it have an observable effect on the performance of the prod_force operator?

I could not observe any effect beyond fluctuations of about 5% in performance in my miniapp, and the timings involved are on the order of microseconds. The gain is that these operators become deterministic by default, which is worth the potential penalty of 5% (or less) introduced by this code change.
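For readers following the exchange above, here is a minimal CPU-side C++ sketch of the two accumulation strategies being compared. It is not the code from `prod_force.cu`. The flat layout, the scalar contributions, and the names `scatter_forces` and `gather_forces` are assumptions made for illustration. It also assumes that every neighbor is a local atom and that the neighbor list is symmetric (if j is a neighbor of i, then i is a neighbor of j).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Assumed layout (hypothetical): nlist[j * nnei + k] is the k-th neighbor of
// atom j, with -1 marking an empty slot, and contrib[j * nnei + k] is the
// scalar contribution that atom j's environment makes to that neighbor.

// Scatter: each atom j pushes contributions to its neighbors. On a GPU the
// "+=" would be an atomic add, so the summation order is not fixed.
std::vector<double> scatter_forces(const std::vector<int>& nlist,
                                   const std::vector<double>& contrib,
                                   int nloc, int nnei) {
  assert(nlist.size() == static_cast<std::size_t>(nloc) * nnei);
  assert(contrib.size() == nlist.size());
  std::vector<double> force(nloc, 0.0);
  for (int j = 0; j < nloc; ++j) {
    for (int k = 0; k < nnei; ++k) {
      const int i = nlist[j * nnei + k];
      if (i >= 0) force[i] += contrib[j * nnei + k];
    }
  }
  return force;
}

// Gather: each atom i pulls its own contributions in a fixed order. The price
// is the O(nnei) search for i inside the neighbor list of each neighbor j.
std::vector<double> gather_forces(const std::vector<int>& nlist,
                                  const std::vector<double>& contrib,
                                  int nloc, int nnei) {
  assert(nlist.size() == static_cast<std::size_t>(nloc) * nnei);
  assert(contrib.size() == nlist.size());
  std::vector<double> force(nloc, 0.0);
  for (int i = 0; i < nloc; ++i) {
    for (int s = 0; s < nnei; ++s) {
      const int j = nlist[i * nnei + s];
      if (j < 0) continue;  // empty slot
      // search the index of atom i in the neighbor list of atom j
      for (int p = 0; p < nnei; ++p) {
        if (nlist[j * nnei + p] == i) {
          force[i] += contrib[j * nnei + p];
          break;
        }
      }
    }
  }
  return force;
}
```

In the gather version only one writer ever touches `force[i]`, and it adds the contributions in the order of atom i's own neighbor list, so no atomics are needed and the result does not depend on scheduling. The inner search is the extra N_nei cost raised in the review question.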

mtaillefumier · 2024-04-19T07:55:01Z

> Please rebase the PR against the devel branch. We don't accept PRs on other branches.

Sorry about this. It is not possible to change the source branch without creating a new PR. I can do it now, or after the discussion so that the same code is not reviewed twice.

njzjz · 2024-04-19T14:01:02Z

> Sorry for this. It is not possible to change the source branch without creating a new PR. I can do it now or after the discussion to avoid reviewing the same code twice.

I don't think we can review the code until all unit tests pass. The current PR blocks the CI from being triggered.

mtaillefumier · 2024-04-19T14:37:31Z

I opened a new PR, #3693, against the devel branch, starting from the latest devel as well. Same title, same content.

Deterministic version of CUDA forces and stresses kernels

ca88aab

- Remove the use of atomic operations in the forces and stress kernels.
- Use template programming to minimize code duplication.

Authors:
- Mathieu Taillefumier (ETH Zurich / CSCS)
- Ada Sedova (ORNL)

github-actions bot added Core CUDA ROCM Docs labels Apr 18, 2024

[pre-commit.ci] auto fixes from pre-commit.com hooks

093e9e3

for more information, see https://pre-commit.ci

mtaillefumier mentioned this pull request Apr 18, 2024

Reproducibility of LAMMPS run with DP potential #3270

Open

njzjz changed the base branch from master to devel April 18, 2024 17:46

njzjz added the Test CUDA Trigger test CUDA workflow label Apr 18, 2024

github-actions bot removed the Test CUDA Trigger test CUDA workflow label Apr 18, 2024

wanghan-iapcm requested review from denghuilu and njzjz April 19, 2024 02:00

wanghan-iapcm reviewed Apr 19, 2024

mtaillefumier requested a review from wanghan-iapcm April 19, 2024 12:55

mtaillefumier mentioned this pull request Apr 19, 2024

Deterministic version of CUDA forces and stresses kernels #3693

Closed

mtaillefumier closed this Apr 19, 2024
