GPU implementation of ocean baroclinic velocity tendencies #572
philipwjones wants to merge 1 commit into MPAS-Dev:ocean/develop from philipwjones:ocean/GPUvel
Conversation
This commit is a substantial modification to all the ocean baroclinic velocity tendencies and includes:
- a complete GPU implementation in which all tendencies are computed on the accelerator (using OpenACC) and all data is transferred at the top-level driver (ocn_tend_vel); see the sketch after this list
- a number of CPU optimizations performed along the way
- elimination of meshPool, configPool
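
For context, here is a minimal, self-contained sketch (not MPAS-Ocean source; array names, sizes, and the coefficient are illustrative) of the pattern described above: a single data region at the top-level driver, with each tendency term computed in an OpenACC parallel loop on the device.

```fortran
! Minimal sketch of the pattern described above (not MPAS-Ocean source).
! Data is moved once at the "driver" level; each tendency term is an
! OpenACC parallel loop executed on the device. Names are illustrative.
program tend_vel_sketch
   implicit none
   integer, parameter :: nEdges = 1024, nVertLevels = 60
   real(8), allocatable :: normalVelocity(:,:), tend(:,:)
   real(8), parameter :: coef = 0.1d0
   integer :: iEdge, k

   allocate(normalVelocity(nVertLevels, nEdges), tend(nVertLevels, nEdges))
   normalVelocity = 1.0d0
   tend = 0.0d0

   ! Single host<->device transfer, analogous to the top-level driver
   !$acc data copyin(normalVelocity) copy(tend)

   ! One tendency term; the real driver would call several such loops
   !$acc parallel loop collapse(2) present(normalVelocity, tend)
   do iEdge = 1, nEdges
      do k = 1, nVertLevels
         tend(k, iEdge) = tend(k, iEdge) - coef*normalVelocity(k, iEdge)
      end do
   end do

   !$acc end data

   print *, 'tend(1,1) =', tend(1, 1)
   deallocate(normalVelocity, tend)
end program tend_vel_sketch
```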
Quick questions @philipwjones:
cc @sbrus89 and Nairita
I was mistaken.
@pwolfram If you are running on GPUs (OPENACC enabled) and with tidal forcing, this will exit with an error. It still works for CPU-only runs. This is temporary - I am trying to get a lot of GPU code integrated before the end of the month and an ECP deliverable. But the way the tidal forcing modifies zMid and moves pointers back and forth interferes with the copies on the GPU, and it was going to take some thought on how to manage that in an efficient way. Sorry - I will get back to it once I finish integrating other stuff.
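
A hedged sketch of the kind of conflict described here, under the assumption that zMid gets a device copy at the top-level driver (names and sizes are hypothetical, not the actual MPAS routines): once host-side code such as the tidal forcing modifies the array between device kernels, the device copy is stale and has to be refreshed with an explicit update, which adds exactly the transfer traffic the driver-level copy was meant to avoid.

```fortran
! Illustrative sketch (not MPAS code) of why host-side changes to zMid
! conflict with a device copy made at the top-level driver. Names are
! hypothetical.
program zmid_sync_sketch
   implicit none
   integer, parameter :: n = 16
   real(8), allocatable :: zMid(:)
   integer :: i

   allocate(zMid(n))
   zMid = 1.0d0

   ! Device copy created once, as in the driver-level strategy of this PR
   !$acc enter data copyin(zMid)

   ! Host-only code (e.g. the tidal forcing) modifies zMid between kernels...
   zMid = zMid + 0.5d0

   ! ...so the device copy is now stale; keeping things consistent needs an
   ! explicit transfer, which is the extra traffic the PR tries to avoid
   !$acc update device(zMid)

   !$acc parallel loop present(zMid)
   do i = 1, n
      zMid(i) = 2.0d0*zMid(i)
   end do

   !$acc exit data copyout(zMid)
   print *, 'zMid(1) =', zMid(1)
   deallocate(zMid)
end program zmid_sync_sketch
```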
@philipwjones, we should have removed changes to
Tried to rebase this yesterday, but there has been some reorganization of code since this was first submitted, and the rebase got a little ugly and I don't have high confidence in it. So... I will probably do a fresh checkout of a more recent version and re-implement the GPU mods, maybe incorporating some new ideas from Az first. Will update the branch when this is done.
That's probably a good idea. We just had an issue where the merge conflict resolution in one of my PRs reverted prior changes (bugfix in #672), so starting over with a fresh checkout in a few subroutines might be better. |
Replaced by #772 and another future PR. |
This PR contains changes similar to those in at least #513, #536, and #569, so it will need to be modified/rebased once those are merged.
Performance speedup for this part of the code was 2.8x using 2 GPUs on Summit compared with an 8-rank MPI-only case; details of performance will depend on configuration. CPU performance improved by ~20% in the same 8-rank QU240 test. More speedup is expected as we migrate more data to the device elsewhere. The computational part alone, excluding data transfer, showed a 10x speedup.
This is not quite bit-for-bit (b4b) due to changes in the order of operations in a couple of routines, and it is not b4b on the accelerator (different chip architecture); in both cases the differences are at roundoff level. Tested most of the options (e.g. for hmix, pgrad), though since it was tested in standalone QU240, the forcing routines really didn't get much of a workout.
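
As a small illustration of why a change in summation order produces only roundoff-level differences, here is a standalone snippet (not from the PR) showing single-precision non-associativity:

```fortran
! Standalone demonstration (not from the PR) that reordering a sum
! changes a single-precision result only at roundoff level.
program roundoff_demo
   implicit none
   real :: a, b, c
   a = 1.0e8
   b = -1.0e8
   c = 1.0
   ! (a + b) + c keeps c, while a + (b + c) loses c because |b| >> c in
   ! single precision; the results differ by 1.0, which is roundoff
   ! relative to the 1.0e8 operands
   print *, '(a + b) + c =', (a + b) + c
   print *, 'a + (b + c) =', a + (b + c)
end program roundoff_demo
```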