Fix max_pool2d perf regression #41174

xwang233 · 2020-07-09T05:09:33Z

The two pointer variables ptr_top_diff and ptr_top_mask were introduced in #38953. Some end-to-end tests showed training performance regression due to this change. The performance is restored after removing the two pointer variables, and adding offset directly below in the indexing [ ] calculations.

See PR change https://github.com/pytorch/pytorch/pull/38953/files#diff-8085d370f4e98295074a51b8a1f829e9R187-R188

pytorch/aten/src/ATen/native/cuda/DilatedMaxPool2d.cu

Lines 186 to 195 in e4a3c58

    
           int offset = (n * channels + c) * pooled_height * pooled_width; 
        
           const scalar_t* ptr_top_diff = top_diff + offset; 
        
           const int64_t* ptr_top_mask = top_mask + offset; 
        
           for (int ph = phstart; ph < phend; ++ph) { 
        
             for (int pw = pwstart; pw < pwend; ++pw) { 
        
               if (ptr_top_mask[ph * pooled_width + pw] == h * width + w) { 
        
                 gradient += ScalarConvert<scalar_t, accscalar_t>::to(ptr_top_diff[ph * pooled_width + pw]); 
        
               } 
        
             } 
        
           }

xwang233 · 2020-07-09T05:09:56Z

cc @ptrblck @mcarilli

facebook-github-bot

@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-07-09T22:15:09Z

@ngimel merged this pull request in 2cf31fb.

fix max_pool2d perf regression

974a281

xwang233 requested a review from ngimel July 9, 2020 05:10

pytorchbot added the open source label Jul 9, 2020

ngimel approved these changes Jul 9, 2020

View reviewed changes

facebook-github-bot reviewed Jul 9, 2020

View reviewed changes

facebook-github-bot closed this in 2cf31fb Jul 9, 2020

facebook-github-bot added the merged label Jul 9, 2020

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix max_pool2d perf regression #41174

Fix max_pool2d perf regression #41174

Uh oh!

xwang233 commented Jul 9, 2020 •

edited

Loading

Uh oh!

xwang233 commented Jul 9, 2020

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot commented Jul 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	int offset = (n * channels + c) * pooled_height * pooled_width;
	const scalar_t* ptr_top_diff = top_diff + offset;
	const int64_t* ptr_top_mask = top_mask + offset;
	for (int ph = phstart; ph < phend; ++ph) {
	for (int pw = pwstart; pw < pwend; ++pw) {
	if (ptr_top_mask[ph * pooled_width + pw] == h * width + w) {
	gradient += ScalarConvert<scalar_t, accscalar_t>::to(ptr_top_diff[ph * pooled_width + pw]);
	}
	}
	}

Fix max_pool2d perf regression #41174

Fix max_pool2d perf regression #41174

Uh oh!

Conversation

xwang233 commented Jul 9, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xwang233 commented Jul 9, 2020

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jul 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

xwang233 commented Jul 9, 2020 •

edited

Loading