8000 CuDNN 7 grouped convolution by kouyoumin · Pull Request #5879 · BVLC/caffe · GitHub
Open · wants to merge 4 commits into master

Conversation

kouyoumin

Uses cuDNN 7's native grouped convolution instead of a for loop over groups.

@fengziyong

@kouyoumin Have you tested the forward time?
I tried cuDNN v7 before, but I found it was slower than the original implementation in group mode.

@Noiredd
Member
Noiredd commented Nov 2, 2017

I have restarted the tests that failed prior to #5973; they pass now. However, I'm not sure whether Travis is even able to test cuDNN 7 right now. Could you take a look at #5972, where the dependencies script was also modified, and compare?

Also, this needs a squash before merging, but for review purposes it's good as is.

@twmht
Contributor
twmht commented Nov 3, 2017

Not fast enough.

Has anyone tested this on MobileNet?

@Noiredd
Member
Noiredd commented Nov 7, 2017

@kouyoumin Please note before rebasing that #5972 modified cudnn.hpp, adding the v7 cases.

Member
@Noiredd left a comment

@kouyoumin I did more thorough testing of this PR: from what I see (both on synthetic benchmark nets, like a 1x256x256x256 cube convolved with 5x5 filters in a group of 256, and on more realistic nets), it does not seem to be faster (on my old GTX TITAN Z), but the RAM saving is quite significant (from 4% to almost 60% - the larger the group, the better). Please take a look at the comments I left in the code review, squash and rebase, and I will merge this.

cudnn::dataType<Dtype>::one,
top_descs_[i], top_diff + top_offset_ * g,
cudnn::dataType<Dtype>::one,
bias_desc_, bias_diff + bias_offset_ * g));
#else
CUDNN_CHECK(cudnnConvolutionBackwardBias(handle_[g],

Shouldn't we leave CUDNN_CHECK(cudnnConvolutionBackwardBias(handle_[0*this->group_ + g], here?

case CUDNN_STATUS_RUNTIME_IN_PROGRESS:
return "CUDNN_STATUS_RUNTIME_IN_PROGRESS";
case CUDNN_STATUS_RUNTIME_FP_OVERFLOW:
return "CUDNN_STATUS_RUNTIME_FP_OVERFLOW";

Adding this fragment is no longer necessary (since #5972); please remove it while rebasing.

4 participants