Enable Conv fusion optimizations in optimizeForIdeep #9255

gujinghui · 2018-07-09T07:28:23Z

Enable fusion for IDEEP in optimizeForIdeep
including Conv+ReLU, Conv+Sum, Conv+Sum+ReLU, Conv+BN

gujinghui · 2018-07-09T09:47:22Z

yinghai

Thanks for splitting it. Looks good overall. I have 2 minor comments regarding the interface.

caffe2/opt/converter.cc

@@ -101,6 +101,14 @@ std::vector<int> getKernelShape(std::map<std::string, caffe2::Argument> argMap)
  return kernelShape;
 }

+int getGroup(std::map<std::string, caffe2::Argument> argMap) {


caffe2/opt/optimize_ideep.cc

+}
+
+void OptimizeForIdeep(repr::NNModule* nn, caffe2::Workspace* ws, bool training_mode) {
+  if (training_mode) {


gujinghui · 2018-07-11T07:51:39Z

@yinghai any other concerns?

facebook-github-bot

@yinghai has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

yinghai

Please clang-format your code.

caffe2/opt/optimize_ideep.cc

+    return nullptr;
+  }
+
+  return dyn_cast<Caffe2Annotation>(annotation)->getOperatorDef();


caffe2/opt/optimize_ideep.cc

-      return false;
+    auto convOutput = repr::nn::getOutputs(convNode).front();
+    auto consumers = repr::nn::getConsumers(convOutput);
+    // convOutput is NOT referenced by sequencial ops after BN.


caffe2/opt/optimize_ideep.cc

+        break;
+      }
+    }
+    // Sum inputs should not be referenced by sequencial ops.


caffe2/opt/converter.cc

@@ -101,6 +101,14 @@ std::vector<int> getKernelShape(std::map<std::string, caffe2::Argument> argMap)
  return kernelShape;
 }

+int getGroup(std::map<std::string, caffe2::Argument> argMap) {
+  if (argMap.count("group")) {
+    assert(argMap["group"].has_i() && "Invalid group argument");


caffe2/opt/converter.cc

@@ -101,6 +101,14 @@ std::vector<int> getKernelShape(std::map<std::string, caffe2::Argument> argMap)
  return kernelShape;
 }

+int getGroup(std::map<std::string, caffe2::Argument> argMap) {


Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

gujinghui · 2018-07-12T02:55:52Z

@yinghai
code in convert.cc all use 'pass by value'.
Or, we have totally different code base?
Or, we'd better fix all of them...

Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

yinghai

Let's have a separate pass to fix the pass-by-value issue.

facebook-github-bot

@yinghai has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

gujinghui · 2018-07-13T09:08:36Z

@yinghai
This PR is under evaluation? When will it be merged?
Another patch depends on this one. Pls inform me if this is merged.
Thanks.

yinghai

Sorry, Let's use CAFFE_ENFORCE

caffe2/opt/converter.cc

@@ -42,6 +42,14 @@ std::vector<int> getDilations(std::map<std::string, caffe2::Argument> argMap) {
  return dilations;
 }

+int getGroup(std::map<std::string, caffe2::Argument>& argMap) {
+  if (argMap.count("group")) {
+    CAFFE_ENFORCE(argMap["group"].has_i() && "Invalid group argument");


caffe2/opt/optimize_ideep.cc

+
+Blob *getBlob(repr::NNGraph::NodeRef node, caffe2::Workspace *ws) {
+  auto tensor = repr::nn::get<repr::Tensor>(node);
+  assert(ws->HasBlob(tensor->getName()) && "Blob not in workspace");


Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

gujinghui · 2018-07-15T13:51:20Z

@yinghai
fixed

facebook-github-bot

@yinghai has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: Enable fusion for IDEEP in optimizeForIdeep including Conv+ReLU, Conv+Sum, Conv+Sum+ReLU, Conv+BN Pull Request resolved: pytorch#9255 Reviewed By: bddppq Differential Revision: D8809030 Pulled By: yinghai fbshipit-source-id: af30bad3b96cb965bd26a4dfa810370faec4bb88

gujinghui mentioned this pull request Jul 9, 2018

[Caffe2]Enable fusion for IDEEP in optimizeForIdeep #8105

Closed

yinghai reviewed Jul 9, 2018

View reviewed changes

weiyangfb added the caffe2 label Jul 10, 2018

facebook-github-bot reviewed Jul 11, 2018

View reviewed changes

yinghai suggested changes Jul 11, 2018

View reviewed changes

gujinghui added 2 commits July 12, 2018 10:13

Enable Conv fusion optimizations in optimizeForIdeep

9eecae2

Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

fix test case of conv_fusion op

1d801ed

Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

Refine code by clang-format

90213c0

Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

gujinghui force-pushed the ideep_fusion branch from 17a7679 to 90213c0 Compare July 12, 2018 03:22

yinghai approved these changes Jul 12, 2018

View reviewed changes

facebook-github-bot reviewed Jul 12, 2018

View reviewed changes

yinghai suggested changes Jul 14, 2018

View reviewed changes

Use CAFFE_ENFORCE, instead of assert

aedefb8

Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>

facebook-github-bot reviewed Jul 16, 2018

View reviewed changes

yinghai approved these changes Jul 16, 2018

View reviewed changes

facebook-github-bot closed this in e8b8c38 Jul 17, 2018

ezyang added open source merged labels Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable Conv fusion optimizations in optimizeForIdeep #9255

Enable Conv fusion optimizations in optimizeForIdeep #9255

gujinghui commented Jul 9, 2018

gujinghui commented Jul 9, 2018

yinghai left a comment

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 11, 2018

facebook-github-bot left a comment

yinghai left a comment

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 12, 2018

yinghai left a comment

facebook-github-bot left a comment

gujinghui commented Jul 13, 2018

yinghai left a comment

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 15, 2018

facebook-github-bot left a comment

Enable Conv fusion optimizations in optimizeForIdeep #9255

Enable Conv fusion optimizations in optimizeForIdeep #9255

Conversation

gujinghui commented Jul 9, 2018

gujinghui commented Jul 9, 2018

yinghai left a comment

Choose a reason for hiding this comment

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 11, 2018

facebook-github-bot left a comment

Choose a reason for hiding this comment

yinghai left a comment

Choose a reason for hiding this comment

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 12, 2018

yinghai left a comment

Choose a reason for hiding this comment

facebook-github-bot left a comment

Choose a reason for hiding this comment

gujinghui commented Jul 13, 2018

yinghai left a comment

Choose a reason for hiding this comment

This comment was marked as off-topic.

This comment was marked as off-topic.

gujinghui commented Jul 15, 2018

facebook-github-bot left a comment

Choose a reason for hiding this comment