Refactor Onnx runtime Server to only use public APIs #1271
Conversation
This is a big PR and hard to review because it seems to be an amalgamation of several things. It would be easier to review if you did this in 3 PRs:
I think it would also speed up the review process.
onnxruntime/server/executor.cc (outdated)

      auto logger = env_->GetLogger(request_id_);

      size_t cpu_tensor_length = 0;
-     auto status = onnxruntime::utils::GetSizeInBytesFromTensorProto<0>(input_tensor, &cpu_tensor_length);
+     auto status = onnxruntime::server::GetSizeInBytesFromTensorProto<0>(input_tensor, &cpu_tensor_length);
There is OrtGetTensorMemSizeInBytesFromTensorProto.
I chose not to use those for two reasons:
- Those work on serialized protobufs, and for us to support JSON and gRPC we need to be able to control the deserialization.
- I got the impression from Changming that we should treat the TensorProto as internal to ORT and not depend on it specifically. @snnn ?
onnxruntime/server/executor.cc (outdated)

-     status = onnxruntime::utils::TensorProtoToMLValue(onnxruntime::Env::Default(), nullptr, input_tensor,
-                                                       onnxruntime::MemBuffer(buf, cpu_tensor_length, *cpu_allocator_info),
-                                                       ml_value, deleter);
+     status = onnxruntime::server::TensorProtoToMLValue(input_tensor,
There is OrtTensorProtoToOrtValue.
Same reasoning as above.
-     for (size_t i = 0, count = 1 + ((tensor.Size() - 1) / sizeof(int32_t)); i < count; ++i) {
-       tensor_proto.add_int32_data(i32data[i]);
+     for (size_t i = 0, count = elem_count; i < count; ++i) {
+       tensor_proto.add_int32_data(reinterpret_cast<const uint16_t*>(data)[i]);
You mean uint32_t here?
No, float16 is supposed to be bit-cast to uint16. From onnx-ml.proto:

    // For int32, uint8, int8, uint16, int16, bool, and float16 values
    // float16 values must be bit-wise converted to an uint16_t prior
    // to writing to the buffer.
    // When this field is present, the data_type field MUST be
    // INT32, INT16, INT8, UINT16, UINT8, BOOL, or FLOAT16
    repeated int32 int32_data = 5 [packed = true];
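To make the packing rule above concrete, here is a minimal, self-contained sketch (the helper name `PackFloat16AsInt32Data` is hypothetical, not part of the PR): each float16 element's raw bits are treated as a `uint16_t` and widened into the `int32_data` field without any numeric conversion.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper illustrating the onnx-ml.proto rule quoted above:
// float16 values are bit-wise reinterpreted as uint16_t, then stored in
// the (packed) int32_data field. No floating-point conversion occurs.
std::vector<int32_t> PackFloat16AsInt32Data(const uint16_t* fp16_bits, size_t elem_count) {
  std::vector<int32_t> int32_data;
  int32_data.reserve(elem_count);
  for (size_t i = 0; i < elem_count; ++i) {
    // Widen the raw 16-bit pattern to int32; the bit pattern is preserved.
    int32_data.push_back(static_cast<int32_t>(fp16_bits[i]));
  }
  return int32_data;
}
```

This is why the diff casts the buffer with `reinterpret_cast<const uint16_t*>(data)` rather than reading it as floats.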
Should I maybe add a comment?
-     if (!status.IsOK()) {
-       logger->error("GetSizeInBytesFromTensorProto() failed. Error Message: {}", status.ToString());
-       return GenerateProtobufStatus(status, "GetSizeInBytesFromTensorProto() failed: " + status.ToString());
+     try {
Why don't we use the return value here? try/catch will hurt performance.
He is using try/catch because GetSizeInBytesFromTensorProto doesn't return anything.
The C++ API uses exceptions, so I decided to use exceptions for the converter as well. IIRC, C++ exceptions are generally zero-cost on the happy path, so this will only hurt performance when a call actually fails.
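A minimal sketch of the exception-based style being discussed, under stated assumptions: the function names (`GetTensorSizeInBytesOrThrow`, `DescribeTensor`) are hypothetical stand-ins, not the PR's actual helpers. The converter throws on bad input instead of returning a status object, so the success path carries no error-propagation branches; the caller converts the exception to an error response only when something goes wrong.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>

// Hypothetical converter in the exception-throwing style: no Status return,
// failures surface as exceptions. The happy path is a plain multiply.
size_t GetTensorSizeInBytesOrThrow(size_t elem_count, size_t elem_size) {
  if (elem_size != 0 && elem_count > SIZE_MAX / elem_size) {
    throw std::overflow_error("tensor byte size overflows size_t");
  }
  return elem_count * elem_size;
}

// Caller mirrors the try/catch shape shown in the diff above: translate the
// exception into an error message only on the failure path.
std::string DescribeTensor(size_t elem_count, size_t elem_size) {
  try {
    return "size=" + std::to_string(GetTensorSizeInBytesOrThrow(elem_count, elem_size));
  } catch (const std::exception& e) {
    return std::string("error: ") + e.what();
  }
}
```

With table-based unwinding (the common "zero-cost" model on mainstream compilers), the try block itself adds no instructions on the non-throwing path; the cost is paid only when an exception propagates.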
/azp run
Azure Pipelines successfully started running 22 pipeline(s).
Description: Refactors the ONNX Runtime server to use only public APIs. This is the first step toward switching to dynamic linking.
Motivation and Context
The ONNX Runtime server currently must be statically linked against ONNX Runtime. It also uses internal (non-stable) APIs, which limits iteration speed. This PR moves to using only public APIs so the two can be more loosely coupled.