Affinitize server call notification #6612

ctiller · 2016-05-17T00:23:58Z

Takes #6149 and extends it to also keep call request queue affinity with the underlying channel.

Improves throughput benchmarks somewhere between 20-90% (finally we see the benefit from the initial work!)

Some refactoring left to get tests building, and no doubt some bugs left.

Also built on #6603, #6585

…s makes writing certain test cases (like hybrid_end2end tests) easier

listening completion queue (i.e frequently polled)

…finity

…o affine

flag to be set to true on the pollset in case of 'poll' strategy. To fix this I am calling grpc_pollset_work with a 0 timeout right after adding the fds)

ctiller · 2016-05-21T05:24:37Z

Ok... polling problem fixed.

The final thing is that we need to keep track of non-listening cqs for real, so that they can work-steal even if they're not listening.

sreecha · 2016-05-21T06:29:41Z

test/cpp/end2end/hybrid_end2end_test.cc

  ResetStub();
  std::thread generic_handler_thread([this, &generic_service] {
+    gpr_log(GPR_DEBUG, "t0 start");


delete the debug lines (513, 515, 519, 521, 525, 527)

… them

This reverts commit 3f3312e.

This reverts commit e76528c.

ctiller · 2016-05-22T04:30:43Z

Flakes seem unrelated (sanity looks like misconfiguration, tsan is known)

ctiller · 2016-05-22T14:32:17Z

Seems ready pending a last check through

sreecha · 2016-05-23T14:54:49Z

test/cpp/end2end/hybrid_end2end_test.cc

@@ -207,6 +207,9 @@ class HybridEnd2endTest : public ::testing::Test {
    ServerBuilder builder;
    builder.AddListeningPort(server_address_.str(),
                             grpc::InsecureServerCredentials());
+    // Always add a sync unimplemented service: we rely on having at least one
+    // synchronous method to get a listening cq
+    builder.RegisterService(&unimplemented_service_);


Do we really need this to get a listening cq ? - the hybrid server already has a "sync part" and that should give us a listening cq anyway.

On a related note: I thought that these tests might have been previously failing because non-listening completion queues were not being added to server->cqs and hence weren't getting picked up for queuing completed events. I see that you fixed that (i.e add non-listening cqs to server-cqs and don't start a listener in grpc_server_start function on those pollsets)

So shouldn't that fix take care of the hybrid_end2end test failures ?

We've at least one test in the suite that overrides all methods as async or generic, meaning there's no sync part - meaning there's no sync listener (and that test needs one).

We could either split out some code for that specific test (which one is escaping me at the minute), or live with this to ensure that there's a sync listener around.

Got it. Ok. Makes sense.

sreecha · 2016-05-23T14:57:31Z

LGTM. I just had one comment/question in the hybrid_end2end tests but that is not blocking my LGTM

Server was continuing to make requests for new calls forever, which were starving out the shutdown sequence. Change order and win.

sreecha and others added 30 commits April 12, 2016 09:20

Some comments

76cfc6a

first cut of changes

42b004a

test cases

47ef37a

Rewrite test case to handle more scenarios

89bbc78

Delete debug log lines

fe11589

fix formatting

5e28d71

Test failures fix

9e926e8

Merge branch 'master' into server_channel_affinity

020cdb8

Add the option of adding a non-listening server completion queue. Thi…

1f5e262

…s makes writing certain test cases (like hybrid_end2end tests) easier

Add a safety check to ensure atleast one of the completion queues is

7def036

listening completion queue (i.e frequently polled)

generate_projects.sh and fix copyright year

0190712

clang format fix

0b9fdd8

Merge branch 'master' into server_channel_affinity

3049976

Merge branch 'master' into server_channel_affinity

cfa6401

Merge remote-tracking branch 'upstream/master' into server_channel_af…

192afb9

…finity

Merge branch 'master' into server_channel_affinity

a2b5495

Merge branch 'master' into server_channel_affinity

b69251c

Merge branch 'master' into server_channel_affinity

8dbe2cb

Merge branch 'master' into server_channel_affinity

0888911

Merge branch 'server_channel_affinity' of github.com:sreecha/grpc int…

f7a670f

…o affine

Add affinity to ev_poll_posix

b1d3b36

Merge branch 'connect_first' into test_affine

1102faf

Merge branch 'master' into server_channel_affinity

4790263

Add affinity to ev_poll_posix

cefa378

Merge branch 'better_wakeups' into test_affine

a0e10d4

Better testing

c3b88b0

Begin sharding request queues per cq

418a821

Further server cq affinity work

9f9d422

Add missing function for completion queue

40945c7

Fix the failing test. (Adding fd was caling 'kicked_without_pollers'

11e304a

flag to be set to true on the pollset in case of 'poll' strategy. To fix this I am calling grpc_pollset_work with a 0 timeout right after adding the fds)

sreecha reviewed May 21, 2016
View reviewed changes

ctiller added 8 commits May 21, 2016 12:32

Fix non-listening cq registration so that calls can be queued against…

509b30e

… them

Remove spam

3f3312e

Fix comments

fa96d86

clang-format

4265fa1

Revert "Remove spam"

e76528c

This reverts commit 3f3312e.

Simpler trick to force a listening cq

34c6e87

Revert "Revert "Remove spam""

bc7593d

This reverts commit e76528c.

Fix protobuf

5f04538

ctiller added 2 commits May 22, 2016 15:26

Merge github.com:grpc/grpc into test_affine

4609754

Merge github.com:grpc/grpc into test_affine

8ec4097

sreecha reviewed May 23, 2016
View reviewed changes

ctiller added 2 commits May 23, 2016 12:48

Merge github.com:grpc/grpc into test_affine

c4c6ecf

Merge branch 'test_affine' of github.com:ctiller/grpc into test_affine

8c2d373

sreecha mentioned this pull request May 23, 2016

Server channel affinity #6149

Merged

ctiller added 4 commits May 23, 2016 14:50

Fix timeout on async server shutdown

e67f7b6

Server was continuing to make requests for new calls forever, which were starving out the shutdown sequence. Change order and win.

Merge github.com:grpc/grpc into test_affine

116b3c5

Cleanup redundant tests

825cd45

Speed up tests

e0ddc35

ctiller added the disposition/ready to merge label May 24, 2016

ctiller merged commit 8978b3c into grpc:master May 24, 2016

markdroth mentioned this pull request May 19, 2017

gRPC C++ doxygen audit suggestions/change requests #11022

Closed

ncteisen mentioned this pull request May 30, 2018

Any reason to prefer the same completion queue for all requests from the same connection? #15535

Closed

lock bot locked as resolved and limited conversation to collaborators Jan 27, 2019

lock bot unassigned sreecha Jan 27, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Affinitize server call notification #6612

Affinitize server call notification #6612

ctiller commented May 17, 2016 •

edited

Loading

ctiller commented May 21, 2016

sreecha May 21, 2016 •

edited

Loading

ctiller commented May 22, 2016

ctiller commented May 22, 2016

sreecha May 23, 2016

ctiller May 23, 2016

sreecha May 23, 2016

sreecha commented May 23, 2016

Affinitize server call notification #6612

Affinitize server call notification #6612

Conversation

ctiller commented May 17, 2016 • edited Loading

ctiller commented May 21, 2016

sreecha May 21, 2016 • edited Loading

Choose a reason for hiding this comment

ctiller commented May 22, 2016

ctiller commented May 22, 2016

sreecha May 23, 2016

Choose a reason for hiding this comment

ctiller May 23, 2016

Choose a reason for hiding this comment

sreecha May 23, 2016

Choose a reason for hiding this comment

sreecha commented May 23, 2016

ctiller commented May 17, 2016 •

edited

Loading

sreecha May 21, 2016 •

edited

Loading