issue #200: update leader timeouts after disk write #201

nhardt · 2015-12-07T21:40:23Z

in cases where disk writes can take longer than the leader election,
a follower can call for an election when one is not necessary. this
change resets the election related timers after all disk writes.

I don't fully understand what stepDown is doing or if we should call it again, but it does seem related to the other calls so I duplicated it, so I left in. In testing, this change is helping prevent the stated problem, which makes sense. I'll make whatever changes are requested.

ongardie · 2015-12-07T21:45:35Z

Server/RaftConsensus.cc

+
+    // reset election timer to avoid punishing the leader for our own
+    // long writes
+    stepDown(request.term());


No need to call stepDown again here.

ongardie · 2015-12-07T21:47:16Z

Server/RaftConsensus.cc

+    // This request is a sign of life from the current leader. Update
+    // our term and convert to follower if necessary; reset the
+    // election timer. set it here in case we exit early, we will set
+    // it again after the write


"exit early" -> "exit this function early"
"write" -> "disk write"

ongardie · 2015-12-07T21:48:31Z

Cool. stepDown is there (above) to ensure this server is a follower in the same term (it might not have participated in the election).

ongardie · 2015-12-07T21:51:42Z

Probably also worth updating copyright header and release notes.

nhardt · 2015-12-07T23:22:58Z

ok, re-pushed with those changes. is this issue able to repro'd somewhere in the logcabin unit tests?

ongardie · 2015-12-07T23:24:22Z

RELEASES.md

@@ -15,6 +15,10 @@ See [RELEASE-PROCESS.md](RELEASE-PROCESS.md).
 Version 1.2.0-alpha.0 (In Development)
 ======================================

+Bug fixes (low severity):
+
+- #200: reset leader election timeout in follower after disk io completes


That's an improvement, not a bug fix :)

ongardie · 2015-12-07T23:26:55Z

Let me think about the unit testing question. At least we should be able to find a clean way to check that the code is doing what we expect (whitebox).

in cases where disk writes can take longer than the leader election, a follower can call for an election when one is not necessary. this change resets the election related timers after all disk writes.

nhardt · 2015-12-07T23:48:47Z

pushed another.

issue #200: update leader timeouts after disk write

ongardie reviewed Dec 7, 2015
View reviewed changes

nhardt force-pushed the issue200 branch from e9641c1 to 45332ce Compare December 7, 2015 21:46

ongardie reviewed Dec 7, 2015
View reviewed changes

nhardt force-pushed the issue200 branch from 45332ce to 1f6f597 Compare December 7, 2015 21:47

nhardt force-pushed the issue200 branch from 1f6f597 to 2e6fb73 Compare December 7, 2015 21:51

nhardt force-pushed the issue200 branch from 2e6fb73 to 77b0470 Compare December 7, 2015 21:57

ongardie reviewed Dec 7, 2015
View reviewed changes

nhardt force-pushed the issue200 branch from 77b0470 to 745f2bb Compare December 7, 2015 23:38

issue logcabin#200: update leader timeouts after disk write

745f2bb

in cases where disk writes can take longer than the leader election, a follower can call for an election when one is not necessary. this change resets the election related timers after all disk writes.

ongardie added a commit that referenced this pull request Feb 11, 2016

Merge pull request #201 from nhardt/issue200

bca155c

issue #200: update leader timeouts after disk write

ongardie merged commit bca155c into logcabin:master Feb 11, 2016

nhardt deleted the issue200 branch October 5, 2016 20:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue #200: update leader timeouts after disk write #201

issue #200: update leader timeouts after disk write #201

nhardt commented Dec 7, 2015

ongardie Dec 7, 2015

ongardie Dec 7, 2015

ongardie commented Dec 7, 2015

ongardie commented Dec 7, 2015

nhardt commented Dec 7, 2015

ongardie Dec 7, 2015

ongardie commented Dec 7, 2015

nhardt commented Dec 7, 2015

issue #200: update leader timeouts after disk write #201

issue #200: update leader timeouts after disk write #201

Conversation

nhardt commented Dec 7, 2015

ongardie Dec 7, 2015

Choose a reason for hiding this comment

ongardie Dec 7, 2015

Choose a reason for hiding this comment

ongardie commented Dec 7, 2015

ongardie commented Dec 7, 2015

nhardt commented Dec 7, 2015

ongardie Dec 7, 2015

Choose a reason for hiding this comment

ongardie commented Dec 7, 2015

nhardt commented Dec 7, 2015