Double container probe timeout #61721
Conversation
In some environments, we see a combination of container start latency and the corresponding effect on sync-pod latency causing the status manager to fail to report within the 2 minute window.
This reduces flakiness in extended suites where long start delays cause this test to fail. A hedged sketch of the kind of readiness wait whose window the change extends follows below.
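For context, here is a minimal Go sketch of such a readiness wait, assuming a recent client-go and apimachinery. The helper name `waitForContainersReady`, the 4-minute value (the original 2-minute window doubled), and the 2-second poll interval are illustrative assumptions, not taken from this PR's diff.

```go
// Hypothetical sketch only (not the actual diff from this PR): it illustrates
// what an e2e-style wait with a doubled timeout could look like using
// client-go and apimachinery's wait helpers.
package e2esketch

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/wait"
	"k8s.io/client-go/kubernetes"
)

// Assumed: the original window was 2 minutes; doubling it tolerates slow image
// pulls and sync-pod latency on heavily contended nodes.
const containerReadyTimeout = 4 * time.Minute

// waitForContainersReady polls the pod until every container reports Ready,
// returning an error if the (doubled) timeout elapses first.
func waitForContainersReady(ctx context.Context, c kubernetes.Interface, ns, name string) error {
	return wait.PollUntilContextTimeout(ctx, 2*time.Second, containerReadyTimeout, true,
		func(ctx context.Context) (bool, error) {
			pod, err := c.CoreV1().Pods(ns).Get(ctx, name, metav1.GetOptions{})
			if err != nil {
				return false, err
			}
			if len(pod.Status.ContainerStatuses) == 0 {
				return false, nil
			}
			for _, cs := range pod.Status.ContainerStatuses {
				if !cs.Ready {
					return false, nil
				}
			}
			return true, nil
		})
}
```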
/sig testing
LGTM, but can we get someone from the relevant SIG (api-machinery or networking?) to review this?
@smarterclayton, any suggestion for the best approver for this? It's more a general "what's the longest a container could take to start in a test env" question.
/lgtm
Right now about 2 minutes is the longest I've seen, so I think 3 is a good rule of thumb (a slow pull on a heavily contended, low-CPU node without high IOPS).
Generally I would expect any latency issues to be caught by the sig-scalability tests that measure for this, which run in more controlled environments (a high-parallelism e2e run is not controlled, since multiple conflicting workloads can vary wildly in scope).
/approve
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: fejta, liggitt, smarterclayton. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
/hold cancel
/test all [submit-queue is verifying that this PR is safe to merge]
Automatic merge from submit-queue (batch tested with PRs 46903, 61721, 62317). If you want to cherry-pick this change to another branch, please follow the instructions here.