Double container probe timeout #61721

Merged Apr 11, 2018 (2 commits)


liggitt
Member

@liggitt liggitt commented Mar 26, 2018

In some environments, a combination of start latency and its
effect on sync pod latency causes the status manager to fail
to report within the 2-minute window.

Release note: NONE

@k8s-ci-robot added the size/XS and do-not-merge/release-note-label-needed labels Mar 26, 2018
@k8s-ci-robot added the approved and cncf-cla: yes labels Mar 26, 2018
this reduces flakiness in extended suites where
long start delays result in this test failing.
@k8s-ci-robot added the release-note-none label and removed the do-not-merge/release-note-label-needed label Mar 26, 2018
@liggitt
Member Author

liggitt commented Mar 26, 2018

/sig testing
@kubernetes/sig-testing-pr-reviews

@k8s-ci-robot added the sig/testing label Mar 26, 2018
@fejta
Contributor

fejta commented Mar 26, 2018

LGTM but can we get someone from the relevant sig -- api-machinery or networking?? -- to review this?

@liggitt
Member Author

liggitt commented Mar 26, 2018

@smarterclayton, suggestion for best approver for this? it's more a general "what's the longest a container could take to start in a test env" question

@fejta
Contributor

fejta commented Mar 26, 2018

/lgtm
/hold

@k8s-ci-robot added the do-not-merge/hold and lgtm labels Mar 26, 2018
@smarterclayton
Contributor

Right now about 2 minutes is the longest I've seen, so I think 3 minutes is a good rule of thumb (a slow pull on a heavily contended, low-CPU node without high IOPS).

@smarterclayton
Contributor

Generally I would expect any latency issues to be caught by the sig-scalability tests, which measure this in more controlled environments (a high-parallelism e2e run is not controlled, since multiple conflicting workloads can vary wildly in scope).

@smarterclayton
Contributor

/approve

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: fejta, liggitt, smarterclayton


@liggitt
Member Author

liggitt commented Apr 11, 2018

/hold cancel
/retest

@k8s-ci-robot removed the do-not-merge/hold label Apr 11, 2018
@k8s-github-robot

/test all [submit-queue is verifying that this PR is safe to merge]

@k8s-github-robot

Automatic merge from submit-queue (batch tested with PRs 46903, 61721, 62317). If you want to cherry-pick this change to another branch, please follow the instructions here.

@k8s-github-robot merged commit 72a44f9 into kubernetes:master Apr 11, 2018
@liggitt deleted the container-probe-timeout branch April 18, 2018 15:09
Labels
approved - Indicates a PR has been approved by an approver from all required OWNERS files.
cncf-cla: yes - Indicates the PR's author has signed the CNCF CLA.
lgtm - "Looks good to me", indicates that a PR is ready to be merged.
release-note-none - Denotes a PR that doesn't merit a release note.
sig/testing - Categorizes an issue or PR as relevant to SIG Testing.
size/XS - Denotes a PR that changes 0-9 lines, ignoring generated files.

5 participants