Incremental improvements to kubelet e2e tests #24426

pwittrock · 2016-04-18T21:00:56Z

Add keep-alive to ssh connection
Don't try to stop services on image-based runs
Increase Jenkins CI timeout to 90 minutes to accommodate unpredictable go build times
Remove spammy log statement


eparis · 2016-04-18T21:25:02Z

test/e2e_node/privileged_test.go

@@ -151,7 +150,6 @@ func (config *PrivilegedPodTestConfig) dialFromContainer(containerIP string, con
 	var output map[string]string
 	err = json.Unmarshal([]byte(stdout), &output)
 	Expect(err).NotTo(HaveOccurred(), fmt.Sprintf("Could not unmarshal curl response: %s", stdout))
-	glog.Infof("Deserialized output is %v", output)


This output was just not useful at all?

cc @yujuhong

The tests that call this log the map if it does not match expectations. I don't think we want to output this when the tests pass since it clutters up the build log.

I copied this from the original test. The output seems pretty meaningful. E.g., "privileged_test.go:154] Deserialized output is map[output:RTNETLINK answers: Operation not permitted"

Do we have any general rules for logging in a node e2e suite? A cluster e2e test logs quite a lot at the info level, but I've just noticed that it's very different in the node e2e suite.

I expect that message is ignored unless the test fails, in which case it should be printed out by the assertion. That output is actually expected, but can make debugging unrelated failures hard because it looks like an error message even though it is printed at info level.

My opinion is that anything besides the test results themselves is just debug information that we should only assume will be seen if there is a test failure. To the extent possible, debug information should only be printed if it is related to a failed test.

I agree in general that we should keep the output to the minimum, but fwiw, comparing the logs of a passed test against those of a failed test is helpful at times. I guess we won't need that if every test is well-written :)

k8s-bot · 2016-04-18T21:35:13Z

GCE e2e build/test passed for commit 90d2f9a.

pwittrock · 2016-04-19T16:47:39Z

Added p1 label since this is causing flaky failures and should receive higher priority.

eparis · 2016-04-19T19:40:09Z

is the flake fix just the timeout extension?

pwittrock · 2016-04-19T21:03:06Z

No, there are a couple other flakes I am hoping this will help tease out.

#24423 CoreOS is killing the ssh connection mid-test - I am seeing whether the keep-alive helps with this
#24422 Ginkgo tests exit -1 after the suite passes - I stopped the run from stopping the processes to see whether that is what causes the failure
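
For reference, a client-side keep-alive of the kind mentioned for #24423 can be requested from the OpenSSH client with the `ServerAliveInterval` and `ServerAliveCountMax` options. The sketch below only shows how such arguments might be assembled in Go; it is not the code from this PR, and `sshArgs`, the host name, and the interval values are assumptions chosen for illustration.

```go
package main

import (
	"fmt"
	"os/exec"
	"strconv"
)

// sshArgs is a hypothetical helper that builds OpenSSH client arguments
// with keep-alive enabled. ServerAliveInterval makes the client send a
// probe after intervalSec seconds of silence; ServerAliveCountMax is the
// number of unanswered probes tolerated before the client disconnects.
func sshArgs(host string, intervalSec, maxMissed int, remoteCmd string) []string {
	return []string{
		"-o", "ServerAliveInterval=" + strconv.Itoa(intervalSec),
		"-o", "ServerAliveCountMax=" + strconv.Itoa(maxMissed),
		host,
		remoteCmd,
	}
}

func main() {
	// The host and values below are placeholders, not taken from the PR.
	args := sshArgs("core@node-under-test", 30, 3, "echo ok")

	// Build the command without running it, so the sketch has no side effects.
	cmd := exec.Command("ssh", args...)
	fmt.Println(cmd.Args)
}
```

Whether a keep-alive actually prevents the disconnect depends on why the connection is dropped, which is what this change is meant to help find out.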

k8s-github-robot · 2016-04-20T04:55:59Z

@k8s-bot test this [submit-queue is verifying that this PR is safe to merge]

k8s-github-robot · 2016-04-20T04:56:23Z

@k8s-bot test this issue: #IGNORE

Tests have been pending for 24 hours

k8s-bot · 2016-04-20T05:36:16Z

GCE e2e build/test passed for commit 90d2f9a.

k8s-github-robot · 2016-04-20T05:36:40Z

Automatic merge from submit-queue

Incremental improvements to kubelet e2e tests

90d2f9a

- Add keep-alive to ssh connection
- Don't try to stop services on image-based runs
- Increase Jenkins CI timeout to 90 minutes to accommodate unpredictable go build times
- Remove spammy log statement

googlebot added the cla: yes label Apr 18, 2016

k8s-github-robot assigned eparis Apr 18, 2016

k8s-github-robot added the size/S (denotes a PR that changes 10-29 lines, ignoring generated files) and release-note-label-needed labels Apr 18, 2016

eparis reviewed Apr 18, 2016

eparis added the lgtm ("Looks good to me", indicates that a PR is ready to be merged) and release-note (denotes a PR that will be considered when it comes time to generate release notes) labels and removed the release-note-label-needed label Apr 19, 2016

pwittrock added the priority/important-soon (must be staffed and worked on either currently, or very soon, ideally in time for the next release) and kind/flake (categorizes issue or PR as related to a flaky test) labels Apr 19, 2016

This was referenced Apr 19, 2016

kubelet test flake - coreos killed ssh connection mid test #24423

Closed

kubelet ssh / ginkgo exists 255 after successful tests #24422

Closed

k8s-github-robot merged commit 86544c2 into kubernetes:master Apr 20, 2016

pwittrock mentioned this pull request Apr 20, 2016

kubelet-e2e-gce-ci timeout #24209

Closed


eparis Apr 18, 2016

pwittrock Apr 18, 2016

yujuhong Apr 18, 2016

pwittrock Apr 19, 2016

yujuhong Apr 19, 2016
