
kubernetes-e2e-aws failing to start cluster #18037

Closed
spxtr opened this issue Dec 1, 2015 · 12 comments
Labels: area/test, area/test-infra, priority/important-soon (Must be staffed and worked on either currently, or very soon, ideally in time for the next release.)
spxtr (Contributor) commented Dec 1, 2015

kubernetes-e2e-aws has been running daily for about a week and failing every time after repeatedly checking for salt-master:

e2e.go:141: Error starting e2e cluster. Aborting.

Once this is green, should it be moved to critical builds?

spxtr (Contributor, Author) commented Dec 1, 2015

Also related: e2e.sh has a case for kubernetes-e2e-aws-parallel, but there is no corresponding Jenkins job.

j3ffml (Contributor) commented Dec 11, 2015

Ping on this. Cluster bring-up is succeeding now, but the e2e test driver is timing out waiting for kube-system pods to start running.

ikehz added the priority/important-soon label Dec 22, 2015
ixdy (Member) commented Feb 4, 2016

Any update on this?

spxtr (Contributor, Author) commented Feb 11, 2016

It looks like both the aws and aws-1.1 jobs are running tests, with one test failing consistently on aws and several failing on aws-1.1.

spxtr (Contributor, Author) commented Feb 12, 2016

One test is failing now, in the same place three times in a row. @brendandburns, you might be interested.

[BeforeEach] Services
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/service.go:70

[It] should be able to change the type and ports of a service
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/service.go:699
...
STEP: changing the TCP service to type=LoadBalancer
STEP: changing the UDP service to type=LoadBalancer
STEP: waiting for the TCP service to have a load balancer
Feb 11 18:53:08.217: INFO: Waiting up to 20m0s for service "mutability-test" to have a LoadBalancer
Feb 11 18:53:10.291: INFO: TCP load balancer: a999de63bd13311e59c500a99e494a48-1780227540.us-east-1.elb.amazonaws.com
STEP: waiting for the UDP service mutability-test to have a load balancer
STEP: waiting for the UDP service to have a load balancer
Feb 11 18:53:10.291: INFO: Waiting up to 20m0s for service "mutability-test" to have a LoadBalancer
Feb 11 19:13:10.433: FAIL: Timeout waiting for service "mutability-test" to have a load balancer
...

• Failure [1281.610 seconds]
Services
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/service.go:885
  should be able to change the type and ports of a service [It]
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/service.go:699

  Feb 11 19:13:10.433: Timeout waiting for service "mutability-test" to have a load balancer

  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/service.go:1602

justinsb modified the milestone: v1.2 Feb 20, 2016
justinsb added and then removed the priority/backlog label Feb 23, 2016
brendandburns (Contributor) commented
Assigning to @justinsb since I believe he is fixing this. Ping back if that's not the case.

justinsb (Member) commented
Thank you. I'm pretty sure this is fixed (I'm running the e2e tests locally using a simulated-Jenkins hack), and they are currently all green. Once the pending PRs are merged, I will verify and investigate whether something is wrong with Jenkins.

fejta (Contributor) commented Mar 2, 2016

Jenkins ran the AWS tests on 3/1 and 2/29.

spxtr (Contributor, Author) commented Mar 2, 2016

The cluster successfully comes up, but two tests are failing: "Services should be able to up and down services" and "SSH should SSH to all nodes and run commands". We can either close this and open a new issue, or track those failures here.

On release-1.1 branch there are lots of tests failing, but I don't think that's a priority.

justinsb (Member) commented Mar 2, 2016

@spxtr those are failing on 1.2? The default SSH username changed to "admin" if you're using jessie, which is now the default if you don't set KUBE_OS_DISTRIBUTION. So KUBE_SSH_USER needs to be changed from ubuntu to admin. I can file a PR for that.

I think the service up/down failure is a flake; I've seen it come and go as well.

I would hope 1.1 passes tests, but that shouldn't be a priority over 1.2.

I propose we close this and open two issues: the default SSH username change, and the 1.1 e2e failures. Then, if we see the service up/down failure again, we open an issue for that too.
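The environment change described above would look roughly like this in the test job's environment. This is a sketch under the assumptions stated in the comment (jessie as the default distribution, "admin" as its SSH user); the exact placement in the Jenkins job configuration is not shown in this thread.

```shell
# Sketch of the SSH-user fix described above; exact job wiring is an assumption.
# With KUBE_OS_DISTRIBUTION unset, the AWS scripts now default to Debian jessie,
# whose image expects SSH logins as "admin" rather than "ubuntu".
export KUBE_OS_DISTRIBUTION=jessie   # now the default when unset
export KUBE_SSH_USER=admin           # previously "ubuntu" for the old default image
```

Setting both explicitly makes the job robust against the default changing again.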

spxtr (Contributor, Author) commented Mar 2, 2016

SGTM

justinsb (Member) commented Mar 4, 2016

Opened those two issues; closing this one.

justinsb closed this as completed Mar 4, 2016
8 participants