pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #47362

yujuhong · 2017-06-12T20:05:38Z

The ~5 most recent builds failed because of this issue.
https://k8s-gubernator.appspot.com/builds/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce-etcd3

W0612 12:52:57.395] ERROR: (gcloud.compute.networks.create) Could not fetch resource:
W0612 12:52:57.396]  - Quota 'SUBNETWORKS' exceeded.  Limit: 150.0

/cc @kubernetes/sig-testing-bugs @kubernetes/sig-network-bugs based on the similar issue from ~2 weeks ago #46713

The text was updated successfully, but these errors were encountered:

k8s-github-robot · 2017-06-12T20:05:41Z

@yujuhong There are no sig labels on this issue. Please add a sig label by:
(1) mentioning a sig: @kubernetes/sig-<team-name>-misc
(2) specifying the label manually: /sig <label>

Note: method (1) will trigger a notification to the team. You can find the team list here and label list here

yujuhong · 2017-06-12T20:07:00Z

/cc @dchen1107 @krzyzacy

krzyzacy · 2017-06-12T20:10:08Z

that again?
cc @MrHohn

MrHohn · 2017-06-12T20:27:29Z

@krzyzacy The behavior this time is we leaked an ingress firewall and failed to cleanup the network resource.

W0612 13:20:54.309] ERROR: (gcloud.compute.networks.delete) Some requests did not succeed:
W0612 13:20:54.309]  - The network resource 'projects/k8s-jkns-pr-gce-etcd3/global/networks/e2e-35526' is already being used by 'projects/k8s-jkns-pr-gce-etcd3/global/firewalls/ingress-80-443-e2e-tests-ingress-3qzl2'
W0612 13:20:54.309] 
I0612 13:20:54.410] Failed to delete network 'e2e-35526'. Listing firewall-rules:
I0612 13:20:55.064] NAME                                    NETWORK    SRC_RANGES  RULES           SRC_TAGS  TARGET_TAGS
I0612 13:20:55.065] ingress-80-443-e2e-tests-ingress-3qzl2  e2e-35526  0.0.0.0/0   tcp:80,tcp:443

@nicksardo I thought the ingress tests are only run in slow suite?

krzyzacy · 2017-06-12T20:50:39Z

The leaking firewall is created in

I0612 12:14:24.949] STEP: Initializing nginx controller
I0612 12:14:24.949] Jun 12 12:06:08.560: INFO: Creating firewall-rules in project k8s-jkns-pr-gce-etcd3: ingress-80-443-e2e-tests-ingress-g9d4j
I0612 12:14:24.949] Jun 12 12:06:08.560: INFO: Running command: gcloud compute firewall-rules create ingress-80-443-e2e-tests-ingress-g9d4j --project=k8s-jkns-pr-gce-etcd3 --allow tcp:80,tcp:443 --network e2e-35517

and then it cannot acquire the IP,

I0612 12:14:24.955] Jun 12 12:14:12.173: INFO: Waiting for Ingress echomap to acquire IP, error <nil>
I0612 12:14:24.955] Jun 12 12:14:22.197: INFO: Waiting for Ingress echomap to acquire IP, error <nil>
I0612 12:14:24.955] 
I0612 12:14:24.955] ---------------------------------------------------------
I0612 12:14:24.956] Received interrupt.  Running AfterSuite...
I0612 12:14:24.956] ^C again to terminate immediately

and result in

/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/ingress.go:194
Jun 12 12:05:02.109: Ingress failed to acquire an IP address within 15m0s
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/framework/ingress_utils.go:900

seems that ingress test is pretty much screwed?
https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/pr-logs/pull/47274/pull-kubernetes-e2e-gce-etcd3/35517
once it's timed out it's not deleting created resources

MrHohn · 2017-06-12T20:58:25Z

The ingress test is supposed to cleanup firewall resource in AfterEach(), but before that it received sigterm signals so it stopped:

I0612 13:15:04.080] Jun 12 13:14:59.681: INFO: Waiting for Ingress echomap to acquire IP, error <nil>
I0612 13:15:04.080] 
I0612 13:15:04.080] ---------------------------------------------------------
I0612 13:15:04.080] Received interrupt.  Running AfterSuite...
I0612 13:15:04.080] ^C again to terminate immediately
I0612 13:15:04.081] Jun 12 13:15:03.970: INFO: Running AfterSuite actions on all node
I0612 13:15:04.081] 
I0612 13:15:33.752] 
I0612 13:15:33.753] Jun 12 13:06:01.025: INFO: Running AfterSuite actions on all node
I0612 13:15:33.753] 
I0612 13:15:33.753] ---------------------------------------------------------

nicksardo · 2017-06-12T21:01:00Z

@aledbf Looks like this is the nginx ingress test. We're going to move this to the slow suite.

dchen1107 · 2017-06-12T21:02:01Z

SGTM.

nicksardo · 2017-06-12T21:10:30Z

Will cleanup the project resources.

dchen1107 · 2017-06-12T21:10:46Z

@krzyzacy Can we removed those leaked network resources for recover the build? Thanks~!

krzyzacy · 2017-06-12T21:15:42Z

just wiped some firewalls. @nicksardo I'll leave the rest to you :-)

nicksardo · 2017-06-12T21:16:13Z

All ingress firewalls are gone. Will delete the old networks.

nicksardo · 2017-06-12T21:28:47Z

Old networks have been cleared. Everything should be good-to-go.

nicksardo · 2017-06-12T22:16:09Z

I'll cleanup resources again after my PR gets merged.

nicksardo · 2017-06-12T22:22:55Z

/assign

ericchiang · 2017-06-13T02:24:19Z

@nicksardo this test was re-assigned to the slow suits but is still failing there. https://k8s-testgrid.appspot.com/release-master-blocking#gci-gke-slow

nicksardo · 2017-06-13T04:13:46Z

@aledbf Mind looking at this?

aledbf · 2017-06-13T04:28:18Z

@nicksardo sure

aledbf · 2017-06-13T04:41:40Z

@nicksardo reading one of the logs https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-e2e-gci-gke-slow/7064/nodelog?junit=junit_08.xml&wrap=on
I see this

E0613 00:10:33.950066    1552 remote_runtime.go:91] RunPodSandbox from runtime service failed: rpc error: code = 2 desc = NetworkPlugin kubenet failed to set up pod "nginx-ingress-controller-6xd68_e2e-tests-ingress-l50hv" network: cannot open hostport 80 for pod nginx-ingress-controller-6xd68_e2e-tests-ingress-l50hv: listen tcp :80: bind: address already in use

yujuhong added the kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. label Jun 12, 2017

k8s-ci-robot added sig/testing Categorizes an issue or PR as relevant to SIG Testing. kind/bug Categorizes issue or PR as related to a bug. labels Jun 12, 2017

k8s-ci-robot added sig/network Categorizes an issue or PR as relevant to SIG Network. kind/bug Categorizes issue or PR as related to a bug. labels Jun 12, 2017

k8s-github-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Jun 12, 2017

yujuhong removed the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Jun 12, 2017

wongma7 mentioned this issue Jun 12, 2017

Don't provision for PVCs with AccessModes unsupported by plugin #47274

Merged

dchen1107 added this to the v1.7 milestone Jun 12, 2017

dchen1107 added the approved-for-milestone label Jun 12, 2017

nicksardo mentioned this issue Jun 12, 2017

[Nginx] Run nginx ingress test in slow suite #47369

Merged

dcbw mentioned this issue Jun 12, 2017

kubelet/network: report but tolerate errors returned from GetNetNS() v2 #46823

Merged

dchen1107 mentioned this issue Jun 12, 2017

cache mutation detector causes memory/cpu pressure at the end of long e2e runs (like pull-kubernetes-e2e-gce-etcd3) #47135

Closed

k8s-ci-robot assigned nicksardo Jun 12, 2017

dchen1107 closed this as completed in #47369 Jun 12, 2017

nicksardo mentioned this issue Jun 13, 2017

E2E Test failure: [k8s.io] Loadbalancing: L7 [k8s.io] [Slow] Nginx should conform to Ingress spec (Failure cluster [1e796c...] failed 121 builds, 23 jobs, and 3 tests over 1 days) #47397

Closed

dchen1107 mentioned this issue Jun 13, 2017

E2E Test failure: [k8s.io] Loadbalancing: L7 [k8s.io] [Slow] Nginx should conform to Ingress spec #47441

Closed

cblecker mentioned this issue Aug 22, 2017

pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #51136

Closed

MrHohn mentioned this issue Sep 8, 2017

Create manual network instead of automatic network for PR jobs. kubernetes/test-infra#4472

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #47362

pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #47362

yujuhong commented Jun 12, 2017

k8s-github-robot commented Jun 12, 2017

yujuhong commented Jun 12, 2017

krzyzacy commented Jun 12, 2017

MrHohn commented Jun 12, 2017

krzyzacy commented Jun 12, 2017 •

edited

Loading

MrHohn commented Jun 12, 2017

nicksardo commented Jun 12, 2017

dchen1107 commented Jun 12, 2017

nicksardo commented Jun 12, 2017

dchen1107 commented Jun 12, 2017

krzyzacy commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

ericchiang commented Jun 13, 2017

nicksardo commented Jun 13, 2017

aledbf commented Jun 13, 2017

aledbf commented Jun 13, 2017

pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #47362

pull-kubernetes-e2e-gce-etcd3 failing: Quota 'SUBNETWORKS' exceeded #47362

Comments

yujuhong commented Jun 12, 2017

k8s-github-robot commented Jun 12, 2017

yujuhong commented Jun 12, 2017

krzyzacy commented Jun 12, 2017

MrHohn commented Jun 12, 2017

krzyzacy commented Jun 12, 2017 • edited Loading

MrHohn commented Jun 12, 2017

nicksardo commented Jun 12, 2017

dchen1107 commented Jun 12, 2017

nicksardo commented Jun 12, 2017

dchen1107 commented Jun 12, 2017

krzyzacy commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

nicksardo commented Jun 12, 2017

ericchiang commented Jun 13, 2017

nicksardo commented Jun 13, 2017

aledbf commented Jun 13, 2017

aledbf commented Jun 13, 2017

krzyzacy commented Jun 12, 2017 •

edited

Loading