v1alpha2 E2E tests for termination policy #646

jlewi · 2018-06-12T21:08:57Z

Add E2E tests that verify termination policy is handled correctly.

* Only the tests for v1alpha1 are enabled. A follow-on PR will check whether v1alpha2 is working and enable the tests for v1alpha2.
* Fix versionTag logic; we need to allow for case where versionTag is an
* To facilitate these E2E tests, we create a test server that runs inside the replicas. The server lets us control what the process does via RPC, so the test runner can control when a replica exits.
* The test harness needs to route requests through the APIServer proxy.
* Events no longer appear to be showing up for all services / pods even though all services and pods are being created, so we turn the failure into a warning instead of a test failure.
* Print out the TFJob spec and events to aid debugging test failures.
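
For readers unfamiliar with the approach, here is a minimal sketch of the idea behind the test server and the proxy routing. It is not the code in this PR. The `/exit` route, the `exitCode` query parameter, port `2222`, and the helper names `parse_exit_code`, `serve` and `proxy_url` are all assumptions made for illustration. The only piece taken from Kubernetes itself is the service proxy path format, `/api/v1/namespaces/<namespace>/services/<name>:<port>/proxy/<path>`.

```python
import os
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlparse


def parse_exit_code(path):
    """Return the exit code requested by a path such as /exit?exitCode=3.

    Returns None when the path is not the (hypothetical) /exit route.
    """
    parsed = urlparse(path)
    if parsed.path != "/exit":
        return None
    return int(parse_qs(parsed.query).get("exitCode", ["0"])[0])


class ControlHandler(BaseHTTPRequestHandler):
    """Handles GET requests that tell the replica process what to do."""

    def do_GET(self):
        code = parse_exit_code(self.path)
        self.send_response(200)
        self.end_headers()
        if code is None:
            self.wfile.write(b"running\n")
            return
        self.wfile.write(b"exiting\n")
        # Exit a moment later so the caller still receives the response.
        threading.Timer(0.5, os._exit, args=[code]).start()


def serve(port=2222):
    """Run the control server inside a replica (port is an assumption)."""
    HTTPServer(("", port), ControlHandler).serve_forever()


def proxy_url(api_server, namespace, service, port, path=""):
    """Build a URL that reaches a service through the APIServer proxy."""
    return "{0}/api/v1/namespaces/{1}/services/{2}:{3}/proxy/{4}".format(
        api_server.rstrip("/"), namespace, service, port, path.lstrip("/"))
```

Under these assumptions, a test runner would make a replica exit with code 1 by sending a GET to `proxy_url(api_server, namespace, service, 2222, "/exit?exitCode=1")`, then check that the controller applies the termination policy as expected.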

Fix #653 test server

Fixes: #235 E2E test case for when chief is worker 0

Related: #589 CI for v1alpha2

coveralls · 2018-06-12T21:31:28Z

Coverage remained the same at 57.405% when pulling 32e6416 on jlewi:v1alpha2_tests into 4634c93 on kubeflow:master.

jlewi · 2018-06-14T04:49:38Z

/assign @ankushagarwal
/assign @gaocegege

gaocegege

/lgtm

ankushagarwal · 2018-06-14T15:56:23Z

/lgtm
/approve

jlewi · 2018-06-14T16:35:05Z

/approve

k8s-ci-robot · 2018-06-14T16:35:12Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ankushagarwal, jlewi

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [jlewi]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the do-not-merge/work-in-progress label Jun 12, 2018

k8s-ci-robot requested review from jimexist and mitake June 12, 2018 21:09

k8s-ci-robot added the size/XXL label Jun 12, 2018

jlewi force-pushed the v1alpha2_tests branch from db8738f to 923a35b Compare June 13, 2018 05:39

This was referenced Jun 13, 2018

[v1alpha2] Add CI test #589

Closed

[v1alpha2] Create a simple python server to be used for E2E tests of controller behavior #653

Closed

jlewi force-pushed the v1alpha2_tests branch from c290504 to 308c980 Compare June 14, 2018 03:05

jlewi changed the title ~~[wip] v1alpha2 E2E tests for termination policy~~ v1alpha2 E2E tests for termination policy Jun 14, 2018

k8s-ci-robot removed the do-not-merge/work-in-progress label Jun 14, 2018

jlewi force-pushed the v1alpha2_tests branch from a34f0ed to 057e08a Compare June 14, 2018 04:49

k8s-ci-robot assigned ankushagarwal and gaocegege Jun 14, 2018

k8s-ci-robot added the lgtm label Jun 14, 2018

gaocegege reviewed Jun 14, 2018

* Fix bug in wait for pods; we were exiting prematurely

32e6416

* Fix bug in getting message from event.

k8s-ci-robot removed the lgtm label Jun 14, 2018

k8s-ci-robot added the lgtm label Jun 14, 2018

k8s-ci-robot added the approved label Jun 14, 2018

k8s-ci-robot merged commit 96e6409 into kubeflow:master Jun 14, 2018
