v1alpha2 E2E tests for termination policy #646

Merged: 2 commits, Jun 14, 2018

@jlewi jlewi commented Jun 12, 2018

Add E2E tests that verify termination policy is handled correctly.

  • Only the tests for v1alpha1 are enabled. A follow-on PR will check whether
    v1alpha2 works and enable the tests for v1alpha2.

  • Fix versionTag logic; we need to allow for case where versionTag is an

  • To facilitate these E2E tests, we create a test server that runs inside
    the replicas. This server lets us control what the process does via RPC,
    so the test runner can control when a replica exits.

  • Test harness needs to route requests through the APIServer proxy

  • Events no longer appear to show up for all services / pods even though
    all services and pods are being created. So we log a warning instead of
    failing the test.

  • Print out the TFJob spec and events to aid debugging test failures.
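
To illustrate the test server idea from the list above, here is a minimal sketch of a process whose exit can be triggered over HTTP. It is not the code in this PR: the `/exit` path, the `exitCode` query parameter, the default port, and the names `parse_exit_code`, `ControlHandler`, and `serve_until_exit` are all assumptions made for the example, and only the Python standard library is used.

```python
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlparse


def parse_exit_code(path):
    """Return the exit code requested by a path such as /exit?exitCode=3.

    Returns None for any path other than /exit (hypothetical endpoint name).
    A missing exitCode parameter is treated as 0.
    """
    parsed = urlparse(path)
    if parsed.path != "/exit":
        return None
    values = parse_qs(parsed.query).get("exitCode", ["0"])
    return int(values[0])


class ControlHandler(BaseHTTPRequestHandler):
    """Answers every GET; a GET on /exit also asks the server to stop."""

    def do_GET(self):
        code = parse_exit_code(self.path)
        if code is None:
            self._reply("running\n")
            return
        self._reply("exiting with %d\n" % code)
        self.server.requested_exit_code = code
        # shutdown() blocks until serve_forever() returns, so it has to be
        # called from a thread other than the one serving this request.
        threading.Thread(target=self.server.shutdown, daemon=True).start()

    def _reply(self, body):
        data = body.encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)

    def log_message(self, fmt, *args):
        # Keep the replica's log quiet; real code would probably log here.
        pass


def serve_until_exit(port=8080):
    """Serve until /exit is requested, then return the requested exit code."""
    server = HTTPServer(("", port), ControlHandler)
    server.requested_exit_code = 0
    server.serve_forever()
    server.server_close()
    return server.requested_exit_code
```

A replica's entrypoint could then call `sys.exit(serve_until_exit())`, so the container's exit code is whatever the test runner asked for. That is what would let a test exercise the termination policy for both successful and failed replicas.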

Fix #653 test server

Fixes: #235 E2E test case for when chief is worker 0

Related: #589 CI for v1alpha2
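
For the point about routing requests through the APIServer proxy, the sketch below shows how a test harness might build the URL. The path layout `/api/v1/namespaces/<namespace>/services/<name>:<port>/proxy/<path>` is the standard Kubernetes service proxy form; the function name `proxy_url`, the host, the service name, the port, and the `/exit` endpoint are assumptions made for the example and are not taken from this PR.

```python
from urllib.parse import quote, urlencode


def proxy_url(master, namespace, service, port, path, params=None):
    """Build a URL that reaches a Service through the Kubernetes API server proxy.

    master is the API server base URL, e.g. "https://kubernetes.example".
    path is the path on the target service; params become the query string.
    """
    url = "{master}/api/v1/namespaces/{ns}/services/{svc}:{port}/proxy/{path}".format(
        master=master.rstrip("/"),
        ns=quote(namespace, safe=""),
        svc=quote(service, safe=""),
        port=port,
        path=path.lstrip("/"),
    )
    if params:
        url += "?" + urlencode(params)
    return url


# Hypothetical usage: ask replica "myjob-worker-0" to exit with code 1.
example = proxy_url(
    "https://kubernetes.example", "default", "myjob-worker-0", 2222,
    "/exit", {"exitCode": 1},
)
```

Going through the proxy means the test runner only needs credentials for the API server and does not need direct network access to the pods. The request itself would still have to carry an authorization header accepted by the cluster.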



coveralls commented Jun 12, 2018

Coverage remained the same at 57.405% when pulling 32e6416 on jlewi:v1alpha2_tests into 4634c93 on kubeflow:master.

@jlewi jlewi changed the title [wip] v1alpha2 E2E tests for termination policy v1alpha2 E2E tests for termination policy Jun 14, 2018

jlewi commented Jun 14, 2018

/assign @ankushagarwal
/assign @gaocegege

@gaocegege gaocegege left a comment

/lgtm

* Fix bug in getting message from event.
@k8s-ci-robot k8s-ci-robot removed the lgtm label Jun 14, 2018
@ankushagarwal

/lgtm
/approve

jlewi commented Jun 14, 2018

/approve

@k8s-ci-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ankushagarwal, jlewi


@k8s-ci-robot k8s-ci-robot merged commit 96e6409 into kubeflow:master Jun 14, 2018
yph152 pushed a commit to yph152/tf-operator that referenced this pull request Jun 18, 2018
* Fix bug in wait for pods; we were exiting prematurely
* Fix bug in getting message from event.
jetmuffin pushed a commit to jetmuffin/tf-operator that referenced this pull request Jul 9, 2018