[Feature] Improve stability of RayJob controller  #393

Closed
@Jeffwan

Description

Search before asking

  • I have searched the issues and found no similar feature request.

Description

Follow-up on #303 (comment).

1. `jobDeploymentStatus` seems to stay at Running for successful runs -> add a Complete status.
2. `spec.jobId` is not being used to customize job IDs.
3. `spec.shutdownAfterJobFinishes` is not implemented yet.
4. Provide an easier way to fetch the job failure message from the CR.
5. I see frequent status updates; these can be reduced by comparing the status before and after reconciliation and skipping the update when nothing changed.
6. Logging is inconsistent.
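
To illustrate item 5, here is a minimal sketch of skipping redundant status writes. It is not the controller's actual code: `RayJobStatus`, `statusWriter`, and `updateStatusIfChanged` are hypothetical names, the struct is a trimmed-down stand-in for the real CR status, and the status strings are only illustrative values. The idea is simply to compare the old and new status and issue the write only when they differ.

```go
package main

import (
	"fmt"
	"reflect"
)

// RayJobStatus is a hypothetical, trimmed-down stand-in for the RayJob CR status.
type RayJobStatus struct {
	JobDeploymentStatus string
	JobStatus           string
	Message             string
}

// statusWriter is a hypothetical stand-in for the API call that persists the status.
type statusWriter func(RayJobStatus) error

// updateStatusIfChanged writes newStatus only when it differs from oldStatus.
// It reports whether a write was issued.
func updateStatusIfChanged(oldStatus, newStatus RayJobStatus, write statusWriter) (bool, error) {
	if reflect.DeepEqual(oldStatus, newStatus) {
		// Nothing changed, so skip the API call entirely.
		return false, nil
	}
	if err := write(newStatus); err != nil {
		return false, err
	}
	return true, nil
}

func main() {
	writes := 0
	write := func(RayJobStatus) error { writes++; return nil }

	// Illustrative values only.
	running := RayJobStatus{JobDeploymentStatus: "Running", JobStatus: "RUNNING"}
	complete := RayJobStatus{JobDeploymentStatus: "Complete", JobStatus: "SUCCEEDED"}

	updateStatusIfChanged(running, running, write)  // identical: no write
	updateStatusIfChanged(running, complete, write) // changed: one write
	fmt.Println("writes issued:", writes)           // prints "writes issued: 1"
}
```

In a real reconciler the comparison would presumably run against a deep copy of the status taken at the start of the reconcile loop, so that in-place mutations do not hide the difference.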

Use case

Make sure RayJob is stable in the v0.3.0 release.

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!
