Automatic pod rescheduling #47965
Labels
kind/feature
Categorizes issue or PR as related to a new feature.
sig/api-machinery
Categorizes an issue or PR as relevant to SIG API Machinery.
/kind feature
Right now, if a node is terminated, its pods are rescheduled on any other available node. However, once a new node is created to replace the failed node, pods are not rescheduled onto it, so a failure of the other node results in a temporary outage.
For example:
Node N1 has pod P1
Node N2 has pod P2
Node N1 fails
Node N2 has pods P1 and P2
Node N3 is created to replace Node N1
Node N2 has pods P1 and P2
Node N3 has no pods
If N2 fails, there will be an outage until the pods are rescheduled on N3. If one of the pods had been rescheduled automatically when N3 became healthy, each node would have had a pod, and a failure of any single node would not have resulted in an outage.
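To make the scenario concrete, here is a minimal Python sketch of the node and pod states described above. It is only a toy model, not the Kubernetes scheduler: `fail_node` and `rebalance` are hypothetical helpers invented for illustration, and the "move pods from the fullest node to the emptiest node" rule is an assumption about what automatic rescheduling could look like, not a proposed implementation.

```python
def fail_node(placement, failed):
    """Model the current behavior: pods from a failed node move to the
    least-loaded surviving node. Returns pods that could not be placed."""
    orphans = placement.pop(failed)
    pending = []
    for pod in orphans:
        if not placement:
            # No surviving node: the pod stays pending (an outage).
            pending.append(pod)
            continue
        target = min(placement, key=lambda n: len(placement[n]))
        placement[target].append(pod)
    return pending


def rebalance(placement):
    """Hypothetical step requested in this issue: move pods from the
    fullest node to the emptiest until counts differ by at most one."""
    while True:
        fullest = max(placement, key=lambda n: len(placement[n]))
        emptiest = min(placement, key=lambda n: len(placement[n]))
        if len(placement[fullest]) - len(placement[emptiest]) <= 1:
            return
        placement[emptiest].append(placement[fullest].pop())


placement = {"N1": ["P1"], "N2": ["P2"]}
fail_node(placement, "N1")   # N2 now holds both pods
placement["N3"] = []         # replacement node joins with no pods
rebalance(placement)         # without this step, N3 stays empty
print(placement)
```

Without the `rebalance` call, the model ends in the state from the example (N2 holds both pods, N3 holds none), so a later failure of N2 takes down both pods at once.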