Reschedules CrashLoopBackOff
Pod
to fix permanent crashes caused by stale init-container/sidecar/configmap
- Listens to Pod update events
- Looks for containers in CrashLoopBackOff with
restartCount
> 5 (failureThreshold
config) - Ignores Pods with annotation
kube-remediator/CrashLoopBackOffRemediator: "false"
- Can work in a single namespace, default is all namespaces
""
(namespace
config) - Ignores Pods without
ownerReferences
(Avoid deleting something which does not come back)
Deletes Pod
s with label kube-remediator/OldPodDeleter=true
older than 24h
Deletes PersistentVolumeClaim
left behind by deleted StatefulSet
, that are not automatically cleanup up otherwise
- Waits for 7 days(configurable) before deleting
- Ignores if
PersistentVolume
haspersistentVolumeReclaimPolicy
set toRetain
kubectl apply -f kubernetes/rbac.yaml
kubectl apply -f kubernetes/app-server.yml
Configuration options:
- Deploy provided image to use defaults under
config/*
- Make a new image
FROM
the provided image and add/removeconfig/*
- Overwrite
config/*
with a mountedConfigMap
Run in local kubernetes with docker-for-mac
rake server
Run against local kubernetes cluster with go:
unset GOPATH
go mod vendor # install into local directory instead of global path
make dev # run on cluster from $KUBECONFIG (defaults to ~/.kube/config)
# CrashLoopBackOffRemediator: pod is rescheduled after restarting 5 times ?
kubectl apply -f examples/crashloop_pod.yml
# OldPodDeleter: pod is deleted when it gets 24h old ? (best change the 24h in the code to 1min)
kubectl apply -f examples/old_pod.yml