Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: A way to signal pods #24957

Open
thockin opened this issue Apr 29, 2016 · 80 comments
Open

Feature request: A way to signal pods #24957

thockin opened this issue Apr 29, 2016 · 80 comments
Labels
area/api Indicates an issue on api area. kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/backlog Higher priority than priority/awaiting-more-evidence. sig/node Categorizes an issue or PR as relevant to SIG Node.

Comments

@thockin
Copy link
Member

thockin commented Apr 29, 2016

This has come up a bunch of times in conversations. The idea is that some external orchestration is being performed and the user needs to signal (SIGHUP usually, but arbitrary) pods. Sometimes this is related to ConfigMap (#22368) and sometimes not. This also can be used to "bounce" pods.

We can't currently signal across containers in a pod, so any sort of sidecar is out for now.

We can do it by docker kill, but something like kubectl signal is clumsy - it's not clearly an API operation (unless we revisit
'operation' constructs for managing imperative async actions).

Another issue is that a pod doesn't really exist - do we signal one container? Which one? Maybe all containers? Or do we nominate a signalee in the pod spec?

@thockin thockin added area/api Indicates an issue on api area. kind/feature Categorizes issue or PR as related to a new feature. labels Apr 29, 2016
@mikedanese mikedanese added the sig/node Categorizes an issue or PR as relevant to SIG Node. label Apr 29, 2016
@bprashanth
Copy link
Contributor

We can just use config map as the kube-event-bus. It's what I was going to do for petset.

I need to notify petset peers about changes in cluster membership. The idea is people will write "on-change" hook handlers, the petset will create a config map and write to it anytime it creates a new pet. The kubelet runs a handler exec/http-probe style, on every change to the config map.

I'd rather have a hook than an explicity signal because I don't need a proxy to listen for the signal and reload the pet. In fact to do this reliably I need a handler, because I need to poll till the pet shows up in DNS.

I vastly prefer this to running a pid 1 that is cluster aware (autopilot strategy).

@thockin
Copy link
Member Author

thockin commented Apr 29, 2016

I opened this as a separate PR because I have been asked several times for
literal signals. Asking people to do something instead of signals,
especially something kube-centric is a non-starter for people who have apps
that need signals. We could ask them to bundle sidecars as bridges, but
because we can not signal across containers, they have to bundle into their
own containers. That is gross to max.

On Fri, Apr 29, 2016 at 2:32 PM, Prashanth B notifications@github.com
wrote:

We can just use config map as the kube-event-bus. It's what I was going to
do for petset.

I need to notify petset peers about changes in cluster membership. The
idea is people will write "on-change" hook handlers, the petset will create
a config map and write to it anytime it creates a new pet. The kubelet runs
a handler exec/http-probe style, on every change to the config map.

I'd rather have a hook than an explicity signal because I don't need a
proxy to listen for the signal and reload the pet. In fact to do this
reliably I need a handler, because I need to poll till the pet shows up in
DNS.

I vastly prefer this to running a pid 1 that is cluster aware (autopilot
strategy).


You are receiving this because you authored the thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@bprashanth
Copy link
Contributor

bprashanth commented Apr 29, 2016

You won't need a sidecar, you'll need a shell script that sends pkill -HUP nginx, right? the kubelet will exec-probe that script when the config map changes. This will require modifcation of pod spec but is more flexible imo.

@thockin
Copy link
Member Author

thockin commented Apr 30, 2016

You're missing the point. There are other things people want to do that
involve sending a signal and "create a config map, modify your container to
include bash and pkill, modify your pod to mount the config map, write a
script that runs your process and then watches the configmap and signals
your process, and then run that instead of your real app" is a crappy
solution. It's a kludgey workaround for lack of a real feature.

On Fri, Apr 29, 2016 at 3:27 PM, Prashanth B notifications@github.com
wrote:

You won't need a sidecar, you'll need a shell script that sends pkill
-HUP nginx, right? the kubelet will exec-probe that script. This will
require modifcation of pod spec but is more flexible imo.


You are receiving this because you authored the thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@bprashanth
Copy link
Contributor

write a script that runs your process and then watches the configmap and signals
your process, and then run that instead of your real app"

That's not part of what i want. I explicitly don't want a long running process aware of the config map.

create a config map, modify your container to
include bash and pkill, modify your pod to mount the config map,

I think this is trivial. Not bash, sh or ash. Or http. The same things you need for a probe, or a post start hook, or pre stop hook. concepts we already have.

In short I want a way to tell the container about a reconfigure event and I don't want to teach all databases to handle a signal properly.

@bprashanth
Copy link
Contributor

And I'm trying not to end up with 2 overlapping concepts when you can achieve one with the other. Somehow I dont think people will be against a probe like thing instead of a straight signal.

@thockin
Copy link
Member Author

thockin commented Apr 30, 2016

It feels like a hack to me. I'd be fine with notifiers, and with notifiers
attached to configmap changes, and with notifiers that included "send a
signal". None of that obviates the utility of "i want to signal my pods"

On Fri, Apr 29, 2016 at 10:55 PM, Prashanth B notifications@github.com
wrote:

And I'm trying not to end up with 2 overlapping concepts when you can
achieve one with the other. Somehow I dont think people will be against a
probe like thing instead of a straight signal.


You are receiving this because you authored the thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@bprashanth
Copy link
Contributor

Hmm, my feature request also doesn't require a config map, it's the common case. Maybe a notifier of type=probe,exec,signal?

@smarterclayton
Copy link
Contributor

smarterclayton commented Apr 30, 2016 via email

@thockin
Copy link
Member Author

thockin commented Apr 30, 2016

I could see an argument for "restart" as a verb on pods (albeit with he
same issues around imperatives), distinct from signal, though I assumed
signalling a pod meant to signal all containers or a self-nominated signal
receiver container.
On Apr 30, 2016 10:03 AM, "Clayton Coleman" notifications@github.com
wrote:

Regarding signals and restarts, is kill the right signal, or do users
want to restart the pod itself? I fee like the granularity on signals
is processes (of which container is a relatively good proxy), but the
granularity on restart / exec actions is containers or pods.


You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@smarterclayton
Copy link
Contributor

smarterclayton commented Apr 30, 2016 via email

@bprashanth
Copy link
Contributor

I'm still kind of stuck at the point where we're special casing a specific form of linux IPC. I really want a cluster notification system, the last hop of which is some implementation of a node local ipc, i.e, say i have a pod with:

notification:
  exec: /on-change.sh
  stdin: ""

and I write "foo" to stdin, the kubelet will deliver that at least once, to the one container with the hook. I then observe generation number and know that my container has received echo foo | on-change.sh. Now say i want to send a signal, I'd just do:

notification:
  signal: posix number??

or

notification:
  exec: "sighup.sh"
  stdin: ""

And I still write "" to the stdin field, and poll on generation.

Is that a different feature request?

@smarterclayton
Copy link
Contributor

I kind of agree signal is a subset of "pod notification". The security
aspects of exec make it something I'd prefer to require the pod author to
define, in which case it's a named hook / notification.

On Sat, Apr 30, 2016 at 3:11 PM, Prashanth B notifications@github.com
wrote:

I'm still kind of stuck at the point where we're special casing a specific
form of linux IPC. I really want a cluster notification system, the last
hop of which is some implementation of a node local ipc, i.e, say i have a
pod with:

notification:
exec: /on-change.sh
stdin: ""

and I write "foo" to stdin, the kubelet will deliver that at least once,
to the one container with the hook. I then observe generation number and
know that my container has received echo foo | on-change.sh. Now say i
want to send a signal, I'd just do:

notification:
signal: posix number??

or

notification:
exec: "sighup.sh"
stdin: ""

And I still write "" to the stdin field, and poll on generation.

Is that a different feature request?


You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@smarterclayton
Copy link
Contributor

I would like to have a concrete way to force and guarantee a restart of a
pod from the outside without hacking. One option that is not signals is a
monotonic value on the pod that a user can update (essentially,
generation). That's more "whole pod" signaling.

On Sat, Apr 30, 2016 at 3:15 PM, Clayton Coleman ccoleman@redhat.com
wrote:

I kind of agree signal is a subset of "pod notification". The security
aspects of exec make it something I'd prefer to require the pod author to
define, in which case it's a named hook / notification.

On Sat, Apr 30, 2016 at 3:11 PM, Prashanth B notifications@github.com
wrote:

I'm still kind of stuck at the point where we're special casing a
specific form of linux IPC. I really want a cluster notification system,
the last hop of which is some implementation of a node local ipc, i.e, say
i have a pod with:

notification:
exec: /on-change.sh
stdin: ""

and I write "foo" to stdin, the kubelet will deliver that at least once,
to the one container with the hook. I then observe generation number and
know that my container has received echo foo | on-change.sh. Now say i
want to send a signal, I'd just do:

notification:
signal: posix number??

or

notification:
exec: "sighup.sh"
stdin: ""

And I still write "" to the stdin field, and poll on generation.

Is that a different feature request?


You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#24957 (comment)

@bprashanth
Copy link
Contributor

So all containers BUT pause need to restart? (otherwise why not just delete the pod?)

@eghobo
Copy link

eghobo commented May 10, 2016

it should be optional and probably support rollout restart, it's too dangerous to restart everything right a way.

@bprashanth
Copy link
Contributor

it should be optional and probably support rollout restart,

If it's too dangerous to restart containers in a pod at the same time, put them in different pods/services? Everything else in the system will treat the pod as a unit, so there's probably no avoiding this. Or are you asking for a dependency chain of restarts on containers in a pod?

@smarterclayton
Copy link
Contributor

Classic cases:

  • service registers itself on start with a system, that systems failed
  • server has caches that need to be flushed because of a failure elsewhere
  • server is flaking in a way that liveness probe doesn't/can't catch
  • debugging a pod on a particular node
  • database that needs a restart after a schema change

In all of these cases the invariant enforced by pod deletion is excessive.

@eghobo
Copy link

eghobo commented May 10, 2016

@bprashanth: for example we have replication controller which has N of replicas, if we are going to restart all of them at the same time (e.g. we rollout new config change) we will affect customer traffic.

@therc
Copy link
Member

therc commented May 12, 2016

More use cases: some software can be told to increase logging or debugging verbosity with SIGUSR1 and SIGUSR2, to rotate logs, open diagnostic ports, etc.

@pmorie
Copy link
Member

pmorie commented May 24, 2016

I also view this as notification. I would prefer to expose an intentional API instead of something in terms of POSIX signals. I would take two actions to start with:

  1. bump - signal the container somehow that a reload should happen of any config
  2. restart - what it says on the tin

The problems with an intentional API are:

  1. The ways of carrying out intent are not going to be universal across platforms; we need to define them for each platform we support; we can say that on Linux, bump means SIGHUP
  2. As others have commented, there will be some overlap with other areas like the use of SIGUSR1 and SIGUSR2 - where do we draw the line between rich APIs for specific platforms versus more abstract (but harder to implement universally) APIs?

@pmorie
Copy link
Member

pmorie commented May 24, 2016

Also, for the record, seems plausible that we would constrain the list of signals you could send in an API using PodSecurityPolicy.

@nhlfr
Copy link

nhlfr commented Jun 3, 2016

I have no strong preference about "intentional API vs POSIX" problem. The only proposition I'm opposed is using any king of exec or scripts as a workaround. Docker has docker kill --signal, rkt has the issue where the idea of supporting custom signals is considered rkt/rkt#1496. I'd prefer to go with these features.

@pmorie sorry for my ignorance, but what exact non-Linux platforms we have to care about in k8s? I had no luck with googling/greping that information.

@smarterclayton
Copy link
Contributor

smarterclayton commented Jun 3, 2016 via email

@bprashanth
Copy link
Contributor

For the record we should be able to exec scripts from sidecars for notification delivery with shared pidnamespaces, which is coming in docker 1.12 apparently (#1615)

@nhlfr
Copy link

nhlfr commented Jun 3, 2016

@smarterclayton OK, so in that case intentional API sounds better.

So, for now we're going to implement the bump and restart actions as

  1. API actions for pod
  2. Actions which may be triggered by event bus

?

Also, as far as I understand, this event bus is something which we have to implement? Sorry if I'm missing some already existing feature.

If I understand correctly the API part, then I'm eager to implement this.

@k8s-ci-robot k8s-ci-robot added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Jun 25, 2021
@zffocussss
Copy link

@alinbalutoiu You could use a sidecar container that watches the mounted configmap for changes and sends a signal to prometheus. That's the best/closest to native way I'm aware of.

in the pod spec: shareProcessNamespace: true

A bash script with while loop around inotify

#!/bin/bash
# requires these rpm packages on fedora/centos flavors:
# - procps-ng
# - inotify-tools
# or, requires these deb packages on debian/ubuntu flavors:
# - procps
# - inotify-tools

set -eu -o pipefail

WATCH_FILE="/configmap-mount/prometheus.config"
INOTIFYWAIT_OPTS="${INOTIFYWAIT_OPTS:-"-e modify -e delete -e delete_self"}"

main() {
  while true; do
    while [[ ! -f "$WATCH_FILE" ]]; do
      echo "Waiting for $WATCH_FILE to appear ..."
      sleep 1
    done

    echo "Waiting for '$WATCH_FILE' to be deleted or modified..."
    # shellcheck disable=SC2086
    if ! inotifywait $INOTIFYWAIT_OPTS "$WATCH_FILE"; then
      echo "WARNING: inotifywait exited with code $?"
    fi
    # small grace period before sending SIGHUP:
    sleep 1

    echo "sending SIGHUP to prometheus"
    if ! pkill -HUP prometheus; then
      echo "WARNING: 'pkill' exited with code $?"
    fi
  done
}
main "$@"

inotifywait will block until any changes to the $WATCH_FILE are detected.

You could also try my go-init-sentinel, which is a pid-1 compliant wrapper that allows for watching 1 or more files and sending signals to child processes when those files change. This does not require a sidecar container. https://github.com/joemiller/go-init-sentinel

it is a useful way for us to control process life inside the application container in the same pod.
May I know if there is a common way(maybe CRD, controller) to manage all the pods in a center container/pod?

@amalic
Copy link

amalic commented Oct 9, 2021

Came here because I was looking for a solution to have pods restarting when dependent secrets change without complicated scripts that will break in some not so distant future. I don't like the sidecar solution becasue in a microservice architecture this would requre tons of resources (energy) wasted just to watch for config or secret updates.

@tomkukral
Copy link
Contributor

Came here because I was looking for a solution to have pods restarting when dependent secrets change without complicated scripts that will break in some not so distant future. I don't like the sidecar solution becasue in a microservice architecture this would requre tons of resources (energy) wasted just to watch for config or secret updates.

The best solution for microservice architecture may be to teach the service to watch for secrets and restart without delegating this responsibility to infrastructure (Kubernetes), and/or call restart of other service via API. Depending on process signal is kind of old view for this task, use service's API instead.

@amalic
Copy link

amalic commented Nov 4, 2021

Came here because I was looking for a solution to have pods restarting when dependent secrets change without complicated scripts that will break in some not so distant future. I don't like the sidecar solution becasue in a microservice architecture this would requre tons of resources (energy) wasted just to watch for config or secret updates.

The best solution for microservice architecture may be to teach the service to watch for secrets and restart without delegating this responsibility to infrastructure (Kubernetes), and/or call restart of other service via API. Depending on process signal is kind of old view for this task, use service's API instead.

That's how it is being already done. But it's alwazs a custom implementation. It would save time, resources, and potential bughunting sessions when this could be delivered as an out of the box functionality.

@sftim
Copy link
Contributor

sftim commented Dec 21, 2022

You could now use an ephemeral container for this, I think, or from a sidecar, if a Pod uses shareProcessNamespace: true. Does that help enough that we could document doing that, and then close the issue?

@sftim
Copy link
Contributor

sftim commented Dec 21, 2022

I think if I were making any change here, it'd be to allow a Pod to define which containers can share a process namespace. I might want to use an ephemeral container to HUP the log sidecar, but keep the main app container isolated.

@adampl
Copy link

adampl commented Dec 21, 2022

Ephemeral container for such a simple thing as sending a signal seems like a overkill to me.

@amalic
Copy link

amalic commented Dec 21, 2022

Friendly reminder about a key etcd feature

  • Watch for changes --> Watch specific keys or directories for changes and react to changes in values

I think this is the exactly right way to handle this requirement.

@thockin
Copy link
Member Author

thockin commented Dec 21, 2022

kubernetes/enhancements#1977 is what I want . It's somewhat involved, and I would love to hear simplifications if anyone has them. Mostly it needs an owner.

@sftim
Copy link
Contributor

sftim commented Dec 22, 2022

@thockin - how about we close this and signpost anyone following this issue to look at kubernetes/enhancements#1977?

We could even mention this issue in the KEP itself for background reading, or under “alternatives considered”.

@thockin
Copy link
Member Author

thockin commented Dec 22, 2022 via email

@k8s-ci-robot
Copy link
Contributor

@thockin: Closing this issue.

In response to this:

SGTM - we don't really document the relationship between k/k issues and
KEPs, but I can't see we what we need both at the same time.

/close

On Thu, Dec 22, 2022, 12:54 AM Tim Bannister @.***>
wrote:

@thockin https://github.com/thockin - how about we close this and
signpost anyone following this issue to look at
kubernetes/enhancements#1977
kubernetes/enhancements#1977?

We could even mention this issue in the KEP itself for background reading,
or under “alternatives considered”.


Reply to this email directly, view it on GitHub
#24957 (comment),
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABKWAVBAUZBBE6GZI4GR77DWOQJMJANCNFSM4CCJ34CA
.
You are receiving this because you were mentioned.Message ID:
@.***>

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@amalic
Copy link

amalic commented Dec 27, 2022

@thockin - how about we close this and signpost anyone following this issue to look at kubernetes/enhancements#1977?

We could even mention this issue in the KEP itself for background reading, or under “alternatives considered”.

This is awful!

If you read this thread then you will realize that we asked for a way to automatically restart containers if config maps or secrets change. The idea is that the container running a specific workload is not aware of running in Kubernetes. Such a restart should happen without running an extra Sidecar or Opearator.

It's a simple requirement that would make day-2-day operations much easier. Not sure why this is so hard to understand!

@thockin
Copy link
Member Author

thockin commented Dec 27, 2022

This issue is specifically about sending a signal to pods from outside. Somewhere the idea of configmap changes got mixed in.

I don't have a KEP in my pocket for that topic specifically (kubernetes/enhancements#948 is maybe related?), but if someone wants to discuss a proposal for it, I am open to the topic.

@artheus
Copy link

artheus commented May 9, 2023

It would be quite an easy task to make a small service, that watches k8s events for configmaps, secrets etc. Whenever they change, it will just do a rollout restart of the services involved with that configmap/secret/whatever.
It would be a central service, that would not really take much resources at all.
I could write one, if anyone is interested.

@joemiller
Copy link

joemiller commented May 9, 2023

@artheus unless i am misunderstanding your proposal there is already a popular project that does just that - https://github.com/stakater/Reloader

@thockin thockin reopened this Jun 5, 2023
@k8s-ci-robot
Copy link
Contributor

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Jun 5, 2023
@thockin
Copy link
Member Author

thockin commented Jun 5, 2023

kubernetes/enhancements#1977 and kubernetes/enhancements#1995 propose a design for this but need an owner.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/api Indicates an issue on api area. kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/backlog Higher priority than priority/awaiting-more-evidence. sig/node Categorizes an issue or PR as relevant to SIG Node.
Projects
None yet
Development

No branches or pull requests