
Secure node -> master communication #3168

Closed · davidopp opened this issue Dec 29, 2014 · 26 comments
Labels: area/nodecontroller, area/security, priority/important-soon, sig/cluster-lifecycle

@davidopp (Member)

In some hosting environments/configurations, the network traffic between node and master may traverse the public Internet. As a result, we'd like to secure the communication between the node components (e.g. kubelet and proxy) and the master. To avoid the complexity of securing the kubelet API, we'd like to secure the node -> master communication, but not the reverse. This simplification has a downside: it means all communication between the kubelet and the master would have to be initiated by the kubelet. For example, we'd have to change health checks to be initiated by the kubelet, which in turn raises the question of how to do flow control (e.g., how the master applies backpressure when it becomes overloaded).
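
A minimal sketch of what kubelet-initiated reporting with backpressure could look like; the `/nodeStatus` path, payload, and intervals here are hypothetical illustrations, not the actual kubelet API:

```go
// Hypothetical sketch only: the /nodeStatus path, payload, and intervals
// are illustrative, not the real kubelet API.
package node

import (
	"bytes"
	"log"
	"net/http"
	"time"
)

// backoff doubles the reporting interval, capped at five minutes.
func backoff(d time.Duration) time.Duration {
	if d *= 2; d > 5*time.Minute {
		return 5 * time.Minute
	}
	return d
}

// reportStatus pushes node status to the master and slows down whenever the
// master signals overload with 429 Too Many Requests.
func reportStatus(masterURL string, payload []byte) {
	const nominal = 10 * time.Second
	interval := nominal
	for {
		resp, err := http.Post(masterURL+"/nodeStatus", "application/json", bytes.NewReader(payload))
		switch {
		case err != nil:
			log.Printf("status report failed: %v", err)
			interval = backoff(interval)
		case resp.StatusCode == http.StatusTooManyRequests:
			resp.Body.Close()
			interval = backoff(interval) // master is shedding load
		default:
			resp.Body.Close()
			interval = nominal // healthy path: back to the nominal period
		}
		time.Sleep(interval)
	}
}
```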

@davidopp (Member, Author)

As part of this, we should harden the master against properly authenticated but malformed requests (in other words, make sure the master gracefully handles bugs in the node components).

@erictune (Member)

To ensure we handle malformed requests, we should have a fuzz test that runs against the master API, including the endpoints used by the kubelets.
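
As an illustration, such a test could hammer the relevant endpoints with random bodies and assert the server never answers 5xx or drops the connection; the endpoint paths and server address below are placeholders:

```go
package apiserver_test

import (
	"bytes"
	"math/rand"
	"net/http"
	"testing"
)

func TestMalformedRequests(t *testing.T) {
	// Placeholder paths; a real test would enumerate the endpoints the
	// kubelets actually hit.
	endpoints := []string{"/api/v1beta1/pods", "/api/v1beta1/minions"}
	for i := 0; i < 1000; i++ {
		body := make([]byte, rand.Intn(4096))
		rand.Read(body)
		for _, ep := range endpoints {
			resp, err := http.Post("http://127.0.0.1:8080"+ep, "application/json", bytes.NewReader(body))
			if err != nil {
				t.Fatalf("connection error on %s: %v", ep, err)
			}
			resp.Body.Close()
			if resp.StatusCode >= 500 {
				t.Errorf("%s returned %d for a malformed body", ep, resp.StatusCode)
			}
		}
	}
}
```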

To handle a software fault where one or more kubelets are sending requests
at a fairly high rate, we should implement per-source-IP rate limits. This
should be easy to implement.
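
One way to sketch that limit is as middleware in front of the master API, using golang.org/x/time/rate; the limits below are arbitrary and eviction of idle entries is left out for brevity:

```go
// Sketch of a per-source-IP rate limit in front of the master API.
package middleware

import (
	"net"
	"net/http"
	"sync"

	"golang.org/x/time/rate"
)

type ipLimiter struct {
	mu       sync.Mutex
	limiters map[string]*rate.Limiter
}

func (l *ipLimiter) get(ip string) *rate.Limiter {
	l.mu.Lock()
	defer l.mu.Unlock()
	lim, ok := l.limiters[ip]
	if !ok {
		lim = rate.NewLimiter(rate.Limit(10), 20) // 10 req/s, burst 20, per source IP
		l.limiters[ip] = lim
	}
	return lim
}

// rateLimit wraps an API handler and answers 429 once a source IP exceeds
// its allowance, which also gives kubelets a clean backpressure signal.
func rateLimit(next http.Handler) http.Handler {
	l := &ipLimiter{limiters: map[string]*rate.Limiter{}}
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		ip, _, _ := net.SplitHostPort(r.RemoteAddr)
		if !l.get(ip).Allow() {
			http.Error(w, "rate limit exceeded", http.StatusTooManyRequests)
			return
		}
		next.ServeHTTP(w, r)
	})
}
```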

Defending against very high aggregate request rates (where the master burns lots of CPU just refusing connection requests) is harder, and I think we should leave it out of scope for now.


@erictune (Member)

This is closely related to #2483 and to discussion in pull #846. I'll leave it up to @davidopp to decide whether this issue is a duplicate or not.

@alex-mohr (Contributor)

I'd like to leave this open as specifically tracking secure master <-> kubelet communication and farm out other independent parts to separate issues (fuzz testing, (D)DoS prevention, and the wide variety of issues in #2483).

@alex-mohr alex-mohr assigned roberthbailey and unassigned j3ffml Feb 4, 2015
@alex-mohr alex-mohr added priority/important-soon and sig/cluster-lifecycle labels and removed priority/backlog label Feb 4, 2015
@bgrant0607 (Member)

#156 is a proposal to make apiserver and controllers not contact kubelet. I am now in favor of that proposal.

@j3ffml (Contributor) commented Feb 13, 2015

I think we still need a secure channel from apiserver to kubelet to do things like exec in a container and stream container logs.

@bgrant0607 (Member)

Proxy/bastion cases. Fair enough.

@derekwaynecarr (Member)

/cc @liggitt @deads2k

@erictune (Member)

Let's continue discussion of "exec in container and stream container logs" on #156

@roberthbailey (Contributor)

@liggitt @a-robinson

Here is my plan to get us to "Static Clustering" as defined in clustering.md:

  1. Configure the kubelet to use HTTPS
    • Generate a self-signed certificate on the kubelet and bind to the port using TLS (a minimal sketch follows this list).
    • The master will connect over HTTPS without verifying the certificate.
  2. Modify the kubelet to register itself with the master
    • For GCE/GKE, the master location is currently passed as the name of the master VM which is resolved into the internal IP of the master. For non-salt based platforms this is currently TBD until I investigate further.
    • The kubelet will pass its public key and location to the master, which will be stored persistently in etcd. The master can either use the current means to accept the kubelet into the cluster or implement the "insecure-always-approve" policy (as it will not yet be able to verify the certificate provided by the kubelet).
    • The master will now connect over HTTPS and verify that the certificate for the kubelet has the correct public key. Note that the kubelet will still not verify the master.
  3. Distribute certs to the master/nodes during cluster creation
    • For GCE/GKE this can be through the GCE metadata server or using gcloud ssh. For other providers this can be via ssh or another side channel.
    • For GCE we will use a single “cluster” certificate for all of the nodes instead of generating a separate certificate for each node to support managed instance groups.
    • Instead of generating self signed certificates, the certificates on the master and nodes will now be signed by the same certificate authority.
    • When the kubelet registers with the master, the master can verify that both the certificate provided during registration and the client certificate in the TLS handshake are signed by a known CA (the same one that signed the master certificate). The kubelet can verify that the server certificate provided by the master was signed by a known CA.
    • When the master connects to the kubelet over HTTPS the kubelet can verify that the client certificate provided by the master is signed by a known CA (the same one that signed the kubelet certificate) and the master can verify that the server certificate provided by the kubelet was signed by a known CA.
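
For step 1 above, a minimal sketch of generating a self-signed certificate in-process and binding the kubelet's port with TLS; key type, lifetime, and subject are arbitrary choices for illustration:

```go
// Minimal sketch of step 1, assuming the kubelet's standard port.
package main

import (
	"crypto/ecdsa"
	"crypto/elliptic"
	"crypto/rand"
	"crypto/tls"
	"crypto/x509"
	"crypto/x509/pkix"
	"log"
	"math/big"
	"net/http"
	"time"
)

func main() {
	key, err := ecdsa.GenerateKey(elliptic.P256(), rand.Reader)
	if err != nil {
		log.Fatal(err)
	}
	tmpl := x509.Certificate{
		SerialNumber: big.NewInt(1),
		Subject:      pkix.Name{CommonName: "kubelet"},
		NotBefore:    time.Now(),
		NotAfter:     time.Now().Add(365 * 24 * time.Hour),
		KeyUsage:     x509.KeyUsageDigitalSignature | x509.KeyUsageKeyEncipherment,
		ExtKeyUsage:  []x509.ExtKeyUsage{x509.ExtKeyUsageServerAuth},
	}
	// Self-signed: the template is both subject and issuer.
	der, err := x509.CreateCertificate(rand.Reader, &tmpl, &tmpl, &key.PublicKey, key)
	if err != nil {
		log.Fatal(err)
	}
	srv := &http.Server{
		Addr: ":10250", // kubelet port
		TLSConfig: &tls.Config{
			Certificates: []tls.Certificate{{
				Certificate: [][]byte{der},
				PrivateKey:  key,
			}},
		},
	}
	// Empty file args: the cert comes from TLSConfig above.
	log.Fatal(srv.ListenAndServeTLS("", ""))
}
```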

@zmerlynn zmerlynn added this to the v1.0 milestone Feb 26, 2015
@smarterclayton (Contributor)

@liggitt I think this is the best place to describe the things we were talking about w.r.t. securing the kubelet.

@liggitt (Member) commented Mar 5, 2015

KubeletConfig{}/NewMainKubelet() already take a client.Client used to call the master API. That already provides a way to give the kubelet the master CA and credentials to use against the API (cert, token, etc).

I'd like to start by plumbing TLS, server cert, and server key options from the KubeletConfig down to the server start. First stab at that is here: #5104

Note that there are still places that assume http and 10250 (like minion.ResourceLocation) that need to know the particular scheme and port for a given node. I think that means the node API object should probably contain that info in addition to the hostname.
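
Roughly the shape of that plumbing, with illustrative field and flag names (not necessarily those used in #5104):

```go
// Illustrative plumbing only; field and flag names are guesses.
package main

import (
	"flag"
	"log"
	"net/http"
)

// KubeletServerConfig carries the TLS options from flag parsing down to the
// server start.
type KubeletServerConfig struct {
	Address       string
	EnableTLS     bool
	TLSCertFile   string // server certificate (self-signed for now)
	TLSPrivateKey string // matching private key
}

func (c *KubeletServerConfig) ListenAndServe(h http.Handler) error {
	s := &http.Server{Addr: c.Address, Handler: h}
	if c.EnableTLS {
		return s.ListenAndServeTLS(c.TLSCertFile, c.TLSPrivateKey)
	}
	return s.ListenAndServe()
}

func main() {
	cfg := &KubeletServerConfig{}
	flag.StringVar(&cfg.Address, "address", ":10250", "address to listen on")
	flag.BoolVar(&cfg.EnableTLS, "tls", false, "serve over HTTPS")
	flag.StringVar(&cfg.TLSCertFile, "tls-cert-file", "", "path to the server certificate")
	flag.StringVar(&cfg.TLSPrivateKey, "tls-private-key-file", "", "path to the server key")
	flag.Parse()
	log.Fatal(cfg.ListenAndServe(http.NotFoundHandler()))
}
```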

@liggitt (Member) commented Mar 5, 2015

Before doing "the kubelet will pass its public key and location to the master", could we update the Node API object to keep track of the location (scheme, host, port)? That seems like a much smaller change that would be a prerequisite for what is described in #3168 (comment), but it would enable manually registering nodes using https or alternate ports (and would also let us fix minion.ResourceLocation).
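
A hypothetical shape for that addition to the Node object; these field names are illustrative only, not any released API version:

```go
// Hypothetical field names for illustration.
package api

// KubeletEndpoint records how the master should reach a node's kubelet, so
// callers like minion.ResourceLocation can stop assuming http and 10250.
type KubeletEndpoint struct {
	Scheme string `json:"scheme"` // "http" or "https"
	Host   string `json:"host"`   // hostname or IP
	Port   int    `json:"port"`   // e.g. 10250
}

// Node is trimmed to the fields relevant to this discussion.
type Node struct {
	Name     string          `json:"name"`
	Endpoint KubeletEndpoint `json:"endpoint"`
}
```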

@roberthbailey (Contributor)

@liggitt I'm finally getting some time to work on this issue, and it seems like you've gotten a bit of a start for me. Thanks!

The list items above were a bit hand-wavy and weren't meant to represent consecutive PRs, but rather the general plan forward, knowing that it will need to be tweaked as I dig into the details of each step. For your specific question: when the node registers with the master, we can certainly store a bit of extra information about how to contact it, including the port. (I hadn't considered storing the scheme, since I'd assumed everything would need to move to https, but we can discuss that when we get there.)

@alex-mohr alex-mohr assigned cjcullen and unassigned roberthbailey Jun 3, 2015
@alex-mohr alex-mohr added priority/critical-urgent label and removed priority/important-soon label Jun 3, 2015
@alex-mohr (Contributor)

Update: @cjcullen is actively driving the work to secure the master -> kubelet communication for the proxy -- CJ, can you please mention this issue for the relevant PRs so we can track them as they land? (And thanks also to @brendandburns for helping out!)

@timothysc (Member)

cc @rrati

@cjcullen (Member) commented Jun 9, 2015

SSH proxy code is in. The only remaining question is whether to leave 10250 open to the public internet (in which case we'd still need to secure it) or close it off and only listen inside the cluster.
@a-robinson @vishh
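
For reference, the SSH-tunnel approach amounts to something like the following on the master side: dial the node over SSH and forward each kubelet request through the tunnel, so 10250 never has to be exposed publicly. This is an illustrative sketch, not the actual proxy code; auth setup is elided:

```go
// Illustrative only: auth setup is elided and names are made up.
package tunnel

import (
	"net"
	"net/http"

	"golang.org/x/crypto/ssh"
)

// kubeletClient returns an HTTP client whose connections ride an SSH tunnel
// to the node, reaching a kubelet that only listens inside the cluster.
func kubeletClient(nodeAddr string, sshCfg *ssh.ClientConfig) (*http.Client, error) {
	conn, err := ssh.Dial("tcp", net.JoinHostPort(nodeAddr, "22"), sshCfg)
	if err != nil {
		return nil, err
	}
	transport := &http.Transport{
		Dial: func(network, addr string) (net.Conn, error) {
			// Ignore addr: every request is forwarded through the SSH
			// connection to the kubelet on the node's loopback interface.
			return conn.Dial("tcp", "127.0.0.1:10250")
		},
	}
	return &http.Client{Transport: transport}, nil
}
```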

@roberthbailey (Contributor)

We've decided not to leave 10250 open to the internet, so I'm bumping this to the v1.0-post milestone (since we still want to add better security to the kubelet's http endpoint, just not for 1.0).

@roberthbailey roberthbailey modified the milestones: v1.0-post, v1.0 Jun 9, 2015
@roberthbailey roberthbailey added priority/important-soon label and removed priority/critical-urgent label Jun 23, 2015
@bgrant0607 bgrant0607 removed this from the v1.0-post milestone Jul 24, 2015
@wingedkiwi

I'm trying to follow the status of this issue. Is the master -> kubelet connection still insecure?

@roberthbailey (Contributor)

The master -> kubelet connection is not secure enough to be run across the internet. The master currently only connects to the kubelet for proxying user requests that need to be forwarded to the kubelet (e.g. hitting the /api/v1/proxy/ endpoint or using kubectl exec / kubectl logs). In most cases, this is done over a local secure network. For GKE, we use ssh tunnels to securely put packets onto the cluster's network without exposing the kubelet's web server to the internet.

Remaining work: the kubelet needs to serve its https endpoint with a certificate signed by the cluster CA. Right now it uses a self-signed cert for its web server, even though it uses a client certificate signed by the CA as its credential to authenticate to the master. The master needs a client certificate signed by the cluster CA to present to the kubelet when connecting.
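
A sketch of what that end state could look like on both sides, assuming the CA bundle lands at a placeholder path like /var/lib/kubelet/ca.crt:

```go
// Sketch only; the CA bundle path and ports are placeholders for however
// the cluster CA material actually gets distributed.
package kubelettls

import (
	"crypto/tls"
	"crypto/x509"
	"net/http"
	"os"
)

// newServer configures the kubelet's HTTPS endpoint to demand a client
// certificate signed by the cluster CA. The CA-signed serving cert and key
// are supplied later via ListenAndServeTLS(certFile, keyFile).
func newServer(handler http.Handler) (*http.Server, error) {
	caPEM, err := os.ReadFile("/var/lib/kubelet/ca.crt") // placeholder path
	if err != nil {
		return nil, err
	}
	pool := x509.NewCertPool()
	pool.AppendCertsFromPEM(caPEM)
	return &http.Server{
		Addr:    ":10250",
		Handler: handler,
		TLSConfig: &tls.Config{
			ClientCAs:  pool,
			ClientAuth: tls.RequireAndVerifyClientCert, // reject masters without a CA-signed cert
		},
	}, nil
}

// masterTransport is the mirror image on the master: trust the cluster CA
// for the kubelet's serving cert and present a CA-signed client cert.
func masterTransport(caPool *x509.CertPool, clientCert tls.Certificate) *http.Transport {
	return &http.Transport{
		TLSClientConfig: &tls.Config{
			RootCAs:      caPool,
			Certificates: []tls.Certificate{clientCert},
		},
	}
}
```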

We are also looking at moving the proxying functionality out of the master, which would entirely remove the need for the master to connect to the kubelet, making the work to secure the kubelet unnecessary. I'm not sure whether that will land first or if it's still worth trying to secure the master -> kubelet communications.

@smarterclayton (Contributor)

The proxying function still has to go somewhere, so I would assume that has to be secured? :) Is there an issue for the move out?


@roberthbailey (Contributor)

There's been quite a bit of discussion internally (not necessarily conclusive one way or the other), but I don't know if there's an issue open... looking.

@roberthbailey (Contributor)

Looks like there are (at least) two: #10209 and #3481.

@roberthbailey (Contributor)

I've created #11816 to discuss securing the master -> node communication, so I'm going to close this issue in favor of that one.
