Limit the potentially unbounded list of images in the node status #23355
Forked from #23323 (comment)

In short, the kubelet reports the list of images on the node as part of the node status. Image GC kicks in only when disk usage hits a threshold, so the list can grow very long with many small images.

@lavalamp mentioned that this could be a problem for the apiserver, and we should handle it somehow.

A naive solution (off the top of my head): the kubelet sorts the images by size and reports only the top N images (a rough sketch follows below).

/cc @davidopp
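A minimal sketch of that truncation in Go, assuming an illustrative `ImageInfo` type and a cap of 50; the type, field names, and cap value are placeholders, not the actual kubelet API:

```go
package main

import (
	"fmt"
	"sort"
)

// ImageInfo is a stand-in for the per-image entry the kubelet reports;
// the real API field names may differ.
type ImageInfo struct {
	Names     []string // tags/digests referring to this image
	SizeBytes int64
}

// maxImagesInNodeStatus is an assumed cap; the actual value is a design choice.
const maxImagesInNodeStatus = 50

// truncateImageList sorts images by size, largest first, and keeps at most
// maxImagesInNodeStatus entries so the node status stays bounded.
func truncateImageList(images []ImageInfo) []ImageInfo {
	sort.Slice(images, func(i, j int) bool {
		return images[i].SizeBytes > images[j].SizeBytes
	})
	if len(images) > maxImagesInNodeStatus {
		images = images[:maxImagesInNodeStatus]
	}
	return images
}

func main() {
	images := []ImageInfo{
		{Names: []string{"busybox:latest"}, SizeBytes: 1 << 20},
		{Names: []string{"nginx:1.9.1"}, SizeBytes: 180 << 20},
		{Names: []string{"myapp:v2"}, SizeBytes: 800 << 20},
	}
	for _, img := range truncateImageList(images) {
		fmt.Printf("%s: %d bytes\n", img.Names[0], img.SizeBytes)
	}
}
```

Keeping the largest images first seems like the right bias: a cache hit on a large image saves the most pull time, so those entries are the ones worth reporting to the scheduler.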
Comments

If the reason we put the images in the node status was just for the scheduler, we probably don't need it at all. The scheduler can just watch which pods get bound to which nodes, and keep the mapping from nodes to images in its memory. The kubelet would just need to somehow report when it GCs images so the scheduler can update its information. When the scheduler restarts it would lose all the information, but putting pods on nodes that already have the image installed is a best-effort feature anyway. If there was some additional reason we were reporting images that I have forgotten, then never mind.
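For illustration, a minimal sketch of such an in-memory tracker; the types, method names, and the image-GC callback are all hypothetical:

```go
package main

import "fmt"

// imageTracker keeps a best-effort map of node name -> set of images
// believed to be present on that node.
type imageTracker struct {
	nodeImages map[string]map[string]bool
}

func newImageTracker() *imageTracker {
	return &imageTracker{nodeImages: map[string]map[string]bool{}}
}

// onPodBound records the pod's images against the node it was bound to.
// In a real scheduler this would be driven by watching pod binding events.
func (t *imageTracker) onPodBound(nodeName string, images []string) {
	set, ok := t.nodeImages[nodeName]
	if !ok {
		set = map[string]bool{}
		t.nodeImages[nodeName] = set
	}
	for _, img := range images {
		set[img] = true
	}
}

// onImageGCed would be driven by a (hypothetical) kubelet report that an
// image was garbage-collected, keeping the map from going stale.
func (t *imageTracker) onImageGCed(nodeName, image string) {
	delete(t.nodeImages[nodeName], image)
}

// hasImage answers the best-effort question the scheduler cares about.
func (t *imageTracker) hasImage(nodeName, image string) bool {
	return t.nodeImages[nodeName][image]
}

func main() {
	t := newImageTracker()
	t.onPodBound("node-1", []string{"nginx:1.9.1"})
	fmt.Println(t.hasImage("node-1", "nginx:1.9.1")) // true
	t.onImageGCed("node-1", "nginx:1.9.1")
	fmt.Println(t.hasImage("node-1", "nginx:1.9.1")) // false
}
```

After a scheduler restart the map simply starts empty, so every lookup degrades to "no", which matches the best-effort framing above.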
I believe it was added for image-affinity scheduling, and only that.
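For context, image-affinity scheduling just means scoring nodes higher when they already hold the pod's images. A toy version, weighting by image size so large pre-pulled images dominate (not the scheduler's actual priority function):

```go
package main

import "fmt"

// imageAffinityScore favors nodes that already have the images a pod
// needs. nodeImageSizes maps image name -> size in bytes for one node,
// however that list was obtained (node status or an in-memory map).
func imageAffinityScore(podImages []string, nodeImageSizes map[string]int64) int64 {
	var score int64
	for _, img := range podImages {
		score += nodeImageSizes[img] // contributes 0 if the image is absent
	}
	return score
}

func main() {
	node := map[string]int64{"myapp:v2": 800 << 20}
	fmt.Println(imageAffinityScore([]string{"myapp:v2", "busybox:latest"}, node))
}
```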
This'd make things more complicated than necessary. If we're aiming for best-effort only, the scheduler can construct the list of images by looking at only running pods (which may be too limited?).
@davidopp putting pods on nodes with images installed is the only use case so far. But some users want that feature to be more than best-effort, since they pre-cache images on the node to speed up container startup.
The suggestion in the first entry in this issue (sort and drop the smaller images from the report) also makes the feature best-effort.
Yes, I guess one argument would be that users usually pre-pull only if the image size is significant, and the scheduler would disregard this "optimization" by looking at pod mappings only. BTW, it was just a random idea I threw out...
Fixed by #25328 |