
Image pull progress should be exposed #19077

Closed
stuartbassett opened this issue Dec 24, 2015 · 58 comments
Labels
area/kubectl, area/usability, lifecycle/rotten, priority/important-soon, sig/node

Comments

@stuartbassett

If a container is waiting for an image to be pulled before it can start, it would be nice to see the progress of that pull in kubectl, so that the user can know if they have time for another cup of coffee.

An API endpoint to give a progress update, possibly with a watch option, would be ideal.
It would also be helpful to include this in the container information in each pod.

For example, running kubectl describe pod/<pod> should return, in addition to all info currently returned, a field containing the % pulled (or number of bytes) of the image that each container uses.

Additionally, running kubectl pull-progress pod/<pod> should return a json encoded summary of each image being pulled in order to start a pod. This should also support a watch option, to notify the client of changes in the progress. There should be an equivalent HTTP API endpoint for this.

I'm interested in using this capability to provide loading bars on a UI.
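
For illustration, the JSON summary the requester describes might look something like this (the pull-progress subcommand and these field names do not exist; this is only a sketch of the request):

  {
    "pod": "default/my-pod",
    "images": [
      {
        "container": "app",
        "image": "registry.example.com/app:v1.2.3",
        "bytesPulled": 52428800,
        "bytesTotal": 209715200,
        "percentPulled": 25
      }
    ]
  }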

@maclof
Contributor

maclof commented Dec 26, 2015

I have a similar use case for this, so I would also like to see this added :)

@fgrzadkowski
Contributor

@kubernetes/goog-ux

@bgrant0607 added the area/kubectl, sig/node, and priority/important-soon labels on Jan 29, 2016
@bgrant0607
Member

See also #19695

cc @vishh

@smarterclayton
Contributor

Yeah, very common request. We've ended up adding heuristics like "if PodStatus is Pending and there are no containers, display the message 'Probably pulling, but we aren't sure the node is there'", etc.
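
To make that heuristic concrete, a rough client-side sketch might look like the following (illustrative only, using the standard k8s.io/api types; this is not code from kubectl or any particular UI):

  package podstatus

  import corev1 "k8s.io/api/core/v1"

  // probablyPulling guesses whether a Pending pod is still waiting on an image
  // pull: if the kubelet has not reported any container statuses yet, the pod
  // is most likely either unscheduled or still pulling images.
  func probablyPulling(pod *corev1.Pod) bool {
      return pod.Status.Phase == corev1.PodPending &&
          len(pod.Status.ContainerStatuses) == 0
  }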

@vishh
Contributor

vishh commented Feb 11, 2016

We generate events for this purpose. We currently have a pulling and a pulled event (https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/container/event.go#L29).

AFAIK, docker does not surface image pull progress. Did that change recently?


@smarterclayton
Contributor

Practically speaking, people use pod status to figure out what the pod is doing. The vast majority of the time between "user creates pod" and "user sees success" is going to be spent pulling. The fact that we have no discrete status indicating that appears to be a bug to people. The event is useful, but if we're updating the status anyway we should know whether we're pulling immediately or not when we write the pod status.


@vishh
Contributor

vishh commented Feb 11, 2016

By Status are you referring to the output of kubectl describe or PodStatus? If users have insight into when the pod was accepted by the kubelet and when it starts and completes (or fails) to pull an image, would that suffice?


@samsabed
Contributor

The requester seems to want a progress measure, e.g. 30% pulled, or x out of y bytes.

@smarterclayton
Contributor

smarterclayton commented Feb 11, 2016 via email

@stuartbassett
Author

#25032

@vishh
Contributor

vishh commented May 2, 2016

@smarterclayton

Right now there is no distinction between waiting for schedule and pull, which is a common state people get into.

The output of kubectl describe pod today includes events that provide the information you seek. Here is an example:

Events:
  FirstSeen LastSeen    Count   From                    SubobjectPath           Type        Reason      Message
  --------- --------    -----   ----                    -------------           --------    ------      -------
  8s        8s      1   {default-scheduler }                            Normal      Scheduled   Successfully assigned busybox-573201948-x4rg4 to kubernetes-minion-31zg
  8s        7s      2   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Pulling     pulling image "busybox"
  7s        7s      1   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Created     Created container with docker id 7ac36eac5dc5
  7s        7s      1   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Started     Started container with docker id 7ac36eac5dc5
  7s        6s      2   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Pulled      Successfully pulled image "busybox"
  6s        6s      1   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Created     Created container with docker id 8a1d39975f1c
  6s        6s      1   {kubelet kubernetes-minion-31zg}    spec.containers{busybox}    Normal      Started     Started container with docker id 8a1d39975f1c

We can possibly add more events that include the progress, if an image pull were to take longer than expected.

@stuartbassett

I don't see why #25032 is needed yet.

@smarterclayton
Contributor

smarterclayton commented May 2, 2016 via email

@vishh closed this as completed on May 2, 2016
@vishh reopened this on May 2, 2016
@vishh
Contributor

vishh commented May 2, 2016

Wouldn't UIs include events as well, at least the critical ones?

@smarterclayton
Contributor

smarterclayton commented May 2, 2016 via email

@Random-Liu
Member

Random-Liu commented May 3, 2016

@smarterclayton It is hard to update the image pulling progress in pod status for now, because pod status is updated before each SyncPod, while the whole image pulling process happens during SyncPod.

The plan for now is to periodically (maybe every 5 or 10 seconds) send an event reporting the current image pull progress, which will at least tell the user whether the pull is stuck.
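
A minimal sketch of that periodic-event idea, assuming a client-go EventRecorder and a hypothetical progress callback that returns the percent pulled (neither the callback nor this wiring exists in the kubelet; it only illustrates the plan):

  package images

  import (
      "time"

      corev1 "k8s.io/api/core/v1"
      "k8s.io/client-go/tools/record"
  )

  // emitPullProgress posts a Pulling event on every tick until done is closed,
  // so a watcher can tell whether the pull is advancing or stuck.
  func emitPullProgress(recorder record.EventRecorder, pod *corev1.Pod, image string,
      progress func() int, interval time.Duration, done <-chan struct{}) {
      ticker := time.NewTicker(interval)
      defer ticker.Stop()
      for {
          select {
          case <-done:
              return
          case <-ticker.C:
              recorder.Eventf(pod, corev1.EventTypeNormal, "Pulling",
                  "pulling image %q: %d%% complete", image, progress())
          }
      }
  }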

@smarterclayton
Contributor

Doesn't pulling then effectively block all sync progress for other containers in the same pod? Or does it merely block parallel startup of the containers?


@vishh
Contributor

vishh commented May 3, 2016

@smarterclayton Assuming that exposing image pull progress is mainly for human consumption, would the following work?

  • If an image is being pulled for a container, then update ContainerStateWaiting to contain a reason PullingImage and a message "progress: x%". The progress will be made available on a best-effort basis.
  • If a pod is in Pending phase because images are being pulled, reflect that in PodStatus.Reason and PodStatus.Message. Message could be an aggregate across all containers.

This would essentially push the burden of generating human friendly pod and container status to Kubelet.
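
As an illustration of that proposal (hypothetical output, not the current API), the resulting status might look roughly like:

  status:
    phase: Pending
    reason: PullingImage              # proposed reason, does not exist today
    message: "pulling 1 of 2 images, ~40% complete"
    containerStatuses:
    - name: app
      state:
        waiting:
          reason: PullingImage        # proposed
          message: "progress: 40%"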

@smarterclayton
Contributor

I don't even think progress is required; I'd be happy with a single reason. But I'll take the message and say that it'll help users even on a best-effort basis.


@smarterclayton
Contributor

Basically, yes, I think that would make 90% of clients better.


@vishh
Contributor

vishh commented May 3, 2016

@stuartbassett Will you be able to re-purpose your PR (#25032) to do what's mentioned in #19077 (comment) ? I can provide more specific design details, if you have difficulty parsing #19077 (comment)

@Random-Liu
Member

Random-Liu commented May 3, 2016

@smarterclayton Yeah, for now it will block the start of all other containers, but we definitely do not want that and should improve it in the future. :)

@nphmuller

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label on May 14, 2019
@amadav

amadav commented May 17, 2019

Do we know if there exists any better way of achieving this?

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label on Aug 15, 2019
@nomcopter

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label on Aug 15, 2019
@saschagrunert
Member

We could write up a KEP for this and pitch it in SIG Node. I’d be happy to drive this topic forward, but we should get at least 3 people on board. Who is in? 🙃

@bboreham
Contributor

@saschagrunert I am interested - what sort of commitment do you need?

@saschagrunert
Member

@saschagrunert I am interested - what sort of commitment do you need?

I've never written a KEP, but I'd be thrilled to write one. I would just need relevant input, review, and maybe implementation support. We could create a small working group in Slack if you want. :)

@smarterclayton
Contributor

I commented on the cri-o issue, but I think we could separate this into two parts:

  1. Progress for end users of kube (the issue this was raised for)
  2. Better administrative / operational insight into pulls

The former is definitely Kube, since it would have to be exposed via an API. The second, however, is likely to be fairly specific to the container runtime implementation and the storage that backs it. Given the improvements in monitoring since this issue was opened, and that most deployments likely have Prometheus or something like it watching their container runtimes, the second might be the best place to start: it makes a concrete first step for admins now, while also teaching us more about how we might expose progress. I do not think the former item was intended to solve the latter, and the latter is probably more broadly applicable, since the vast majority of clusters are single-team owned.

@bboreham
Contributor

FWIW my interest comes from working on tools layered on top of Kubernetes, where the rollout is initiated by something other than kubectl (for instance a git commit), and we'd like that tooling to be able to feed back status and/or issues without guessing.

Status needs to be tied to a specific update, since multiple overlapping updates can be issued.

I can't immediately see how Prometheus solves this requirement.

@saschagrunert
Member

I commented on the cri-o issue, but I think we could separate this into two parts:

1. Progress for end users of kube (the issue this was raised for)
2. Better administrative / operational insight into pulls

I'd be happy to push both topics forward. From the runtime perspective: if we have the data at hand, then we can expose it via any interface.

I see four major points, of which the first is out of scope here:

  1. Getting the data inside the runtime (out of scope from here)
  2. Getting the data into the kubelet via the CRI API (KEP)
  3. Exposing the data via the API Server (KEP)
  4. Exposing the data via the CLI (KEP)

Anything else?
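
As one possible shape for point 2, the data carried over the CRI could be as small as the following (a purely hypothetical Go sketch; the real fields and RPCs would be settled in the KEP):

  // ImagePullProgress is a hypothetical per-pull report a runtime could expose
  // to the kubelet through a CRI extension.
  type ImagePullProgress struct {
      Image       string // image reference being pulled
      BytesPulled int64  // bytes downloaded so far
      BytesTotal  int64  // total bytes, 0 if unknown
      StartedAt   int64  // unix timestamp when the pull started
  }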

@saschagrunert
Member

I wrote an email to the SIG Node mailing list regarding the topic and the plan:
https://groups.google.com/forum/#!topic/kubernetes-sig-node/JHEus_TlZzA

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label on Jan 21, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label on Feb 20, 2020
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@metametadata

metametadata commented Mar 21, 2020

Is there a ticket where one could vote for disabling fejta-bot?

It pollutes long-standing discussions and eventually closes important issues where, I suspect, people simply got tired of interacting with the bot. I'm certainly annoyed by the notifications from it, both as an author of and a participant in a few issues.

@turowicz

turowicz commented Mar 26, 2021

Is this ever going to be a thing?

@smarterclayton

@aminmr

aminmr commented Jul 12, 2023

Is there no progress on this most-wanted feature request?

@dims
Member

dims commented Aug 9, 2023

For those of you interested in this issue, please coordinate your interest into something actionable, which in our community is a KEP:

Please feel free to use community resources (the sig-node mailing list, the agenda of SIG Node meetings, Google Docs to seed discussion, etc.) to figure out what needs to be in the KEP and how the feature progresses through the community process. Some good info can be found in:

You should probably read one of the earlier sig-node KEPs, for example in:

Looking forward to folks stepping up to help with this! Thanks in advance.

@saschagrunert
Member

Closing the loop, there is a KEP: kubernetes/enhancements#3542
