Web UI should support getting artifacts from local path #1497
Comments
This is something we need as well, to be able to visualize the results correctly via the KFP UI on MiniKF. Can we coordinate to include this in 0.6? |
@jinchihe when you refer to the local path for the artifacts, what is the path local to? Is this the local file system of the machine running Minikube, a local file system of the container creating the artifact, or something else? |
The case is running on an on-prem cluster, and the artifacts will be saved to the path where the PVC is mounted. |
How will the UI get the data? Will the UI pod mount the same PV to retrieve and render it? How does this work if the UI needs to render data for different pipelines that use different PVs? |
Also, @mameshini has some extensions to better abstract artifact storage that work with on-prem and cloud. We are looking into porting them over as part of KF Pipelines as the default artifact storage. Please see this thread for details. |
@jlewi @IronPan indeed this is not trivial, and these are two distinct and orthogonal problems mentioned in this thread:
[1] needs to be tackled in a generic way that will work for different use cases, one of which is the Artifacts tab of the KFP UI. We have discussed this internally and I think we can contribute a generic mechanism that all UIs will be able to use for v0.7. This is related to the Tensorboard issue as well. So, yes, let's not consider this blocking for v0.6, because it needs significant work; we will aim to solve it universally for v0.7. For [2], I went through #596. I don't think it is related to this issue; it's more of an infra problem: how one chooses to implement PVCs at the K8s level. If one has PVCs backed by Goofys, which is in turn backed by an object store, then Pipelines and every other component will work transparently. @IronPan can you comment on why this may be different from any other PVC provider? |
Thanks @cvenets, downgrading to P1 and moving to 0.7.0. |
Anyone planning on tackling this in Q3? |
Hello, is there any update (or ETA) on this issue? And is there a workaround to see local artifacts produced by a step? |
We could support this consistently:
|
Yes, it would address the need: a single Kubernetes volume can be mounted to the Frontend pod and used for all data storage and passing. Tensorboard data can also be saved on that volume. Currently only GCS buckets can be used to visualize Tensorboard data, which is a serious limitation for other cloud providers, on-prem, and local setups. |
Question: can multiple pods mount the same PV? PVs on AWS-backed k8s use EBS, and I don't think we can mount the same EBS volume on multiple pods. Does that mean we need to use a custom volume backed by NFS or some storage service? |
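For context, whether multiple pods can share a PV depends on the volume's access mode: EBS-backed volumes are ReadWriteOnce, so sharing across pods on different nodes does require an RWX-capable backend such as NFS. A minimal sketch of a shared claim (the claim and storage class names below are hypothetical and depend on the cluster's provisioner):

```yaml
# A PVC that pipeline steps and the UI pod could mount simultaneously.
# Requires a provisioner that supports ReadWriteMany (e.g. NFS);
# EBS-backed storage classes only support ReadWriteOnce.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: pipeline-artifacts   # hypothetical name
spec:
  accessModes:
    - ReadWriteMany
  storageClassName: nfs      # hypothetical; use your cluster's RWX class
  resources:
    requests:
      storage: 10Gi
```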
Just a side note, I got the Tensorboard viewer to work with S3 by exposing an env var in the frontend that sets a path to a custom podspec, which I configure by mounting a configmap. |
@eterna2 How can we use your PR #1906 to use Tensorboard on S3? I opened an issue a while ago here: kubeflow/kubeflow#3773. Thanks! |
Tensorboard supports S3 through boto3 under the hood. You will need to either pass in the AWS credentials via env variables or set the pod annotations with the appropriate IAM roles (if your cluster is running kube2iam or equivalent) for the tensorboard pod to access your S3 bucket. My PR exposes an env var for this. The podTemplateSpec is used by the viewer controller to create the tensorboard viewer pod. You can create a configmap with a JSON podTemplateSpec for the tensorboard viewer containing the AWS credentials or pod annotations, then mount the configmap to the path referenced by the env var. The schema for podTemplateSpec can be found in the k8s API reference; you can ignore the image and args fields, as these are injected by the viewer controller. https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.11/#podtemplatespec-v1-core See here for the env vars to configure the AWS credentials: https://docs.aws.amazon.com/cli/latest/userguide/cli-configure-envvars.html More on kube2iam here. Note also that changes to the spec are not retroactive: you will need to kill existing viewers and reload them to see pods with the updated podTemplateSpec. (A sketch of such a configmap is below.) |
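For illustration, a minimal sketch of such a configmap, assuming the viewer controller reads the podTemplateSpec from a file at a mounted path (the configmap name, data key, IAM role, and region below are all hypothetical; the actual env var name is defined in the PR):

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: tensorboard-viewer-template   # hypothetical name
  namespace: kubeflow
data:
  # PodTemplateSpec used as the base for the viewer pod; image and args
  # are injected by the viewer controller, so they are omitted here.
  podTemplateSpec.json: |
    {
      "metadata": {
        "annotations": {
          "iam.amazonaws.com/role": "tensorboard-s3-reader"
        }
      },
      "spec": {
        "containers": [
          {
            "env": [
              {"name": "AWS_REGION", "value": "us-west-2"}
            ]
          }
        ]
      }
    }
```

The configmap would then be mounted into the frontend pod at the path the env var points to.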
@eterna2 I am also using pod annotations to authenticate with AWS. I tried to patch the ml-pipeline-ui deployment and include the following patch:
But I am still getting
So the annotated pod is the one requesting it. Shouldn't this work? |
This is for tensorboard, not pipeline-ui, because minio-js does not support IAM roles. You need to wait for my PR #2081 to be merged before pod annotations will work for pipeline-ui. Meanwhile you can use a minio gateway to proxy to S3. |
https://github.com/minio/minio/blob/master/docs/gateway/s3.md A minio gateway is set up very similarly to the KF minio server. You just need to add the pod annotations and change the args (see the sketch below). |
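For illustration, a minimal sketch of such a gateway Deployment, modeled loosely on the stock KFP minio server (the names and the kube2iam role annotation below are hypothetical):

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: minio-gateway   # hypothetical name
  namespace: kubeflow
spec:
  replicas: 1
  selector:
    matchLabels:
      app: minio-gateway
  template:
    metadata:
      labels:
        app: minio-gateway
      annotations:
        iam.amazonaws.com/role: s3-reader   # hypothetical kube2iam role
    spec:
      containers:
        - name: minio
          image: minio/minio
          # "gateway s3" makes minio proxy the S3 API to AWS S3
          # instead of serving from local disk
          args: ["gateway", "s3"]
          ports:
            - containerPort: 9000
```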
Thanks! It works pretty well. But the Tensorboard viewer pods will live forever... Do you think it's possible to add a spec to allow them to live only some minutes? Or do we have to build a CronJob to delete these pods automatically every hour? |
Not very familiar with the long thread here, is there anything left that still needs a solution? |
Yes, it's still unsolved for AWS and on-prem, as far as I know. Presenting Tensorboard data to end users is a high-value user story. Tensorboard data has to be presented not only after a pipeline step's execution is done, but also during execution, so that a data scientist can monitor model training progress. Let's assume there is a single PVC mounted to all pipeline steps, as well as to the pipeline UI. Many Kubeflow users are already mounting data to pods using tools like goofys. The pipeline UI has to access Tensorboard data via a local path without assuming that Tensorboard data is always in a GCS bucket. Alternatively, if the pipeline UI can get artifact data from Minio/S3 buckets, that would be fine too. I will be able to get back to testing this in a week. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it. |
I think this is what usually happens. |
The requested feature is already supported in https://github.com/kubeflow/pipelines/blob/master/docs/config/volume-support.md. |
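As I read that doc, the gist is that the ml-pipeline-ui pod mounts the same volume the pipeline steps write to, so the UI can resolve artifact paths locally. A minimal sketch of such a patch on the UI deployment (the claim name, mount path, and container name below are assumptions and must match your install and your pipeline's mounts):

```yaml
# Strategic-merge patch for the ml-pipeline-ui Deployment,
# applied e.g. with kubectl patch. Mounts the artifact PVC so the UI
# can read artifacts from the same local path the steps wrote to.
spec:
  template:
    spec:
      containers:
        - name: ml-pipeline-ui         # container name may differ per install
          volumeMounts:
            - name: artifacts
              mountPath: /mnt          # must match the pipeline's mount path
      volumes:
        - name: artifacts
          persistentVolumeClaim:
            claimName: pipeline-artifacts   # hypothetical claim name
```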
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
We now support mounting PVs/PVCs in pipelines for on-premise clusters. In this case the artifact will be stored in the PVC mount path, so we should support getting artifacts from a local path, such as /mnt/a07b8215-8c1a-11e9-a2ff-525400ed33aa/tfx-taxi-cab-classification-pipeline-example-q7rtq-2449399348/data/roc.csv. Thanks.