forked from kubeflow/examples
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Kaggle notebook to kfp pipeline (kubeflow#940)
* Create README.md * kaggle to kfp * Create README.md * Update README.md * Add files via upload * Update README.md * Update README.md * Update README.md * kaggle to kfp
- Loading branch information
1 parent
68477e3
commit 639f84a
Showing
4 changed files
with
765 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
# Objective | ||
Here we convert the https://www.kaggle.com/competitions/digit-recognizer code to kfp-pipeline | ||
The objective of this task is tois to correctly identify digits from a dataset of tens of thousands of handwritten images. | ||
|
||
# Testing environment | ||
| Name | version | | ||
| ------------- |:-------------:| | ||
| Kubeflow | v1 | | ||
| kfp | 1.8.11 | | ||
|
||
|
||
Kfp version used for testing can be installed as `pip install kfp==1.8.11` | ||
|
||
# Components used | ||
|
||
## kubeflow lightweight component method | ||
Here, a python function is created to carry out a certain task and the python function is passed inside kfp component method`create_component_from_func`. | ||
|
||
|
||
## Kubeflow pipelines | ||
Kubeflow pipelines connect each components according to how they were passed and creates a pipeline. The kfp `dsl.pipeline` method was used to create a pipeline function. The kkfp component method `InputPath` and `OutputPath` was used to pass data amongst component. | ||
|
||
Finally, the `create_run_from_pipeline_func` was used to submit pipeline directly from pipeline function | ||
|
||
## To create pipeline | ||
1. Navigate to `data` directory, download compressed kaggle data and put your `training.zip` and `test.zip` data in the data folder. | ||
2. Open your setup kubeflow cluster and create a notebook server and connect to it. | ||
3. Clone this repo and navigate to this directory | ||
4. run the kfp-digit-recognizer notebook from start to finish | ||
5. View run details immediately after submitting pipeline. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
# Objective | ||
Download compressed data from https://www.kaggle.com/competitions/digit-recognizer/data and store here | ||
|
||
replace download link with the repo link where the data is stored https://github-repo/data-dir/{file}.csv.zip?raw=true | ||
<p> | ||
<img src="https://github.com/josepholaide/examples/blob/master/digit_recognition/data/img1.PNG?raw=true" alt="kubeflow pipeline" width="850" height="250"/> | ||
</p> | ||
|
||
# Testing environment | ||
| Name | version | | ||
| ------------- |:-------------:| | ||
| Kubeflow | v1 | | ||
| kfp | 1.8.11 | | ||
|
||
|
||
Kfp version used for testing can be installed as `pip install kfp==1.8.11` |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Oops, something went wrong.