
(WIP) MLOps on Vertex AI

This example implements the end-to-end MLOps process using the Vertex AI platform and Smart Analytics technology capabilities. The example uses Keras to implement the ML model, TFX to implement the training pipeline, and the Model Builder SDK to interact with Vertex AI.

(Figure: MLOps lifecycle)

Getting started

  1. Set up the MLOps environment on Google Cloud.

  2. Start your AI Notebook instance.

  3. Open JupyterLab, then open a new Terminal.

  4. Clone the repository to your AI Notebook instance:

    git clone https://github.com/ksalama/ucaip-labs.git
    cd ucaip-labs
    
  5. Install the required Python packages:

    pip install tfx==0.30.0 --user
    pip install -r requirements.txt --user
    
  6. Upgrade the gcloud components:

    sudo apt-get install google-cloud-sdk
    gcloud components update
    

Dataset Management

The Chicago Taxi Trips dataset is one of the public datasets hosted on BigQuery. It includes taxi trips from 2013 to the present, reported to the City of Chicago in its role as a regulatory agency. The task is to predict whether a given trip will result in a tip greater than 20%.

The 01-dataset-management notebook covers:

  1. Performing exploratory data analysis on the data in BigQuery.
  2. Creating a managed Vertex AI Dataset using the Python SDK (see the sketch after this list).
  3. Generating the schema for the raw data using TensorFlow Data Validation.
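The dataset-creation step uses the Vertex AI SDK for Python. A minimal sketch, assuming a BigQuery source table (the project, region, and table names below are hypothetical):

    from google.cloud import aiplatform

    # Point the SDK at your project and region (hypothetical values).
    aiplatform.init(project="your-project", location="us-central1")

    # Create a managed tabular Dataset backed by the BigQuery table.
    dataset = aiplatform.TabularDataset.create(
        display_name="chicago-taxi-tips",
        bq_source="bq://your-project.chicago_taxi.trips",
    )
    print(dataset.resource_name)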

ML Development

We experiment with creating a custom model using the 02-experimentation notebook, which covers:

  1. Preparing the data using Dataflow.
  2. Implementing a Keras classification model.
  3. Training the Keras model in Vertex AI using a pre-built container (sketched after this list).
  4. Uploading the exported model from Cloud Storage to Vertex AI as a Model.
  5. Extracting and visualizing experiment parameters from Vertex AI Metadata.
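A rough sketch of steps 3 and 4 using the Vertex AI SDK for Python; the script path, bucket, and container tags are assumptions, not the repository's actual values:

    from google.cloud import aiplatform

    aiplatform.init(
        project="your-project",
        location="us-central1",
        staging_bucket="gs://your-bucket",
    )

    # Step 3: run the Keras training script on Vertex AI in a
    # pre-built TensorFlow training container.
    job = aiplatform.CustomTrainingJob(
        display_name="taxi-tips-training",
        script_path="src/model_training/task.py",  # hypothetical entry point
        container_uri="us-docker.pkg.dev/vertex-ai/training/tf-cpu.2-4:latest",
    )
    job.run(replica_count=1, machine_type="n1-standard-4")

    # Step 4: upload the exported SavedModel from Cloud Storage as a Model.
    model = aiplatform.Model.upload(
        display_name="taxi-tips-classifier",
        artifact_uri="gs://your-bucket/models/taxi-tips",  # hypothetical path
        serving_container_image_uri="us-docker.pkg.dev/vertex-ai/prediction/tf2-cpu.2-4:latest",
    )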

We use Vertex TensorBoard and Vertex ML Metadata to track, visualize, and compare ML experiments.
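The SDK's experiment-tracking calls look roughly like this (the experiment, run, parameter, and metric names are illustrative):

    from google.cloud import aiplatform

    # Associate subsequent runs with a named experiment.
    aiplatform.init(
        project="your-project",
        location="us-central1",
        experiment="taxi-tips-experiments",
    )

    aiplatform.start_run("run-001")
    aiplatform.log_params({"learning_rate": 0.001, "hidden_units": 64})
    aiplatform.log_metrics({"val_accuracy": 0.87, "val_loss": 0.31})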

In addition, the training steps are formalized by implementing a TFX pipeline. The 03-training-formalization notebook covers implementing and testing the pipeline components interactively.

Training Operationalization

The end-to-end TFX training pipeline implementation is in the src/pipelines directory, which covers the following steps (a skeletal sketch follows the list):

  1. Receive hyperparameters using the hyperparam_gen custom Python component.
  2. Extract data from BigQuery using BigQueryExampleGen.
  3. Validate the raw data using StatisticsGen and ExampleValidator.
  4. Process the data using Transform.
  5. Train a custom model using Trainer.
  6. Evaluate and validate the custom model using ModelEvaluator.
  7. Save the blessed model to the model registry location using Pusher.
  8. Upload the model to Vertex AI using the aip_model_pusher custom Python component.
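A heavily abridged sketch of how such a pipeline is wired together in TFX; the schema location is hypothetical and most component arguments are elided (see src/pipelines for the real implementation):

    from tfx.components import ExampleValidator, ImporterNode, StatisticsGen
    from tfx.extensions.google_cloud_big_query.example_gen.component import (
        BigQueryExampleGen,
    )
    from tfx.orchestration import pipeline
    from tfx.types import standard_artifacts

    def create_pipeline(pipeline_name: str, pipeline_root: str, query: str):
        # Step 2: extract the training data from BigQuery.
        example_gen = BigQueryExampleGen(query=query)

        # Import the curated schema produced during dataset management.
        schema_importer = ImporterNode(
            source_uri="gs://your-bucket/schema",  # hypothetical location
            artifact_type=standard_artifacts.Schema,
        )

        # Step 3: validate the raw data against the schema.
        statistics_gen = StatisticsGen(examples=example_gen.outputs["examples"])
        example_validator = ExampleValidator(
            statistics=statistics_gen.outputs["statistics"],
            schema=schema_importer.outputs["result"],
        )

        # Transform, Trainer, evaluation, and the pushers are wired the
        # same way (omitted here for brevity).
        return pipeline.Pipeline(
            pipeline_name=pipeline_name,
            pipeline_root=pipeline_root,
            components=[example_gen, schema_importer, statistics_gen, example_validator],
        )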

The 04-pipeline-deployment notebook covers testing, compiling, and running the pipeline locally and using Vertex AI Pipelines.
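The compile-and-submit flow in that notebook looks roughly like this (file names, the query, and display names are illustrative):

    from google.cloud import aiplatform
    from tfx.orchestration.kubeflow.v2 import kubeflow_v2_dag_runner

    # Compile the TFX pipeline into a Vertex AI Pipelines job spec.
    runner = kubeflow_v2_dag_runner.KubeflowV2DagRunner(
        config=kubeflow_v2_dag_runner.KubeflowV2DagRunnerConfig(),
        output_filename="taxi-tips-pipeline.json",
    )
    runner.run(
        create_pipeline(  # from the sketch above
            pipeline_name="taxi-tips-training",
            pipeline_root="gs://your-bucket/pipeline-root",
            query="SELECT * FROM `your-project.chicago_taxi.trips`",  # illustrative
        )
    )

    # Submit the compiled spec to Vertex AI Pipelines.
    aiplatform.init(project="your-project", location="us-central1")
    job = aiplatform.PipelineJob(
        display_name="taxi-tips-training",
        template_path="taxi-tips-pipeline.json",
    )
    job.run()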

Continuous Training

After testing, compiling, and uploading the pipeline definition to Cloud Storage, the pipeline is executed in response to a trigger. We use Cloud Functions and Cloud Pub/Sub as the triggering mechanism.

The 05-continuous-training notebook covers the following steps:

  1. Create the Cloud Pub/Sub topic.
  2. Deploy the Cloud Function, which is implemented in src/pipeline_triggering (sketched after this list).
  3. Test triggering a pipeline.
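A minimal sketch of such a Cloud Function; the real one lives in src/pipeline_triggering, and the payload convention and paths here are assumptions:

    import base64
    import json

    from google.cloud import aiplatform

    def trigger_pipeline(event, context):
        """Background Cloud Function triggered by a Pub/Sub message."""
        # The message payload is assumed to carry pipeline parameter values.
        payload = json.loads(base64.b64decode(event["data"]).decode("utf-8"))

        aiplatform.init(project="your-project", location="us-central1")
        job = aiplatform.PipelineJob(
            display_name="taxi-tips-training",
            template_path="gs://your-bucket/pipelines/taxi-tips-pipeline.json",
            parameter_values=payload,
        )
        job.submit()  # returns immediately instead of blocking the function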

Model Deployment

We use Cloud Build to test and deploy the uploaded model to Vertex AI Prediction. The 06-model-deployment notebook configures and executes the build/model-deployment.yaml file with the following steps (the deployment calls are sketched after the list):

  1. Test the model interface.
  2. Create an endpoint in Vertex AI.
  3. Deploy the model to the endpoint.
  4. Test the endpoint.
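The endpoint-creation and deployment steps map to SDK calls roughly like these (display names and machine type are illustrative):

    from google.cloud import aiplatform

    aiplatform.init(project="your-project", location="us-central1")

    # Create the endpoint, then deploy the previously uploaded model to it.
    endpoint = aiplatform.Endpoint.create(display_name="taxi-tips-endpoint")
    model = aiplatform.Model.list(filter='display_name="taxi-tips-classifier"')[0]

    endpoint.deploy(
        model=model,
        machine_type="n1-standard-2",
        min_replica_count=1,
        max_replica_count=1,
        traffic_percentage=100,
    )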

Prediction Serving

We serve the deployed model for prediction. The 07-prediction-serving notebook covers the following (sketched after the list):

  1. Using the endpoint for online prediction.
  2. Using the uploaded model for batch prediction.
  3. Running the batch prediction job using Vertex AI Pipelines.
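Both serving modes in sketch form; the feature values and paths are illustrative, and real instances carry the full schema:

    from google.cloud import aiplatform

    aiplatform.init(project="your-project", location="us-central1")

    # Online prediction against the deployed endpoint.
    endpoint = aiplatform.Endpoint.list(filter='display_name="taxi-tips-endpoint"')[0]
    instance = {"trip_miles": 5.2, "trip_seconds": 900, "payment_type": "Credit Card"}
    print(endpoint.predict(instances=[instance]).predictions)

    # Batch prediction from the uploaded model.
    model = aiplatform.Model.list(filter='display_name="taxi-tips-classifier"')[0]
    batch_job = model.batch_predict(
        job_display_name="taxi-tips-batch",
        gcs_source="gs://your-bucket/batch/instances.jsonl",
        gcs_destination_prefix="gs://your-bucket/batch/results",
        machine_type="n1-standard-2",
    )
    batch_job.wait()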

Model Monitoring

After a model is deployed for prediction serving, continuous monitoring is set up to ensure that the model continues to perform as expected. The 08-model-monitoring notebook covers configuring Vertex AI Model Monitoring for skew and drift detection:

  1. Set skew and drift thresholds.
  2. Create a monitoring job for all the models under an endpoint (see the sketch after this list).
  3. List the monitoring jobs.
  4. List the artifacts produced by the monitoring job.
  5. Pause and delete the monitoring job.
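With recent versions of the google-cloud-aiplatform SDK, steps 1 and 2 can be expressed roughly as follows (thresholds, field names, and emails are illustrative; the notebook may use the lower-level API instead):

    from google.cloud import aiplatform
    from google.cloud.aiplatform import model_monitoring

    aiplatform.init(project="your-project", location="us-central1")
    endpoint = aiplatform.Endpoint.list(filter='display_name="taxi-tips-endpoint"')[0]

    # Skew compares serving data against the training data; drift compares
    # serving data against itself over time.
    objective = model_monitoring.ObjectiveConfig(
        skew_detection_config=model_monitoring.SkewDetectionConfig(
            data_source="bq://your-project.chicago_taxi.trips",
            skew_thresholds={"trip_miles": 0.3},  # illustrative threshold
            target_field="tip_bin",  # hypothetical label column
        ),
        drift_detection_config=model_monitoring.DriftDetectionConfig(
            drift_thresholds={"trip_miles": 0.3},
        ),
    )

    monitoring_job = aiplatform.ModelDeploymentMonitoringJob.create(
        display_name="taxi-tips-monitoring",
        endpoint=endpoint,
        objective_configs=objective,
        logging_sampling_strategy=model_monitoring.RandomSampleConfig(sample_rate=0.8),
        schedule_config=model_monitoring.ScheduleConfig(monitor_interval=1),  # hours
        alert_config=model_monitoring.EmailAlertConfig(user_emails=["you@example.com"]),
    )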

Metadata Tracking

You can view the parameters and metrics logged by your experiments, as well as the artifacts and metadata stored by your Vertex AI Pipelines in Cloud Console.
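The same information can also be pulled programmatically with the SDK (the experiment name is illustrative):

    from google.cloud import aiplatform

    aiplatform.init(project="your-project", location="us-central1")

    # Experiment parameters and metrics as a pandas DataFrame.
    df = aiplatform.get_experiment_df("taxi-tips-experiments")
    print(df.head())

    # Pipeline runs and their states.
    for job in aiplatform.PipelineJob.list():
        print(job.display_name, job.state)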

Disclaimer

This is not an official Google product but sample code provided for educational purposes.


Copyright 2021 Google LLC.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
