- A pre-existing "provisioning" node which can be used to run Ansible and the install scripts
- A cluster to deploy to (potentially a cluster or single server - or a virtual one)
- Pick a provisioning node to deploy from. This is where the Ansible scripts should be run from and is often a development laptop that has a connection to the target cluster. On this provisioning node, clone the DeepOps repository...
git clone https://github.com/NVIDIA/deepops.git
- Checkout a recent release tag. This is an optional step, but if not done, the latest development code will be used, not an official release.
cd deepops
git checkout tags/22.07
- Pick one of the Deployment Options mentioned in the main README, following the installation instructions.
To learn more about how to use this directory and customize your DeepOps cluster, see how to configure DeepOps.
Each of the deployment options in DeepOps is highly modular and does not need to be deployed as-is. It’s strongly encouraged to read through the installation scripts and examine the playbooks to see which components should be disabled or replaced with custom components. Otherwise, the defaults are an opinionated approach to deploying the cluster for that deployment option.
Optional components are included as scripts in the /scripts directory. They can also be used on an existing cluster that did not use DeepOps. For example, on an existing Kubernetes cluster, the /scripts/k8s/deploy_kubeflow.sh script can be launched to deploy Kubeflow.
Examples are included in the /examples directory. Each example starts with the cluster type, so /examples/k8s/dask-rapids requires a Kubernetes cluster.
The rest of the docs are in the /docs directory.