Skip to content

v1.9.0 release

Latest
Compare
Choose a tag to compare
@andreyvelich andreyvelich released this 28 Jan 15:58
· 10 commits to master since this release
6f74c7f

This is the Training Operator v1.9.0 release.

This release introduces a new JAXJob, enabling seamless distributed training with JAX.

Additionally, it adds the managedBy API to streamline the orchestration of training Jobs in multi-cluster environment using MultiKueue.

Breaking Changes

New Features

Distributed JAX

New Examples

Control Plane Updates

SDK Updates

Kubeflow Trainer V2

Bug Fixes

Misc