Training and serving large-scale neural networks with auto parallelization.
-
Updated
Dec 9, 2023 - Python
Training and serving large-scale neural networks with auto parallelization.
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
The ASPLOS 2025 / EuroSys 2025 Contest Track
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
Add a description, image, and links to the auto-parallelization topic page so that developers can more easily learn about it.
To associate your repository with the auto-parallelization topic, visit your repo's landing page and select "manage topics."