Tags: microsoft/Megatron-DeepSpeed
Toggle v2.4's commit message
Merge branch 'github-main' into 'main'
Bringing github main and internal main back together.
See merge request ADLR/megatron-lm!275
Toggle v2.3's commit message
Merge branch 'main' into github-main
Toggle v2.2's commit message
Cherry-pick fix from development repo.
Toggle v2.1's commit message
Use torch.cuda.synchronize() right after calling batch_isend_irecv() …
…communication API
Toggle v2.0's commit message
Merge branch 'TOC-fix' into 'main'
TOC fix
See merge request ADLR/megatron-lm!208
Toggle v1.1's commit message
Merge branch 'readme_update' into 'main'
Update scaling numbers in README and other small tweaks.
See merge request ADLR/megatron-lm!130
Toggle v1.0's commit message
Updated with minor fix to l2 norm computation.
Toggle v0.2's commit message
Merge branch 'torchddp' into 'master'
Use DDP command line argument instead of source flag in pretrain_bert.py.
See merge request ADLR/megatron-lm!2
Toggle v0.1's commit message
You can’t perform that action at this time.