Tags: microsoft/Megatron-DeepSpeed

v2.4

Merge branch 'github-main' into 'main'

Bringing github main and internal main back together.

See merge request ADLR/megatron-lm!275

v2.3

Merge branch 'main' into github-main

v2.2

Cherry-pick fix from development repo.

v2.1

Use torch.cuda.synchronize() right after calling batch_isend_irecv() communication API

v2.0

Merge branch 'TOC-fix' into 'main'

TOC fix

See merge request ADLR/megatron-lm!208

v1.1

Merge branch 'readme_update' into 'main'

Update scaling numbers in README and other small tweaks.

See merge request ADLR/megatron-lm!130

v1.0

Updated with minor fix to l2 norm computation.

v0.2

Merge branch 'torchddp' into 'master'

Use DDP command line argument instead of source flag in pretrain_bert.py.

See merge request ADLR/megatron-lm!2

v0.1

initial commit