Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model.
2022