Vijay Korthikanti
Research topics: Throughput (business), Memory footprint, CUDA, Schedule, Parallel computing
Papers: 3 · Citations: 22 · KQI: 0
Papers (3)
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model.
2022 · CoRR
Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zheng, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
Citations (0)
Reducing Activation Recomputation in Large Transformer Models.
2022 · CoRR
Vijay Korthikanti, Jared Casper, Sangkug Lym, Lawrence McAfee, Michael Andersch, Mohammad Shoeybi, Bryan Catanzaro
Citations (0)
Efficient Large-Scale Language Model Training on GPU Clusters
2021 · arXiv: Computation and Language
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Md. Mostofa Ali Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
Citations (22)