How far can we get with one GPU in 100 hours? CoAStaL at MultiIndicMT Shared Task.

Rahul Aralikatte,Héctor Ricardo Murrieta Bello,Daniel Hershcovich,Marcel Bollmann,Anders Søgaard

How far can we get with one GPU in 100 hours? CoAStaL at MultiIndicMT Shared Task.

2021

Rahul Aralikatte
Héctor Ricardo Murrieta Bello
Daniel Hershcovich
Marcel Bollmann
Anders Søgaard

This work shows that competitive translation results can be obtained in a constrained setting by incorporating the latest advances in memory and compute optimization. We train and evaluate large multilingual translation models using a single GPU for a maximum of 100 hours and get within 4-5 BLEU points of the top submission on the leaderboard. We also benchmark standard baselines on the PMI corpus and re-discover well-known shortcomings of translation systems and metrics.

Keywords:

Benchmark (computing)
BLEU
Computer science
Artificial intelligence
Machine learning
task
Translation (geometry)
Work (electrical)

Correction
Source
Cite
Save

References

Citations