降低分布式训练通信的梯度稀疏压缩方法
陈世达,刘强,韩亮

Gradient sparsification compression approach to reducing communication in distributed training
Shi-da CHEN,Qiang LIU,Liang HAN
图 8 各种稀疏方法与经典训练方法在V100 GPU的扩展性对比
Fig.8 Comparison of scalability between various sparsification and classic training approaches on V100 GPU