Gradient sparsification compression approach to reducing communication in distributed training
Shi-da CHEN, Qiang LIU, Liang HAN
Fig. 8 Comparison of scalability between various sparsification approaches and classic training approaches on V100 GPUs