Gradient sparsification compression approach to reducing communication in distributed training |
| Shi-da CHEN, Qiang LIU, Liang HAN |
|
| Fig. 8 Comparison of scalability between various sparsification approaches and the classic training approach on V100 GPUs |
|