Baidu's 'Ring Allreduce' Library Increases Machine Learning Efficiency Across Many GPU Nodes

Baidu has released its own implementation of the "ring allreduce" algorithm, which can make parallel training of neural networks across many GPUs significantly more efficient.
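
For readers unfamiliar with the technique, the sketch below simulates a generic ring allreduce (sum) inside a single Python process. It is not Baidu's code and does not use its library's API: the function name `ring_allreduce`, the list-of-lists representation of the workers, and the chunking scheme are all assumptions made for illustration. Each simulated worker passes one chunk of its buffer to its right-hand neighbor per step. After N-1 "scatter-reduce" steps every worker holds one fully summed chunk, and after N-1 further "allgather" steps every worker holds the complete sum.

```python
def ring_allreduce(worker_data):
    """Simulate a sum ring allreduce across workers in one process.

    worker_data: list of equal-length lists of numbers, one per simulated
    worker (hypothetical stand-ins for per-GPU gradient buffers).
    Returns a new list of lists; every worker ends with the element-wise sum.
    """
    n = len(worker_data)
    length = len(worker_data[0])
    if any(len(d) != length for d in worker_data):
        raise ValueError("all workers must hold buffers of the same length")
    if n == 1:
        return [list(worker_data[0])]

    # Split each buffer into n contiguous chunks (sizes may differ by one).
    bounds = [(c * length) // n for c in range(n + 1)]
    buffers = [list(d) for d in worker_data]

    def chunk(c):
        return slice(bounds[c], bounds[c + 1])

    # Phase 1: scatter-reduce. In each step, worker `rank` sends chunk
    # (rank - step) mod n to its right neighbor, which adds it to its own copy.
    for step in range(n - 1):
        # Snapshot all outgoing chunks first so the sends act simultaneously.
        outgoing = []
        for rank in range(n):
            c = (rank - step) % n
            outgoing.append((c, buffers[rank][chunk(c)]))
        for rank in range(n):
            dst = (rank + 1) % n
            c, payload = outgoing[rank]
            current = buffers[dst][chunk(c)]
            buffers[dst][chunk(c)] = [a + b for a, b in zip(current, payload)]

    # Worker `rank` now holds the fully summed chunk (rank + 1) mod n.
    # Phase 2: allgather. Completed chunks travel around the ring and
    # overwrite the receiver's partial values.
    for step in range(n - 1):
        outgoing = []
        for rank in range(n):
            c = (rank + 1 - step) % n
            outgoing.append((c, buffers[rank][chunk(c)]))
        for rank in range(n):
            dst = (rank + 1) % n
            c, payload = outgoing[rank]
            buffers[dst][chunk(c)] = payload

    return buffers


if __name__ == "__main__":
    grads = [[1, 2, 3, 4], [10, 20, 30, 40], [100, 200, 300, 400]]
    for rank, buf in enumerate(ring_allreduce(grads)):
        print(rank, buf)
```

With the three example workers above, every returned buffer is `[111, 222, 333, 444]`. In this scheme each worker sends only about 2(N-1)/N times its buffer size in total, however many workers join the ring, which is the usual explanation for why ring allreduce scales well. A real multi-GPU implementation would replace the in-process lists with actual device-to-device transfers.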

tomshardware.com