Replying to @roydanroy
@ArnaudDoucet1 @kazizzad @jxbz We analyze biased gradients where the bias comes from 1-bit quantization. This can accelerate convergence early on, when the signal-to-noise ratio is large: https://arxiv.org/abs/1802.04434
10:16 PM - 9 Jun 2018
0 replies
0 retweets
8 likes