@fchollet neat. Citation? Experiences?
To be learnable, a problem has to be stable. Therefore in a stacked ML architecture, the lower levels should evolve at slower scale.
-
-
-
@stephenroller Personal exp. Try it: take a deep MLP, and test 1) constant learning rates, 2) higher LR for higher levels, 3) the reverse - Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.