There's also the fact that gradient decent gets really weird when it comes to RNNs
-
-
@fchollet@kastnerkyle Pretty sure adam's strength is in learning embeddings - seen it work super well for that.Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.