@karpathy @fchollet @sedielem @_mayfa @j26774 comparing char-rnn models, very different results for keras & lasagne!pic.twitter.com/dkynepPH9e
You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more
i totally agree. i need to understand where parameters/functions are missing in keras/lasagne vs tf/torch/chainer implementations
@sedielem fwiw, i noticed disagreement about rmsprop divisor. lasagne&keras use sqrt(a+1e-6). torch's sqrt(a)+1e-8 seems better?
not big enough to account for this bug, but it does make a consistent difference across a few tests.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.