With backprop training I get a 2.3% error rate on MNIST with a 2-layer net of just 1000 hidden units, no convolution layers. Pretty good AFAIK.
You should be using cross entropy. Probability distributions are not a Euclidean space.
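A rough illustration of the cross-entropy point, as a minimal sketch in plain Python (nothing here comes from the thread; the function names and the example logits are made up). It compares the gradient with respect to the logits of a softmax output under cross entropy and under squared error, for a prediction that is confidently wrong:

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def one_hot(target, n):
    return [1.0 if i == target else 0.0 for i in range(n)]

def ce_grad_logits(probs, target):
    # Gradient of -log(p[target]) w.r.t. the logits: simply p - y.
    y = one_hot(target, len(probs))
    return [p - yi for p, yi in zip(probs, y)]

def se_grad_logits(probs, target):
    # Gradient of sum_i (p_i - y_i)^2 w.r.t. the logits, via the
    # softmax Jacobian: dL/dz_k = p_k * (d_k - sum_i d_i * p_i),
    # where d_i = 2 * (p_i - y_i).
    y = one_hot(target, len(probs))
    d = [2.0 * (p - yi) for p, yi in zip(probs, y)]
    dot = sum(di * pi for di, pi in zip(d, probs))
    return [pk * (dk - dot) for pk, dk in zip(probs, d)]

if __name__ == "__main__":
    # Hypothetical case: the net is very sure of class 0, the true class is 1.
    logits = [10.0, 0.0, 0.0]
    target = 1
    probs = softmax(logits)
    print("cross-entropy grad:", ce_grad_logits(probs, target))
    print("squared-error grad:", se_grad_logits(probs, target))
```

In this setup the cross-entropy gradient on the true class stays close to -1, while the squared-error gradient nearly vanishes because the softmax is saturated, which is one common argument for preferring cross entropy on classification outputs.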
For this problem, sure, but what about the random Redis application of discovering data from users or the like?
What are your targets?
Basically I want to pick something that kinda works for everything, even if it's not optimal...