Many people sharing this essay arguing that "computational scale beats clever new ideas". It takes for granted backprop, better activation functions, better learning methods, conv nets, better regularization techniques, etc etc. In other words, it seems to ignore the clever ideashttps://twitter.com/gdb/status/1106329741785653248 …
-
-
There is, IMO, a good paper to be written following this up, carefully understanding the relationship between scale and clever ideas.
Show this threadThanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.