1/ Higher Top 1 accuracy in image classification actually isn't a great indicator that a new vision model is going to work out well in practice in an image to image task (like DeOldify). I've learned this the hard way after getting excited about a new shiny model many times!
-
-
I wish. It’s usually trial & error, w/ different components, activations, layers, losses, etc. Just released a paper today on further results with SignalTrain architecture! https://twitter.com/drscotthawley/status/1271064318768005121?s=21 …. Which, in humilty, this paper exposes the model’s limitations.
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
I will say: skip connections + 1cycle learning rate schedule = at least a factor of 10 for lowering loss / decreasing training time. :-)
-
Yes agreed on those being crucial. Generally there's so few things that seem to -actually- matter in practice. Training regime on the pretrained backbone (going beyond ImageNet 1K, namely) would be another I'd say I feel confident o being important.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.