1/ Higher Top 1 accuracy in image classification actually isn't a great indicator that a new vision model is going to work out well in practice in an image to image task (like DeOldify). I've learned this the hard way after getting excited about a new shiny model many times!
-
Show this thread
-
2/ EfficientNets come to mind- I just have not been seeing the benefits relative to the classification performance. But this really applies generally, even within a set of architectures that have been trained the same way (FaceBook's wsl models, BiTM, etc).
1 reply 0 retweets 8 likesShow this thread -
3/ In those cases, you just don't really know until you actually try. You may be surprised, I can tell you that much! So don't just pick the one with the highest Top-1 accuracy in ImageNet...
5 replies 1 retweet 11 likesShow this thread -
Replying to @citnaj
Any better metrics that you might not have come up with along the way? Or sets of metrics?
3 replies 0 retweets 3 likes
1/ Just rules of thumb. So far I find the resnet architecture generally is better than EfficientNet. But the other thing that seems quite important is the training regime. The Big Transfer (BitM) paper spells it out pretty clearly that this makes a big difference in transfer ...
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.