1/ Higher Top 1 accuracy in image classification actually isn't a great indicator that a new vision model is going to work out well in practice in an image to image task (like DeOldify). I've learned this the hard way after getting excited about a new shiny model many times!
-
-
Any better metrics that you might not have come up with along the way? Or sets of metrics?
-
1/ Just rules of thumb. So far I find the resnet architecture generally is better than EfficientNet. But the other thing that seems quite important is the training regime. The Big Transfer (BitM) paper spells it out pretty clearly that this makes a big difference in transfer ...
End of conversation
New conversation -
-
-
This. One of my most popular tweets was an offhand remark on how to eek out an extra 0.1% on ImageNet, which I regarded as useless b/c ‘interesting problems’ are not Imagenet, but the tweet *exploded* in popularity, which I found puzzling. Then I realized: they were all Kagglers
-
Oh well that makes perfect sense! Do you have a favorite architecture for your interesting work?
- Show replies
New conversation -
-
-
I don't know whether this is tested for CV, but worth a try.https://twitter.com/sai_prasanna/status/1265730581586866182?s=19 …
-
Sounds like a great lead: "Proper DNN comparison hence requires a comparison between their empirical score distributions on unseen data, rather than between single evaluation scores as is standard for more simple, convex models. "
End of conversation
New conversation -
-
-
Yeah, eeking out an extra bit of top-1 on ImageNet can be harmful to other tasks.... I don't have link handy, but there was a paper (neurips 2019 I think?) that showed training with label smoothing hurt downstream transfer learning ability...
- Show replies
New conversation -
-
-
If I correctly understood, you mean that we should not see any score(like accuracy). We should depend on the output result we are getting. Am I right?
-
Unfortunately that does seem to be the best way I know how to do it as of now! It's not like there's 0 correlation with accuracy but it's not great. I've spelled out rules of thumb that have seemed to be most useful in reply to this:https://twitter.com/MaxLenormand/status/1271156822976737280 …
- Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.