The machine learning equivalent: shuffle the labels in your test dataset (so they don't match the test inputs anymore, while keeping the same class distribution) and rerun evaluation. If your accuracy is still as high as before, you have a problem
-
-
Show this thread
-
The best accuracy achievable without looking at the inputs (i.e. just by learning the label distribution) can be a pretty good initial baseline for a difficult problem.
Show this thread
End of conversation
New conversation -
-
-
Mutation Testing?
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
great suggestion. Sometime the smoke tests we run was not covering the part of code we changed!! And we think everything is ok as test passed. Knowing the correct tests to run is an art!
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
Automate this w QuickCheck https://en.wikipedia.org/wiki/QuickCheck generally property based testinghttps://twitter.com/daniel_bilar/status/1363540291228205059 …
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
This is why coverage reports are so useful
-
Even just reference labels in the IDE can be useful for that, but those and code coverage tools won't necessarily pick up cases where there's certain conditions not being exercised.
End of conversation
New conversation -
-
-
subtweet? :)
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
That’s actually very unintuitive and good advice. Thanks!
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
There is nothing more suspicious than code that works on the first try!
-
Oh boy, I hope it compiles and works!!
Okay... what the f is this aboutpic.twitter.com/0KviR9ee3W
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.