Thanks Gary. We don't go deep into the broader picture, but in general I think this is a good non-algorithmic way to incorporate human intelligence in AI. Human-defined subset testing can avoid many bone-headed AI failures, at least in constrained and understood environments.
-
-
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
Simple scores are terrible for denoting performance in real world. They neither capture the degree or the gravity of the error.
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.