Apropos of AlphaGo (BetaGo?), why reinforcement learning AIs are here to stay: agent AIs are smarter & fasterhttps://www.gwern.net/Tool%20AI
humans in the loop make the feedback loop manageably long, which is good, but they'll miss things that look innocuous on their own
-
-
to spot those, you'd need an about equally powerful AI (maybe a sibling?) - is that what you mean with A/B tests?
-
No, I mean it needs to optimize for something based on observing things affected by its classification/predictions.
- 1 more reply
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.