Making AI responsive to human preferences (and extremely interesting approach to learning hard to specify behaviors) https://blog.openai.com/deep-reinforcement-learning-from-human-preferences/ …
-
-
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.