An agent which learned to play Mario without rewards. Instead, it was incentivized to avoid "boredom" (that is, getting into states where it can predict what will happen next). Discovered warp levels, how to defeat bosses, etc. More details: https://blog.openai.com/reinforcement-learning-with-prediction-based-rewards/ …pic.twitter.com/6ObS35iZZS
-
-
This is me mindlessly scrolling Twitter instead of reading the book I sat down with.
-
So I am not the only one doing this
-
I think many of us do, even if not all confess to it. It spoke to me, because I quite literally have traded in watching garbage on TV for scrolling through garbage on Twitter. (Still better than FB or IG though.)
End of conversation
New conversation -
-
-
artificial procrastination
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
yes, that's me!
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.
