In this setup, the best AI is a hashtable (i.e. skill is outsourced to prior simulation via the mediation of memory). That's Deep RL in a nutshell.
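A minimal sketch of the hashtable point, assuming a hypothetical five-cell line world that is not part of the thread: all the "skill" is computed ahead of time by replaying simulated transitions (tabular Q-value iteration here, not a deep network), and acting afterwards is only a dict lookup. The names `GOAL`, `step`, `build_table` and `act` are made up for illustration.

```python
from typing import Dict, Tuple

GOAL = 4            # hypothetical goal cell on a line of cells 0..4
ACTIONS = (-1, +1)  # step left or step right

QTable = Dict[Tuple[int, int], float]


def step(state: int, action: int) -> Tuple[int, float]:
    """Deterministic toy environment: reward 1.0 on reaching GOAL, else 0.0."""
    nxt = min(max(state + action, 0), GOAL)
    return nxt, (1.0 if nxt == GOAL else 0.0)


def build_table(sweeps: int = 50, gamma: float = 0.9) -> QTable:
    """Prior simulation: replay every (state, action) pair repeatedly and
    store the value estimate in a plain dict (the hashtable)."""
    q: QTable = {}
    for _ in range(sweeps):
        for s in range(GOAL):  # GOAL is terminal, so it gets no entries
            for a in ACTIONS:
                nxt, reward = step(s, a)
                if nxt == GOAL:
                    future = 0.0
                else:
                    future = max(q.get((nxt, b), 0.0) for b in ACTIONS)
                q[(s, a)] = reward + gamma * future
    return q


def act(q: QTable, state: int) -> int:
    """Play time: no reasoning, just look up the best stored action."""
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


q_table = build_table()
policy = [act(q_table, s) for s in range(GOAL)]
```

In this toy case `policy` holds the looked-up action for each non-terminal cell. The table only covers states the simulation visited, which is the limitation the tweet is pointing at.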
We need to start using @spelunkyworld as an AI benchmark. The AI that figures out what to do with the Eggplant wins. ;)
Thanks! This has got to be the clearest explanation of RL that exists.... 2 tweets!
It also helps if you just use API access and sidestep all the other tasks humans need to do while playing these games. It's almost as if most video games are laughably primitive, and the "difficult" ones just add more distractions for the player.
This seems to be the real strength of relational networks: They just seem to generalise a whole lot better in RL. Meanwhile CNNs seem to work as a very good particular-situation-detector - there's a reason RL CNN architectures are very different to other CNNs!
https://youtu.be/O0CpCoCo0UQ Do animals have ‘aql? Then we can refine what intelligence is.
Physics simulation or real-world data needs to be a new benchmark.