-We can't prevent an AI from hacking its reward function
-Humans can't hack their reward function
https://twitter.com/Plinz/status/985249543582355458 …
Replying to @Alrenous
Evolution is a check on humans hacking their reward functions: over time, reward functions end up aligned with the goal of successful reproduction.
8:38 AM - 17 Apr 2018