When a learned model is itself an optimizer, we call this a "mesa-optimizer". What factors help predict whether a machine learning system will produce a mesa-optimizer? https://www.alignmentforum.org/s/r9tYkB2a8Fp4DN8yB/p/q2rCMHNXazALgQpGH …