Excited to share our work: collaboration requires understanding! In Overcooked, self-play doesn't gel with humans: it expects them to play like itself. (1/4) Demo: https://humancompatibleai.github.io/overcooked-demo/ … Blog: https://bair.berkeley.edu/blog/2019/10/21/coordination/ … Paper: https://arxiv.org/abs/1910.05789 Code: https://github.com/HumanCompatibleAI/overcooked_ai …pic.twitter.com/lqbzeTwoqr
-
-
Real humans adapt to the opaque protocols that SP learns, and play differently than the naive behavior cloned model that our agent was trained against, so the effect is smaller. Nonetheless, the human-aware agent still does better, sometimes beating human performance! (4/4)pic.twitter.com/FmR9Mn2Xwx
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
-
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.