. @Ylecun are you arguing that your team has a robust solution to the problem of getting deep nets to understand the causal consequences of events as they unfold over time, or just pointing me to a toy model? Have you tried it on the examples in this thread?https://twitter.com/ylecun/status/1188902027495006208 …
right, which makes his reply a non sequitur. I said GPT-2 doesn''t develop robust representations of *how events unfold over time*; he pointed to a different architecture w memory etc. bait-and-switch and overstated.
-
-
"models like GPT-2" is a very subjective thing. He probably thinks models /w external memory are in the same family (as would I), whereas you probably count them as a "hybrid model".
-
correct, i would. i have been saying since 2001 that memory woud be vital in moving forward.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.