Megatron-CTRL: Controllable Story Generation using External Knowledge from @NVIDIAAI at #emnlp #gtc2020 talk Thursday at 11PM PDT. @pengxu1026 Mostafa Patwary @MohammadShoeybi @TheRealRPuri @pascalefung @ctnzr
Blog: https://developer.nvidia.com/blog/adding-external-knowledge-and-controllability-to-language-models-with-megatron-cntrl/ …
Paper: https://arxiv.org/abs/2010.00840
Demo of Megatron-CTRL. Text generation adapts seamlessly to dynamic human input through keywords. We also add retrieval from an external knowledge base to improve consistency. pic.twitter.com/S0yy5rOZ1z
91% of our stories from Megatron-CTRL are successfully controlled by new keywords and 93% are consistent, according to MTurk evaluations.
This builds on the Megatron project from @NVIDIAAI, where we trained an 8 billion parameter model using model parallelism on 512 #GPU https://arxiv.org/abs/1909.08053
10:39 AM - 7 Oct 2020