Megatron-CTRL: Controllable Story Generation using External Knowledge from @NVIDIAAI at #emnlp #gtc2020 talk Thursday at 11PM PDT. @pengxu1026 Mostafa Patwary @MohammadShoeybi @TheRealRPuri @pascalefung @ctnzr
Blog: https://developer.nvidia.com/blog/adding-external-knowledge-and-controllability-to-language-models-with-megatron-cntrl/ …
Paper: https://arxiv.org/abs/2010.00840
-
-
91% of our stories from Megatron-CTRL are successfully controlled by new keywords and 93% are consistent, from Mturk evaluations. This builds on Megatron project from
@NVIDIAAI where we trained 8 billion model using model parallelism on 512#GPU https://arxiv.org/abs/1909.08053Show this threadThanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
any evaluations of how it would perform with other training corpora?
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.