Muse: Text-To-Image Generation via Masked Generative Transformers
Presents Muse, a text-to-image Transformer model that achieves SotA image generation perf while being far more efficient than diffusion or AR models.
proj: muse-model.github.io
abs: arxiv.org/abs/2301.00704

