Muse: Text-To-Image Generation via Masked Generative Transformers Huiwen Chang*, Han Zhang*, Jarred Barber†, AJ Maschinot†, José Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein†, Yuanzhen Li†, Dilip Krishnan† *Equal contribution. †Core contribution. We present Muse, a text-to-image Transformer model that achieves state-of-the-art image generation performance
![Muse: Text-To-Image Generation via Masked Generative Transformers](https://cdn-ak-scissors.b.st-hatena.com/image/square/3e1f06fe6f0a2f04919a2331365cc7c355b5b32d/height=288;version=1;width=512/https%3A%2F%2Fmuse-model.github.io%2Fimages%2Flogo%2Fmuse_cake_1.jpg)