Music is easier than image generation, it's less structured, less contextual. You don't see as many mind-blowing demos because it's received less attention, but it's plausible that current architectures could already generate music at superhuman levels.
Quote Tweet
8
8
66


