Replying to
Yeah, I tried using DALL-E for fashion purposes, and getting a guy with the correct shirt color, jeans color, and jacket color was basically impossible.
Replying to
This isn't surprising given the DALL-E architecture. DALL-E uses CLIP, which isn't really suited for NLU; it's trained to predict which caption goes with which image.
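
For context, here's a rough sketch of the kind of objective CLIP is trained with: a symmetric contrastive loss that rewards pairing each image with its own caption within a batch. This is a toy NumPy version, not OpenAI's code. The names `contrastive_loss` and `log_softmax`, the fixed `temperature` of 0.07 (learned in the real model), and the random stand-in embeddings are all assumptions for illustration.

```python
import numpy as np

def log_softmax(x, axis):
    # Numerically stable log-softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    return x - np.log(np.exp(x).sum(axis=axis, keepdims=True))

def contrastive_loss(image_emb, text_emb, temperature=0.07):
    # Hypothetical helper: row i of image_emb is assumed to pair with row i of text_emb.
    image_emb = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    # Cosine similarity between every image and every caption in the batch.
    logits = image_emb @ text_emb.T / temperature
    idx = np.arange(logits.shape[0])

    # Cross-entropy in both directions; the correct match sits on the diagonal.
    loss_image_to_text = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_text_to_image = -log_softmax(logits, axis=0)[idx, idx].mean()
    return (loss_image_to_text + loss_text_to_image) / 2

# Random stand-in embeddings; a real model would produce these with its encoders.
rng = np.random.default_rng(0)
images = rng.normal(size=(8, 32))
captions = images + 0.1 * rng.normal(size=(8, 32))  # roughly aligned pairs
print(contrastive_loss(images, captions))
```

The loss only asks the model to rank the right caption above the other captions in the batch, so it may do well without binding each color to the right garment.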