OH: Why are language models so much bigger than computer vision models?
...
Because a picture is worth a thousand words
🥁🤦🏻‍♀️
“She said because a picture is worth a thousand words” 😓
An image is already embedded, but text has to be embedded explicitly, which increases the matrix size.









