Conversation

basically same strategy as semantle #14, thought in terms of co-occurrence in similar documents. again ran into a mild issue with polysemous words but now that i know to look for it it's manageable
Quote Tweet
today i tried thinking a lot more about what words were likely to occur in similar positions in similar documents in word2vec’s training set and that worked a lot better than trying to use raw semantic distance only
Show this thread
1
4
the main mental move i'm doing is something like maintaining a "light grip" on what i think the word is "about." like too tight a grip and i don't move far away enough, too loose a grip and there are too many options where to go next
1
6
it's not about how similar their meanings are, it's about what kinds of sentences they're likely to be found in. a sentence like "this animal has a large ____" could be filled in with many different body parts e.g.
1
2
Show replies
Replying to
this gamification of developing intuition for neural nets is... brilliant. I kind of want to build a generic version of this for any model, it actually seems like a great way to QA them
1
3
Show replies
Replying to
I also find 25-35 as the sweet spot that I'm directionally on the right track but then get stuck. Going to try your co-occurence mental model instead of going by "type" of word (eg, similar adverbs) or "topic" of word (which might overlap with co-occurence)