Conversation

nice! i think sub-50 is possible here if you're more conservative with your guesses than i was, hoping to reach that milestone in the next few days and maybe try to maintain a long-run sub-100 average in the meantime
1
1
I still don't yet have a feel for how closeness is calculated. I'm wondering if it's worth entering generic words like "thing", "action", etc.
1
1
the training set here was newspaper articles so you want words that appear in a lot of those. word2vec is trained on a prediction task involving predicting a word from the surrounding words in the corpus
1
1
so you want to think about something like, given a word, the following 2-step inference: 1) which sentences in which kinds of newspaper articles is this word likely to appear in 2) which other words are likely to appear in similar sentences in similar articles
1
1
5