the problem with this is it runs out of memory sooner than my function does — mine does as it is which is a problem
-
-
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
sklearn.metrics.pairwise.pairwise_distances has n_jobs parameter, which does precisely what you need, imho. (just back from vacation, slow)
-
no worries, I think it can't be what I need because it runs out of memory?
-
do you mean it does this with any value of n_jobs? or only some?
-
Lemme try a bunch of n_jobs values and get back to you
-
I think the problem with this function if I recall correctly is it tried to load the distance matrix in memory and I have 50mil data points!
-
50 million data points by 50 million data points does not fit in my RAM even though I have 128 and 64 on the two computers I use.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.