I added the legacy code that does something very similar using pandas as well and AFAIK doesn't run out of memory — need to dig into it tho.
-
-
-
Code and comments are not mine so will have to dig into it and see how it works and give it test inputs etc. https://github.com/oliviaguest/pairwise_distance/blob/master/frank_version.py …
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.