Does anybody know of a fast implementation of Constrained K-Means like http://algoholic.eu/constrained-k-means-implementation-in-python/ … BUT FAST? Need it to cope with very big data.
-
-
Replying to @o_guest
what's your hardware situation like? Is parallelization possible?
1 reply 0 retweets 1 like -
Replying to @artificialsoph
32 cores and 64GB RAM! I've never parallelised something from scratch before.
Teach me the magic!1 reply 0 retweets 1 like -
Replying to @o_guest
idk what language you use but for python scikit-learn has a good parallelizable implementation http://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html …
1 reply 2 retweets 2 likes -
Replying to @artificialsoph @o_guest
the input `n_jobs` determines the number of cores used. k-means is tricky to parallelize, though
1 reply 0 retweets 1 like
Replying to @artificialsoph
oooh thanks for the info and motivation, yes, I'm using python. I'll try to see if I can parallelise the code I linked to. 

2:36 PM - 2 Jan 2017
0 replies
0 retweets
0 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.