I have a maybe interesting cardinality estimation problem. Twitter-friends, who should I talk to?
-
-
@avibryant@todd534 that was my hope, but no such luck. Closest thing I've found used HLL + minhash. Convert the expr to CNF then... -
@avibryant@todd534 compute unions for each sub expr then intersect all. It's not clear how error compounds as more sets are intersected. -
-
@avibryant@todd534@posco was looking at http://tech.adroll.com/blog/data/2013/07/10/hll-minhash.html … uses minhashes to estimate set similarity. Usual prob w/ large card. diff -
-
-
@avibryant@todd534@posco I'll benchmark next week. I could see where it might help, but as you said not sure a larger HLL wouldn't do more - 1 more reply
New conversation -
-
-
@avibryant@cbeckpdx no, just lets you avoid needing to invert already combined sketches. were hoping we weren't clued in on some ∩ magic.Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.