@avibryant you might be interested in https://github.com/seiflotfy/vHLL which is a maximum likelihood sketch
@seiflotfy instead of random numbers they use a hash of the header + first 12 bytes of the content of packet.
-
-
@seiflotfy so if you see the same packet two different places, then later merge the sketches, it only gets counted once (vs twice with PMC). -
@avibryant haven't completed reading the paper yet. But that sounds very right :D On another note I am confused on why no1 picked up PMC yet -
@avibryant The paper came out in 2011 -
@seiflotfy yeah very surprising, given how well-known HLL and CMS have become. I'm looking forward to doing PMC in Scala when I get time. -
@avibryant yeah. was thinking of picking up on#scala, ended up with#golang. Should be a nice first project to hack in Scala.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.