Hey Tweeter scientists, is there a probabilistic structure that gives good estimation for counts in time windows?@posco @avibryant @ccsevers
-
-
Replying to @vitalygordon
@BigDataSc Our friend@avibryant created this one: https://github.com/twitter/algebird/blob/develop/algebird-core/src/main/scala/com/twitter/algebird/HyperLogLogSeries.scala … which is related using HLL. /cc@ccsevers2 replies 0 retweets 1 like -
Replying to @posco
@posco@avibryant@ccsevers Awesome! "it keeps every RhoW value it has seen", how is this constant space?1 reply 0 retweets 0 likes -
Replying to @vitalygordon
@BigDataSc@posco@ccsevers it only keeps the latest time it saw a given value. But that's enough to build the HLL for any window ending now1 reply 0 retweets 0 likes
@BigDataSc @posco @ccsevers in practice these things are 10s of KB per counter.
1:55 PM - 13 Nov 2015
0 replies
0 retweets
0 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.