I'm experimenting with automatic lineage tracking in R: https://github.com/jwills/lineage Thinking in monads actually helps a lot. #shock #rstats
Replying to @samklr
@samklr @posco told me that he thinks @summingbird is what Driskill wanted to be; I've been meaning to prove that out.
Replying to @josh_wills
@josh_wills @posco Hmm, IMHO Summingbird solves half of the problem; I find it hard to adapt to Spark RDDs, for example.
Replying to @samklr
@samklr @josh_wills We are making a Spark backend in Jan or Feb. Stay tuned (and/or help out!)
Replying to @josh_wills
@josh_wills Batch to start, streaming PRs welcome. Our internal Storm is well supported, so we'll stay on that.
Replying to @posco
@posco @josh_wills would be so useful for someone to write up a good comparison of Summingbird, Samza, Kinesis, Spark Streaming...
Replying to @avibryant
@avibryant @josh_wills in particular, as far as I can tell, Samza is a pure analog of Hadoop MapReduce for queues. /cc @jaykreps
Replying to @posco
@posco @josh_wills @jaykreps and Kinesis is basically AWS Samza. But Spark Streaming is also a planner, no?
@posco @josh_wills @jaykreps also, check out how Samza does state; there's some interesting stuff there that goes beyond "streaming MR".