Spark vs MR benchmarking: http://www.vldb.org/pvldb/vol8/p2110-shi.pdf. Main takeaway: the value of caching is not IO, but reuse of transformation/parsing steps.
@avibryant My point is: "currently". Spark's ancestry is in sequential IO. In the presence of caching, the bias should change.
@stuhood @avibryant read @Frankmcsherry on the topic.