fu() { op="$1"; shift; hdfs dfs "-$op" "$@"; }
@realWobu Well, I tried a MR version and gave up after 36 hours and 450GB of on-disk temporaries. Spark does it in 80GB and 6h (3h on SSD).
-
-
@marcan42 niceley done :) nevertheless the performance, Spark with scala or at least python, it's lot more beaty than plain old java...Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.