"ApproxJoin: approximate distributed joins" Le Quoc et al., SoCC'18 http://blog.acolyer.org/2018/11/09/approxjoin-approximate-distributed-joins … #themorningpaper
For aggregate queries involving joins, ApproxJoin can be up to 9x faster and shuffle 82x less data than native Spark joins.pic.twitter.com/ww2ipgBHjl
12:30 AM - 9 Nov 2018
0 replies
8 retweets
44 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.