Achieving acceptable training times for GPU-accelerated machine learning on very large datasets has long been a challenge, in part because there are