Ha, I actually manage a Kaggle dataset! I'm less interested in detailed ML work; I think I'm at the point in that discipline where any extra learning will be ad hoc. Instead, I want a more systematic understanding of distributed data structures and pipeline management.
I've done some scale work, but the only systems I've actually built myself were homebrewed, and I want a better grasp of how pipelines operate from the ground up in enterprise or aspirationally-enterprise settings.
@johnwittrock any recommendations for learning resources on this topic?
Understanding how those big OSS tools work is also super helpful, so I’d also recommend http://aosabook.org/en/index.html