Even with dockerized PySpark workflows, Dask has substantially lowered the friction for doing quick "I wonder" analyses. It's a wonderful tool. https://docs.dask.org/en/latest/
That's *sorta* how mine is. SSD for most things; one fat HDD that I use as intermediary caching for things I'm actually working on; 32TB NAS for everything else.
-
-
Fucked up thing is the NAS *would* be a lot faster than my local HDD if only I didn't have this terrible mesh wifi network that degrades during the rain since I'm in a detached garage...
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.