As someone who actually kinda likes data munging, I agree. Cleaning is part of EDA. What I suspect most people don't like about data cleaning is repetition (which can be scripted away) and opaque errors in your tooling (which is not a problem with data cleaning in and of itself).https://twitter.com/Randy_Au/status/1304121716831064065 …
Pandas actually has a bunch of really nice utilities built into it, they're just impossible to find. IMO it suffers from fragmentation, but it is an open source project and incredible for what it is
-
-
But to your point, it doesn't change the fact that software engineering is not the area expertise of most DS. That's not what they're hired to do, not what they want to do, and not how they're best positioned to add value.
-
This Tweet is unavailable.
- Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

