Do people have favorite datasets to train new analysts and data scientists on? Something that is messy enough to require some practice cleaning ideally. @_brohrer_ @vboykis @chrisalbon @djpardis @generativist @ryxcommar
-
-
It’s a cool general project though — add meta data to datasets about teachability.
-
Yeah, have to agree. It's hard to find datasets like this in the wild; my best suggestion would probably be to find some government data you're interested in. That tends to be hardest to get and in the worst shape (XML, CSVs, etc.) although this highly varies by agency
- 1 more reply
New conversation -
-
-
Yes, there isn't one single dataset for learning. I didn't interpet the question as asking for just one dataset, but targeting people's interests is a good thing to do when teaching, I think. E.g. I encouraged someone who emailed me about learning to find things that are fun.
-
I should note the first real data cleaning and data analysis task I ever did was compiling old paychecks and calculating wages that my employer stole from me at a dishwashing job. I wouldn't call this "fun" but earning money I was owed sure did compel me to do it.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.