One of the hardest things that I have worked on is creating a master data specification for Pushshift to be able to intelligently classify various different pieces of data in a way that makes searching and filtering easier. Once I complete this, I will share it and hopefully ...
Conversation
... get some valuable feedback from the academic community. I know that many of my choices won't be the best ones, but it will always be a work in progress that can constantly be improved and refined with the help of others.
1
1
1
A lot of this gets to the very heart of data science. There are so many different methods for classifying data and all of them have their strengths and weaknesses.
