Conversation

1/ While the paper includes a DOI that points to a sample of the data, it’s important to note that we make billions of posts and comments available via a searchable API via pushshift.io. Just looking at the volume of data shows how much Reddit has grown over time!
Image
1
8
2/ Releasing this paper is the culmination of 5 years of effort. I started Pushshift as a side project, and while I hoped that people would find it useful, I never imagined the kind of impact it would have. Over 100 peer reviewed papers have relied on Pushshift for data!
Image
1
13
3/ Pushshift is a community of researchers, data journalists, and citizen scientists. Over the years, we’ve built up an active subreddit, and fostered a collaborative Slack workspace, even building tools that others can use in their own collaborative groups.
Image
Image
1
9
6/ In the end, what started as a side project has grown to be much more. This paper demonstrates our commitment to the community, both in continuing to provide the services that have pushed forward the frontier of science, but also our ongoing efforts to improve Pushshift!
5