1/ While the paper includes a DOI that points to a sample of the data, it’s important to note that we make billions of posts and comments available through a searchable API at pushshift.io. Just looking at the volume of data shows how much Reddit has grown over time!
2/ Releasing this paper is the culmination of 5 years of effort. I started Pushshift as a side project, and while I hoped that people would find it useful, I never imagined the kind of impact it would have. Over 100 peer-reviewed papers have relied on Pushshift for data!
3/ Pushshift is a community of researchers, data journalists, and citizen scientists. Over the years, we’ve built up an active subreddit and fostered a collaborative Slack workspace, and we’ve even built tools that others can use in their own collaborative groups.