If you need massive amounts of Reddit data quickly, the new beta API that is up allows getting comments and submissions 5 to 10 times faster than the current API.
Here's an example of getting 25,000 of the latest comments from the politics subreddit.
#bigdata #datascience
Conversation
Replying to
Here is the code that you can run to benchmark on your end (and the endpoint to use to get data). This endpoint accepts all Elasticsearch aggregations and search features.
reddit.com/r/pushshift/co
#api #datascience
1
2
16

