Hey ! I'm working on a paper and looking to grab a smallish random sample of Reddit posts (and comments), say around n=1000 posts. Any suggestions for how to solve this?
Conversation
Replying to
Random across all subreddits? Or a sampling? Does it have to be truly random?
Replying to
It doesn't have to be truly random - just roughly/justifiably random. Perhaps randomly sampled across a subset of popular subreddits? Specifically we are interested to analyse 'controversial' comments, but we don't want to oversample these.
1
Replying to
Do you want to restrict to a specific timeframe? We could look at the last month's worth of posts and see what the top X subreddits were based on number of posts, put all the post ids into an array and then get X random out of them. Would be easier if the time-frame was ...
3
Show replies

