Every available #reddit author, including their author_id, true comment karma, true submission karma, date of creation, name and other metadata. All authors from 2005-2014 (28M+). 2015+ coming soon. Format is RA_YYYY.xz. Example: files.pushshift.io/reddit/authors
#bigdata #datascience
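A rough sketch of how one of these files might be opened in Python. The only things taken from the tweet are the RA_YYYY.xz naming pattern and the xz compression; the helper names `authors_filename` and `iter_lines` are made up for illustration, and the record layout inside each file is not stated here, so the code only yields raw text lines and leaves parsing to the reader.

```python
import lzma


def authors_filename(year):
    """Build a file name following the RA_YYYY.xz pattern (hypothetical helper)."""
    return f"RA_{year:04d}.xz"


def iter_lines(path):
    """Yield decompressed text lines from an .xz file, one at a time.

    Assumption: the file is UTF-8 text with one record per line.
    The actual record layout is not described in the announcement.
    """
    with lzma.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            yield line.rstrip("\n")
```

Streaming line by line avoids loading a whole decompressed file into memory, which matters for a dataset with 28M+ authors.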
As far as I know, this would be the first close estimate. This will even show users that aren't available via /user/username (for example, "unidan"). It also shows true negative karma, which is not truncated to -100 like it is on the site. Some good science should be possible.
This also has "lurkers" (aka people who have never posted or commented). If you compute a median of karma, you may want to exclude people with 0 karma, because I'm pretty sure the median karma for the full dataset would otherwise be 0.
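A minimal sketch of that median point, using made-up numbers rather than real values from the dataset. It assumes each account's karma has already been reduced to a single integer; `median_karma` is a hypothetical helper, not something shipped with the files.

```python
from statistics import median


def median_karma(karma_values, exclude_zero=True):
    """Median of karma values, optionally ignoring accounts with 0 karma."""
    values = [k for k in karma_values if not (exclude_zero and k == 0)]
    if not values:
        raise ValueError("no karma values left after filtering")
    return median(values)


# Hypothetical sample: mostly lurkers with 0 karma, plus a few active accounts,
# one of them with negative karma below the -100 display floor.
sample = [0, 0, 0, 0, 0, 12, 250, -340, 7]

print(median_karma(sample, exclude_zero=False))  # lurkers included
print(median_karma(sample))                      # lurkers excluded
```

With lurkers included, the zeros dominate and the median of this sample is 0; dropping them gives a median based only on accounts that have posted or commented.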

