If you are looking for Reddit data for all of 2020 and early 2021, look no further. I thought I had to reingest the previous months because of a server failure a while back but I actually had a backup server still ingesting the data. So I was cleaning up today and found over
Conversation
Replying to
one billion Reddit comments for all of 2020 and early 2021. The script was doing its job happily until the 4TB SSD drive ran out of space -- but the data is there!
I just need to compress 1.2+ billion comments and submissions and the data will be up within a week.
2
24
This will get the archives caught up to around March of 2021 -- the last couple months will be ingested shortly. But hey, pandemic data!
2
26
Replying to
Hello Jason! Was wondering what the status of deletion requests was...I haven't heard back from my PM.
Replying to
you are a lifesaver! It looks like the comment dumps for 2020 are caught up through mid-may -- will the rest of the data be dumped?
P.S. the donation links (stripe) on the site are broken right now but I would love to support





