#reddit May 2019 comments are now available. reddit.com/r/pushshift/co
Compressed: 14,693,921,454 bytes
Uncompressed: 168,064,521,819 bytes
Number of Comments: 142,463,421
#bigdata #datascience
Conversation
Replying to
No worries -- there will be more data waiting for you when you're done. ;)
Dan, Jason's work is invaluable for our digital stuff research community. I frankly do not know how he manages to scrape all those data from multiple flatforms but he somehow manages it.
2
1


