My data analysis relies heavily on the excellent Elasticsearch. The Elasticsearch team has created an amazing product over the years, and I wholeheartedly recommend their software. The #pushshift API has over 5 billion indexed objects powered by Elasticsearch.
If you are interested in learning more about how Pushshift uses Elasticsearch to allow searching billions of Reddit comments, feel free to ask questions! Much of the source code is available on GitHub.
Thanks again to the Elasticsearch team!
