Due to popular demand, I am now creating thorough documentation for the Pushshift search API:
github.com/pushshift/api
As an example, to find comments posted to Reddit related to Houston over the past 4 hours and find relevant posts: api.pushshift.io/reddit/search/
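A rough sketch of how such a query URL might be put together. The base URL is the one quoted above (the link is cut off in the thread, so the full path is not known). The `q` parameter name and the `4h` relative-time value are assumptions for illustration, and `build_search_url` is a hypothetical helper, not part of the API.

```python
from urllib.parse import urlencode

# Base URL as quoted in the thread; the rest of the original link is truncated.
BASE_URL = "https://api.pushshift.io/reddit/search/"


def build_search_url(query, after, base_url=BASE_URL):
    """Return a search URL with a search term and a lower time bound.

    Hypothetical helper: "q" is an assumed parameter name, and the
    relative-time format passed as "after" (e.g. "4h") is also an assumption.
    """
    params = {"q": query, "after": after}
    return base_url + "?" + urlencode(params)


# Comments mentioning Houston from the past 4 hours (assumed syntax).
url = build_search_url("houston", "4h")
print(url)
```

Check the documentation linked above for the actual endpoint paths and parameter names before relying on this.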
Yep! It's intended to be a complement to PRAW. PRAW is a great module, so hopefully this adds a lot of functionality to it.
Do you plan to add an offset/page function, or should we rely on using "before" with created_utc if we want more pages of results?
Using the before / after parameters works best for the time being. Do you think adding pagination to the API would be helpful?
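A minimal sketch of the "before" approach discussed here: fetch a page, take the oldest `created_utc` in it, and use that as the next `before` value. `paginate_before` and `fake_fetch` are hypothetical names. The in-memory `SAMPLE` data stands in for real API responses so the example runs offline, and it assumes results come back newest first and that `before` is exclusive.

```python
def paginate_before(fetch_page, start_before, max_pages=10):
    """Collect results page by page, walking backwards in time.

    fetch_page(before) is any callable returning a list of dicts that
    each carry a "created_utc" timestamp (an assumed response shape).
    """
    results = []
    before = start_before
    for _ in range(max_pages):
        page = fetch_page(before)
        if not page:
            break  # no older results left
        results.extend(page)
        # Next request: only items older than the oldest one seen so far.
        before = min(item["created_utc"] for item in page)
    return results


# In-memory stand-in for the API, newest first, so the sketch runs offline.
SAMPLE = [
    {"id": i, "created_utc": ts}
    for i, ts in enumerate([500, 400, 300, 200, 100])
]


def fake_fetch(before, size=2):
    """Return up to `size` sample items strictly older than `before`."""
    older = [item for item in SAMPLE if item["created_utc"] < before]
    return older[:size]


collected = paginate_before(fake_fetch, start_before=600)
print([item["created_utc"] for item in collected])
```

One caveat with this approach: if several items share the same `created_utc` and straddle a page boundary, an exclusive `before` can skip some of them.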
BigQuery would probably be more sane and performant (just my guess - not sure though!)
BigQuery does work better in some use cases, although I have worked hard to get the response times down for the API. For most simple
searches, the response time is usually under 100ms. I agree with you that pagination is worth pursuing, so I'll add it to the list of
features to be added to the API. Please feel free to recommend more using GitHub if you like. I do appreciate your feedback!