Pushshift will be releasing a Python module to make working with the Twitter API easier for researchers and data engineers. For the first stage, I will release the module with methods to handle rehydration of tweet ids.
The module will incorporate some exciting features...
...including API key management and parallel workers to speed up fetching tweets by ID from Twitter's statuses/lookup endpoint. I hope to have this code up and available on GitHub shortly.
This code will complement the data sets that Pushshift will be dumping over...
...the next several weeks.
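To give a rough idea of what rehydration with key management and parallel workers can look like, here is a minimal sketch. It is not the Pushshift module's code, which has not been released yet; the names `rehydrate`, `chunked`, `fetch_batch`, and `fake_fetch` are all hypothetical. It assumes the caller supplies a `fetch_batch(ids, api_key)` function that performs the actual statuses/lookup request, and it relies on that endpoint accepting up to 100 IDs per call.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import cycle

BATCH_SIZE = 100  # statuses/lookup accepts up to 100 IDs per request


def chunked(ids, size=BATCH_SIZE):
    """Yield consecutive slices of `ids`, each at most `size` long."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def rehydrate(tweet_ids, api_keys, fetch_batch, max_workers=4):
    """Fetch tweets for `tweet_ids` in parallel, rotating through `api_keys`.

    `fetch_batch(ids, api_key)` is a caller-supplied (hypothetical) function
    that makes one lookup request and returns a list of tweet objects.
    """
    if not api_keys:
        raise ValueError("at least one API key is required")

    keys = cycle(api_keys)  # simple round-robin key assignment
    jobs = [(batch, next(keys)) for batch in chunked(list(tweet_ids))]

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the submitted jobs
        results = pool.map(lambda job: fetch_batch(*job), jobs)
        return [tweet for batch in results for tweet in batch]


def fake_fetch(ids, api_key):
    """Stand-in for a real network call, for trying the sketch offline."""
    return [{"id": i, "fetched_with": api_key} for i in ids]
```

Calling `rehydrate(range(250), ["key-a", "key-b"], fake_fetch)` splits the IDs into three batches and alternates between the two keys. A real client would also need to handle rate limits, request errors, and IDs that no longer resolve to a tweet.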
As always, if you have any questions about the code or the data sets, feel free to ask!
