Pushshift is processing 250 million tweets related to the Coronavirus and will be publishing a comprehensive list of hashtags associated with all tweets that mention the Coronavirus.
I expect the job to complete by tomorrow morning. The list should contain thousands of hashtags.
Conversation
Replying to
You're a saint! The volume of tweets had exceeded the data collection routine I threw together.
1
6
Replying to
It's a huge dataset. I'd like to publish the entire dataset but I have to check Twitter's TOS for research applications. I think at a bare minimum, I could publish the tweet ids along with some code to rehydrate the ids which should work.
Replying to
Yep! Tweet IDs and I'd be happy to help write and share code for data processing from those (at least for a lot of academics anyway)
1
8


