After testing, it appears that Pushshift can ingest, index and archive 10 million Telegram messages per day. With some additional tweaking, I should be able to push that up to 50-100 million.
I'm uncertain how many total messages there are, but we definitely have the capacity to stream a good chunk of the total messages. I am still working on improving the code.
More updates to come soon!
#bigdata #datascience
Very exciting! I know some people who were doing some work on Telegram at a summer school I did in July. May see if I can put them in touch with you.

