Reddit March 2022 comments have been fixed and reuploaded. If you discover any more damaged uploads, please let me know!
Conversation
Replying to
Hey Jason, I think you uploaded the wrong file: RC_2022-03 is still the old corrupted version but now there is a new RC_2020-03.
4
I had to decode the latest files as iso-8859-1 as opposed to the usual utf-8. Is that expected?
2
1
Hey Rob -- are you still able to use / import the data?
Yes, though I only read certain basic fields and did not verify integrity of title/body which are the ones that would contain encoded characters. This was my experience:
1
Show replies


