Occasionally, when you run some analysis on data, you get back unexpected results. For instance, I wanted to get the lowest avg comment score by subreddit. What I got back were subreddits that were all spam. April 1-10, 2019.
#dataviz #datascience
Conversation
These subreddits all had at least 10,000 comments. The problem here is that most of these are comments from the same user. So to get out of spam territory, it might be necessary to require some other filter like a minimum author cardinality.
1
3
Replying to
Q: Are the same spam accounts posting spam to other reddit groups? (Is this a good way to identify/tag spammers who impact all of Reddit?)
1
Replying to
There seems to be two main categories of spammers. Those who work with a few accounts and create a subreddit and spam it for SEO purposes and then spammers that target a bunch of subreddits. This seems to pick up the former.

