Finally looked at the OKCupid paper, and their data collection methodology is a bad joke.pic.twitter.com/9BbWtxDU9p
You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more
We kept usernames for 2 reasons: 1) forgot to scarpe some data, so using them others can get the rest later if they want to.
2) the usernames are an interesting topic in themselves -- what makes someone use e.g. "hot" in their profile name?
And sure, usernames can be interesting. I wouldn't have released them, especially as part of the same dataset.
The lack of consent from any parties is the bigger problem than pairing data inappropriately.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.