Does anyone have a Python method that extracts hashtags from text using the same rules Twitter applies for its own hashtag extraction? I've checked Stack Overflow but haven't found any up-to-date code.
github.com/twitter/twitte is the reference, with conformance tests, and github.com/edmondburnett/ seems to be the most up-to-date one, but I'd like to know too.
It looks like it doesn't handle Unicode very well. For instance, #汉字 is a valid hashtag, but it doesn't extract it.
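
For what it's worth, here is a minimal sketch of a Unicode-aware extractor using only the standard `re` module. It is an approximation, not Twitter's actual rule set: the assumptions are that a hashtag starts with `#` or the full-width `＃`, is not preceded by a word character or `&`, and contains at least one letter (so `#123` is skipped). The names `HASHTAG_RE` and `extract_hashtags` are made up for this example, and the conformance tests in the reference repo would be the real check.

```python
import re

# Approximation only (assumed rules, not Twitter's official ones):
# - '#' or full-width '＃' starts a hashtag
# - it must not directly follow a word character or '&' (e.g. "abc#def", "&#x27;")
# - the body is word characters and must include at least one letter
# In Python 3, \w on str patterns is Unicode-aware, so CJK text matches.
HASHTAG_RE = re.compile(r"(?<![\w&])[#＃](\w*[^\W\d_]\w*)")


def extract_hashtags(text):
    """Return hashtag bodies (without the leading '#') found in text."""
    return [match.group(1) for match in HASHTAG_RE.finditer(text)]


if __name__ == "__main__":
    print(extract_hashtags("#汉字 is a valid hashtag, and so is #python3"))
```

Purely numeric tags and tags glued to a preceding word are dropped on purpose; anything beyond that (URLs, length limits, special punctuation) is not handled here.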

