Holy shit, I hate address normalization / dedup / fuzzy matching / probabilistic matching.
Replying to @generativist
are you aware of PQ grams? They're a principled way to probabilistically compare hierarchical data. (I wrote a Scala library.)
1 reply 0 retweets 0 likes -
Replying to @hythloday
Hrm. I am not familiar with them. Looks like I'm mocking something like it right now. Link to your lib for me to read?
1 reply 0 retweets 0 likes -
Replying to @generativist
https://github.com/hythloday/pqgram … here you go
1 reply 0 retweets 0 likes -
Replying to @hythloday
Any papers / tutorials I should read that you think are best?
1 reply 0 retweets 0 likes -
Replying to @generativist
just the one linked in the readme is enough to get going I'd say.
1 reply 0 retweets 1 like
Replying to @hythloday
Excellent. Thanks! From now on, I'm asking Twitter prior to all decisions.
6:32 AM - 3 Aug 2016
from New York, NY
0 replies
0 retweets
2 likes