Has anyone written the regex for Unicode sentence segmentation before? Check it out: https://gist.github.com/BurntSushi/b440e1beb3d5077da769a2ba6759aa58 … --- Compiles down to a 141KB DFA.
-
Show this thread
Next is the word segmenter. I don't think I have the stomach to do line segmentation. >_<
5:48 AM - 8 Jan 2019
0 replies
0 retweets
3 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.