Any Unicode pros available who can help write a high-performance UTF-16 code unit counter function for UTF-8 strings? My best attempt so far is https://gist.github.com/WebFreak001/65976f5d8916efc4bc8f28f93bfb827c … but I'm sure this could be vectorized! Also need to figure out a way to do it the other way around afterwards...
Thanks! Pretty cool idea to check for continuation bytes and go byte by byte instead of from UTF-16 code point to UTF-16 code point. Worked out the rest with someone else in the other reply now too!
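For anyone reading along, here is a rough sketch of the byte-by-byte idea in Rust. It is not the code from the gist or from the replies. The function names `utf16_len` and `utf8_len` are made up for illustration, and both functions assume well-formed input (valid UTF-8, and UTF-16 with correctly paired surrogates). Every byte that is not a continuation byte starts a code point and costs one UTF-16 code unit, and a 4-byte lead byte costs one extra because it becomes a surrogate pair. The second function is one possible take on "the other way around".

```rust
/// Count the UTF-16 code units needed for a UTF-8 byte string,
/// without decoding any code points. Assumes valid UTF-8.
pub fn utf16_len(utf8: &[u8]) -> usize {
    utf8.iter()
        .map(|&b| {
            // Continuation bytes look like 0b10xx_xxxx and add nothing.
            let starts_code_point = ((b & 0xC0) != 0x80) as usize;
            // Lead bytes of 4-byte sequences (0xF0 and up in valid UTF-8)
            // encode to a surrogate pair, so they cost one extra unit.
            let four_byte_lead = (b >= 0xF0) as usize;
            starts_code_point + four_byte_lead
        })
        .sum()
}

/// Count the UTF-8 bytes needed for a UTF-16 code unit string.
/// Assumes surrogates are correctly paired.
pub fn utf8_len(utf16: &[u16]) -> usize {
    utf16
        .iter()
        .map(|&u| {
            if u < 0x80 {
                1
            } else if u < 0x800 {
                2
            } else if (0xD800..0xE000).contains(&u) {
                // Each half of a surrogate pair counts 2, so the pair
                // totals the 4 bytes of its UTF-8 encoding.
                2
            } else {
                3
            }
        })
        .sum()
}
```

For example, `utf16_len("a😀".as_bytes())` gives 3: one unit for `a` and two for the emoji. The per-byte work is written without branches in the hope that the compiler can auto-vectorize the loop, but that is worth checking in the generated assembly rather than assuming.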