What scripts break our Latin-1 assumptions in Unicode? I usually check against (Arabic|Hebrew), (some Indic script), Korean, Han, emoji.
I assume one of those definitions referred to combining into "ü"
-
-
Indic gets you most of the stuff with combining chars. The stuff it doesn't do is covered by Korean.
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.