TIL the wacky S390 UTF-8 instructions are actually broken and wrongly convert pseudo-UTF-8 for surrogates. https://sourceware.org/ml/libc-alpha/2016-02/msg00707.html …
Because UTFs are bijections between the set of valid code unit sequences for the UTF and the set of Unicode Scalar Values.
-
-
The values 0xD800-0xDFFF are not Unicode Scalar Values (the reason being UTF-16 has no way to map them to code unit sequences).
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.