. @FakeUnicode is Punycode (rfc394{0,2}) a UTF ? (see http://unicode.org/faq/utf_bom.html#gen2 … and https://en.wikipedia.org/wiki/Comparison_of_Unicode_encodings#Historical:_UTF-5_and_UTF-6 …)
-
-
Replying to @a804046a
@qolund That is an interesting question. Is it losslessly round-trippable for all non-surrogate codepoints? Hmm [cc@MaximeEuziere]1 reply 0 retweets 0 likes -
Replying to @FakeUnicode
@FakeUnicode@qolund can it encode all code points? Yep. Does that make it an UTF? I... guess...1 reply 0 retweets 0 likes -
Replying to @MaximeEuziere
@MaximeEuziere@qolund Do the Punycode specs disallow the surrogate codepoints?1 reply 0 retweets 0 likes -
Replying to @FakeUnicode
@FakeUnicode@qolund Hi, sorry I wasn't home. So AFAIK in theory nothing forbids lone surrogates to be encoded in punycode...1 reply 0 retweets 0 likes -
Replying to @MaximeEuziere
@FakeUnicode@qolund ... but concretely, I didn't find any online punicode / IDN tool allowing me to give one lone surrogate as input.2 replies 0 retweets 0 likes -
Replying to @MaximeEuziere
@MaximeEuziere@qolund Anything decode it? xn--ib9b1 reply 0 retweets 0 likes -
Replying to @FakeUnicode
@FakeUnicode@qolund strangely, my browser's addressbar transforms "www.\http://uD800.com " to "…http://www.xn--zn7c.com/ " tho.@mathias1 reply 0 retweets 0 likes -
Replying to @MaximeEuziere
@MaximeEuziere@FakeUnicode@qolund Try the DevTools Console on https://mothereff.in/punycode : > punycode.toASCII('\uD800') 'xn--ib9b'1 reply 0 retweets 1 like
@MaximeEuziere @FakeUnicode @qolund And it round-trips:
punycode.toUnicode('xn--ib9b') == '\uD800' // true
IIRC RFC doesn’t mention them
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.
JavaScript, HTML, CSS, HTTP, performance, security, Bash, Unicode, i18n, macOS.