Skip to content
  • Home Home Home, current page.
  • Moments Moments Moments, current page.

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @
  • Language: English
    • Bahasa Indonesia
    • Bahasa Melayu
    • Català
    • Čeština
    • Dansk
    • Deutsch
    • English UK
    • Español
    • Filipino
    • Français
    • Hrvatski
    • Italiano
    • Magyar
    • Nederlands
    • Norsk
    • Polski
    • Português
    • Română
    • Slovenčina
    • Suomi
    • Svenska
    • Tiếng Việt
    • Türkçe
    • Ελληνικά
    • Български език
    • Русский
    • Српски
    • Українська мова
    • עִבְרִית
    • العربية
    • فارسی
    • मराठी
    • हिन्दी
    • বাংলা
    • ગુજરાતી
    • தமிழ்
    • ಕನ್ನಡ
    • ภาษาไทย
    • 한국어
    • 日本語
    • 简体中文
    • 繁體中文
  • Have an account? Log in
    Have an account?
    · Forgot password?

    New to Twitter?
    Sign up
RichFelker's profile
Rich Felker
Rich Felker
Rich Felker
@RichFelker

Tweets

Rich Felker

@RichFelker

Yeah, I do @musllibc, FOSS & infosec stuff. But now is not the time for a mostly-/only-tech Twitter feed.

musl-libc.org
Joined March 2014

Tweets

  • © 2018 Twitter
  • About
  • Help Center
  • Terms
  • Privacy policy
  • Cookies
  • Ads info
Dismiss
Previous
Next

Go to a person's profile

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @

Promote this Tweet

Block

  • Tweet with a location

    You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more

    Your lists

    Create a new list


    Under 100 characters, optional

    Privacy

    Copy link to Tweet

    Embed this Tweet

    Embed this Video

    Add this Tweet to your website by copying the code below. Learn more

    Add this video to your website by copying the code below. Learn more

    Hmm, there was a problem reaching the server.

    By embedding Twitter content in your website or app, you are agreeing to the Twitter Developer Agreement and Developer Policy.

    Preview

    Why you're seeing this ad

    Log in to Twitter

    · Forgot password?
    Don't have an account? Sign up »

    Sign up for Twitter

    Not on Twitter? Sign up, tune into the things you care about, and get updates as they happen.

    Sign up
    Have an account? Log in »

    Two-way (sending and receiving) short codes:

    Country Code For customers of
    United States 40404 (any)
    Canada 21212 (any)
    United Kingdom 86444 Vodafone, Orange, 3, O2
    Brazil 40404 Nextel, TIM
    Haiti 40404 Digicel, Voila
    Ireland 51210 Vodafone, O2
    India 53000 Bharti Airtel, Videocon, Reliance
    Indonesia 89887 AXIS, 3, Telkomsel, Indosat, XL Axiata
    Italy 4880804 Wind
    3424486444 Vodafone
    » See SMS short codes for other countries

    Confirmation

     

    Welcome home!

    This timeline is where you’ll spend most of your time, getting instant updates about what matters to you.

    Tweets not working for you?

    Hover over the profile pic and click the Following button to unfollow any account.

    Say a lot with a little

    When you see a Tweet you love, tap the heart — it lets the person who wrote it know you shared the love.

    Spread the word

    The fastest way to share someone else’s Tweet with your followers is with a Retweet. Tap the icon to send it instantly.

    Join the conversation

    Add your thoughts about any Tweet with a Reply. Find a topic you’re passionate about, and jump right in.

    Learn the latest

    Get instant insight into what people are talking about now.

    Get more of what you love

    Follow more accounts to get instant updates about topics you care about.

    Find what's happening

    See the latest conversations about any topic instantly.

    Never miss a Moment

    Catch up instantly on the best stories happening as they unfold.

    1. Rich Felker‏ @RichFelker 12 Sep 2017

      Of course the desirable properties necessitate one property that's hard to like: not all byte sequences can be legal/valid.

      2 replies 0 retweets 2 likes
      Show this thread
    2. Rich Felker‏ @RichFelker 12 Sep 2017

      But understanding the design goals/motivations makes it pretty obvious that this was the only reasonable tradeoff.

      1 reply 0 retweets 2 likes
      Show this thread
    3. Rich Felker‏ @RichFelker 12 Sep 2017

      @rob_pike's UTF-8 history (https://www.cl.cam.ac.uk/~mgk25/ucs/utf-8-history.txt …) sheds a lot of light on the desirable props, but some didn't become obvious til later.

      1 reply 2 retweets 11 likes
      Show this thread
    4. Rich Felker‏ @RichFelker 12 Sep 2017

      Just about the only thing that arguably could have been done better in UTF-8 would have been offsetting the base value for multibyte chars..

      2 replies 0 retweets 2 likes
      Show this thread
    5. Rich Felker‏ @RichFelker 12 Sep 2017

      So that, for example, c0 80 would represent 0x80+[decoded_bits] = 0x80 rather than being an illegal sequence.

      1 reply 0 retweets 2 likes
      Show this thread
    6. Rich Felker‏ @RichFelker 12 Sep 2017

      This would have allowed shorter encoding of a few chars and eliminated the confusion about "overlong sequences" & their invalidity.

      2 replies 0 retweets 3 likes
      Show this thread
    7. Fake “Boonicode.”  ↙️‏ @FakeUnicode 12 Sep 2017
      Replying to @RichFelker

      Punycode does that offset thang. Another UTF-8 stupid is artificially limiting it to UTF-16 scope. UTF-8 coulda supported up to U+7FFFFFFF.

      1 reply 1 retweet 1 like
    8. Rich Felker‏ @RichFelker 12 Sep 2017
      Replying to @FakeUnicode

      Limiting range is a feature.

      2 replies 0 retweets 2 likes
    9. Peter Barfuss 𒀱‏ @bofh453 13 Sep 2017
      Replying to @RichFelker @FakeUnicode

      Why?

      1 reply 0 retweets 0 likes
    10. Rich Felker‏ @RichFelker 13 Sep 2017
      Replying to @bofh453 @FakeUnicode

      Because a 1M entry table is large but manageable. A 2G entry table is absolutely not.

      1 reply 0 retweets 1 like
      Rich Felker‏ @RichFelker 13 Sep 2017
      Replying to @RichFelker @bofh453 @FakeUnicode

      Thankfully there are ways you can reduce the awfulness with multilevel tables & sparseness of ranges with non-const properties. But still...

      12:29 AM - 13 Sep 2017
      2 replies 0 retweets 0 likes
        1. New conversation
        2. Fake “Boonicode.”  ↙️‏ @FakeUnicode 13 Sep 2017
          Replying to @RichFelker @bofh453

          why not keep going...pic.twitter.com/1OCblAL7Gz

          1 reply 0 retweets 0 likes
        3. Greg Parker‏ @gparker 13 Sep 2017
          Replying to @FakeUnicode @RichFelker @bofh453

          Converting between encodings is simpler and faster if there is a small upper bound on the byte size of each encoded code point.

          1 reply 0 retweets 1 like
        4. Rich Felker‏ @RichFelker 13 Sep 2017
          Replying to @gparker @FakeUnicode @bofh453

          I don't think it's simpler or faster, but a limit of 4 bytes does allow in-place (reusing input buffer) conversion from UTF-32 to UTF-8.

          3 replies 0 retweets 0 likes
        5. Steve Canon‏ @stephentyrone 13 Sep 2017
          Replying to @RichFelker @gparker and

          Definitely potentially faster if your vector width is large enough and you have a compress operation.

          1 reply 0 retweets 1 like
        6. Rich Felker‏ @RichFelker 13 Sep 2017
          Replying to @stephentyrone @gparker and

          I don't see any viable way to vectorize the conversion unless you can assume validity of input, and even then it sounds hard.

          1 reply 0 retweets 1 like
        7. Steve Canon‏ @stephentyrone 13 Sep 2017
          Replying to @RichFelker @gparker and

          Vectorizing validity testing is easier than vectorizing conversion. (Neither is actually especially easy, merely possible).

          0 replies 0 retweets 0 likes
        8. End of conversation
        1. New conversation
        2. Peter Barfuss 𒀱‏ @bofh453 13 Sep 2017
          Replying to @RichFelker @FakeUnicode

          so b/c assigned chars aee sparse, you can limit range for *properties* by mapping most to "Unknown" while still preserving conversion range.

          1 reply 0 retweets 0 likes
        3. Rich Felker‏ @RichFelker 13 Sep 2017
          Replying to @bofh453 @FakeUnicode

          Only as long as they remain unassigned. That's not very comforting.

          0 replies 0 retweets 1 like
        4. End of conversation

      Loading seems to be taking a while.

      Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

        Promoted Tweet

        false

        • © 2018 Twitter
        • About
        • Help Center
        • Terms
        • Privacy policy
        • Cookies
        • Ads info