Skip to content
By using Twitter’s services you agree to our Cookies Use. We and our partners operate globally and use cookies, including for analytics, personalisation, and ads.
  • Home Home Home, current page.
  • About

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @
  • Language: English
    • Bahasa Indonesia
    • Bahasa Melayu
    • Català
    • Čeština
    • Dansk
    • Deutsch
    • English UK
    • Español
    • Filipino
    • Français
    • Hrvatski
    • Italiano
    • Magyar
    • Nederlands
    • Norsk
    • Polski
    • Português
    • Română
    • Slovenčina
    • Suomi
    • Svenska
    • Tiếng Việt
    • Türkçe
    • Ελληνικά
    • Български език
    • Русский
    • Српски
    • Українська мова
    • עִבְרִית
    • العربية
    • فارسی
    • मराठी
    • हिन्दी
    • বাংলা
    • ગુજરાતી
    • தமிழ்
    • ಕನ್ನಡ
    • ภาษาไทย
    • 한국어
    • 日本語
    • 简体中文
    • 繁體中文
  • Have an account? Log in
    Have an account?
    · Forgot password?

    New to Twitter?
    Sign up
wycats's profile
Yehuda Katz 🥨
Yehuda Katz 🥨
Yehuda Katz  🥨
Verified account
@wycats

Tweets

Yehuda Katz  🥨Verified account

@wycats

Tilde Co-Founder, OSS enthusiast and world traveler.

Portland, OR
yehudakatz.com
Joined August 2007

Tweets

  • © 2018 Twitter
  • About
  • Help Center
  • Terms
  • Privacy policy
  • Cookies
  • Ads info
Dismiss
Previous
Next

Go to a person's profile

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @

Promote this Tweet

Block

  • Tweet with a location

    You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more

    Your lists

    Create a new list


    Under 100 characters, optional

    Privacy

    Copy link to Tweet

    Embed this Tweet

    Embed this Video

    Add this Tweet to your website by copying the code below. Learn more

    Add this video to your website by copying the code below. Learn more

    Hmm, there was a problem reaching the server.

    By embedding Twitter content in your website or app, you are agreeing to the Twitter Developer Agreement and Developer Policy.

    Preview

    Why you're seeing this ad

    Log in to Twitter

    · Forgot password?
    Don't have an account? Sign up »

    Sign up for Twitter

    Not on Twitter? Sign up, tune into the things you care about, and get updates as they happen.

    Sign up
    Have an account? Log in »

    Two-way (sending and receiving) short codes:

    Country Code For customers of
    United States 40404 (any)
    Canada 21212 (any)
    United Kingdom 86444 Vodafone, Orange, 3, O2
    Brazil 40404 Nextel, TIM
    Haiti 40404 Digicel, Voila
    Ireland 51210 Vodafone, O2
    India 53000 Bharti Airtel, Videocon, Reliance
    Indonesia 89887 AXIS, 3, Telkomsel, Indosat, XL Axiata
    Italy 4880804 Wind
    3424486444 Vodafone
    » See SMS short codes for other countries

    Confirmation

     

    Welcome home!

    This timeline is where you’ll spend most of your time, getting instant updates about what matters to you.

    Tweets not working for you?

    Hover over the profile pic and click the Following button to unfollow any account.

    Say a lot with a little

    When you see a Tweet you love, tap the heart — it lets the person who wrote it know you shared the love.

    Spread the word

    The fastest way to share someone else’s Tweet with your followers is with a Retweet. Tap the icon to send it instantly.

    Join the conversation

    Add your thoughts about any Tweet with a Reply. Find a topic you’re passionate about, and jump right in.

    Learn the latest

    Get instant insight into what people are talking about now.

    Get more of what you love

    Follow more accounts to get instant updates about topics you care about.

    Find what's happening

    See the latest conversations about any topic instantly.

    Never miss a Moment

    Catch up instantly on the best stories happening as they unfold.

    1. Yehuda Katz  🥨‏Verified account @wycats Apr 3

      Is there a way to use the Unicode tables in the Rust regex crate to match a single char without turning it into a String or &str? It seems like the internals must have a way to do this, but perhaps not exposed? Is there a trick?

      6 replies 0 retweets 4 likes
    2. Andrew Gallant‏ @burntsushi5 Apr 6
      Replying to @wycats

      (And you are right that regex crate knows what a Unicode codepoint is. It is, in fact, the fundamental atom of a match for Unicode regexes. This is not as good as using grapheme clusters, but is easier to implement!)

      1 reply 0 retweets 0 likes
    3. Yehuda Katz  🥨‏Verified account @wycats Apr 6
      Replying to @burntsushi5

      Is it ridiculous to consider exposing a "codepoint match" facility? Or did I just not understand something about what makes matching the `char` type to a codepoint difficult? (the ontology is complicated enough that I could be missing a mismatch somewhere)

      1 reply 0 retweets 0 likes
    4. Andrew Gallant‏ @burntsushi5 Apr 6
      Replying to @wycats

      I don't know if I would use the term 'ridiculous' necessarily, but I think I would need some compelling evidence to motivate it. There's some incongruities to consider (like regexes that never match a single codepoint), and whether it's really worth a new API item.

      1 reply 0 retweets 0 likes
    5. Andrew Gallant‏ @burntsushi5 Apr 6
      Replying to @burntsushi5 @wycats

      e.g., If there was an example that said, "match on a specific codepoint by doing `http://re.is _match(codepoint.encode_utf8(&mut [0; 4]))`" that might be enough. https://play.rust-lang.org/?gist=79cd9455d12d186af21ae685f6f909fb&version=stable …

      1 reply 0 retweets 0 likes
      Yehuda Katz  🥨‏Verified account @wycats Apr 7
      Replying to @burntsushi5

      I would never have thought to turn a char into a &str that way! I wonder if it means we should add char as_str(&self) -> &str

      2:44 AM - 7 Apr 2018
      2 replies 0 retweets 0 likes
        1. New conversation
        2. Andrew Gallant‏ @burntsushi5 Apr 7
          Replying to @wycats

          Hmm. Don't think that would work. You would need to return an array, but we don't have a type for "fixed size array whose contents are guaranteed to be UTF-8."

          1 reply 0 retweets 0 likes
        3. Yehuda Katz  🥨‏Verified account @wycats Apr 7
          Replying to @burntsushi5

          I mean once you have the &[u8] can't you read it into a &str? Where would the unsafety come from?

          1 reply 0 retweets 0 likes
        4. Sean Griffin‏ @sgrif Apr 7
          Replying to @wycats @burntsushi5

          I don't think you can get a &[u8] for anything other than ASCII, since the in-memory representation is different between `char` and UTF-8. Would have to be owned

          1 reply 0 retweets 0 likes
        5. Sean Griffin‏ @sgrif Apr 7
          Replying to @sgrif @wycats @burntsushi5

          https://play.rust-lang.org/?gist=285bdf7fbaed7a25eb7a378b13bab1c1&version=stable …

          1 reply 0 retweets 0 likes
        6. Yehuda Katz  🥨‏Verified account @wycats Apr 7
          Replying to @sgrif @burntsushi5

          You could read the data into a new fixed size UTF8 stack array (doing the conversion) and then read it into the &str.

          1 reply 0 retweets 0 likes
        7. Andrew Gallant‏ @burntsushi5 Apr 7
          Replying to @wycats @sgrif

          What is the lifetime of your &str? Where does it point to?

          2 replies 0 retweets 1 like
        8. Yehuda Katz  🥨‏Verified account @wycats Apr 7
          Replying to @burntsushi5 @sgrif

          Couldn't you stack allocate another four bytes (or 8 if needed) and unsafely write into it after validating?

          1 reply 0 retweets 0 likes
        9. Andrew Gallant‏ @burntsushi5 Apr 7
          Replying to @wycats @sgrif

          I would suggest trying to write the code. The nature of Twitter prevents me from understanding where you've gone wrong. :-)

          1 reply 0 retweets 0 likes
        10. 1 more reply
        1. Alexander “bad code no docs” Payne‏ @myrrlyn Apr 7
          Replying to @wycats @burntsushi5

          It would have to be &mut char -> &str and overwrite the four bytes of char storage with the UTF-8 encoding … which is only safe while UTF-8 is prevented from having a fifth or sixth byte. But then you'd have a char binding in an invalid state that can still be accessed…

          0 replies 0 retweets 0 likes
          Thanks. Twitter will use this to make your timeline better. Undo
          Undo

      Loading seems to be taking a while.

      Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

        Promoted Tweet

        false

        • © 2018 Twitter
        • About
        • Help Center
        • Terms
        • Privacy policy
        • Cookies
        • Ads info