Skip to content
By using Twitter’s services you agree to our Cookies Use. We and our partners operate globally and use cookies, including for analytics, personalisation, and ads.
  • Home Home Home, current page.
  • About

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @
  • Language: English
    • Bahasa Indonesia
    • Bahasa Melayu
    • Català
    • Čeština
    • Dansk
    • Deutsch
    • English UK
    • Español
    • Filipino
    • Français
    • Hrvatski
    • Italiano
    • Magyar
    • Nederlands
    • Norsk
    • Polski
    • Português
    • Română
    • Slovenčina
    • Suomi
    • Svenska
    • Tiếng Việt
    • Türkçe
    • Ελληνικά
    • Български език
    • Русский
    • Српски
    • Українська мова
    • עִבְרִית
    • العربية
    • فارسی
    • मराठी
    • हिन्दी
    • বাংলা
    • ગુજરાતી
    • தமிழ்
    • ಕನ್ನಡ
    • ภาษาไทย
    • 한국어
    • 日本語
    • 简体中文
    • 繁體中文
  • Have an account? Log in
    Have an account?
    · Forgot password?

    New to Twitter?
    Sign up
imightbemary's profile
Katie Bauer!
Katie Bauer!
Katie Bauer!
@imightbemary

Tweets

Katie Bauer!

@imightbemary

Wrong but useful. Working @Twitter, tweeting about data science and people management. "Wait, it's all her views?" 🌎 👨‍🚀🔫👨‍🚀 "Always has been"

Joined September 2010

Tweets

  • © 2021 Twitter
  • About
  • Help Center
  • Terms
  • Privacy policy
  • Cookies
  • Ads info
Dismiss
Previous
Next

Go to a person's profile

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @

Promote this Tweet

Block

  • Tweet with a location

    You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more

    Your lists

    Create a new list


    Under 100 characters, optional

    Privacy

    Copy link to Tweet

    Embed this Tweet

    Embed this Video

    Add this Tweet to your website by copying the code below. Learn more

    Add this video to your website by copying the code below. Learn more

    Hmm, there was a problem reaching the server.

    By embedding Twitter content in your website or app, you are agreeing to the Twitter Developer Agreement and Developer Policy.

    Preview

    Why you're seeing this ad

    Log in to Twitter

    · Forgot password?
    Don't have an account? Sign up »

    Sign up for Twitter

    Not on Twitter? Sign up, tune into the things you care about, and get updates as they happen.

    Sign up
    Have an account? Log in »

    Two-way (sending and receiving) short codes:

    Country Code For customers of
    United States 40404 (any)
    Canada 21212 (any)
    United Kingdom 86444 Vodafone, Orange, 3, O2
    Brazil 40404 Nextel, TIM
    Haiti 40404 Digicel, Voila
    Ireland 51210 Vodafone, O2
    India 53000 Bharti Airtel, Videocon, Reliance
    Indonesia 89887 AXIS, 3, Telkomsel, Indosat, XL Axiata
    Italy 4880804 Wind
    3424486444 Vodafone
    » See SMS short codes for other countries

    Confirmation

     

    Welcome home!

    This timeline is where you’ll spend most of your time, getting instant updates about what matters to you.

    Tweets not working for you?

    Hover over the profile pic and click the Following button to unfollow any account.

    Say a lot with a little

    When you see a Tweet you love, tap the heart — it lets the person who wrote it know you shared the love.

    Spread the word

    The fastest way to share someone else’s Tweet with your followers is with a Retweet. Tap the icon to send it instantly.

    Join the conversation

    Add your thoughts about any Tweet with a Reply. Find a topic you’re passionate about, and jump right in.

    Learn the latest

    Get instant insight into what people are talking about now.

    Get more of what you love

    Follow more accounts to get instant updates about topics you care about.

    Find what's happening

    See the latest conversations about any topic instantly.

    Never miss a Moment

    Catch up instantly on the best stories happening as they unfold.

    1. Katie Bauer!‏ @imightbemary 16 Aug 2020

      Applying engineering best practices to data science is a well-intentioned effort, but it has to be done with care. The raw materials, goals and organizational roles of the two professions are different, so treating DS like eng sets it up to look like engineering done badly

      6 replies 44 retweets 196 likes
      Show this thread
    2. Katie Bauer!‏ @imightbemary 16 Aug 2020

      This article by @Mike_Kaminsky, which is about why git for data probably won't banish the specter of dAtA qUaLiTy that plagues so many data orgs, is a good example of just that.https://locallyoptimistic.com/post/git-for-data-not-a-silver-bullet/ …

      1 reply 3 retweets 17 likes
      Show this thread
    3. Katie Bauer!‏ @imightbemary 16 Aug 2020

      TL;DR the data is PERFECT. YOU are the problem.

      1 reply 0 retweets 5 likes
      Show this thread
      Katie Bauer!‏ @imightbemary 16 Aug 2020

      Nah, I'm just trolling. But the article does describe an interesting aspect of DS work--the data is fixed and DSes create value by writing code to transform it for various purposes. Often, the problem is that DS stakeholders misunderstand of the role of DATA in the DS-value chain

      10:11 AM - 16 Aug 2020
      • 4 Likes
      • Brian Weinstein Robert M. McDonnell Sean Law 🇨🇦 Hesam Haddad
      1 reply 0 retweets 4 likes
        1. New conversation
        2. Katie Bauer!‏ @imightbemary 16 Aug 2020

          There are two main modes for creating this transformation code (as described at length in the must-read DataOps Manifesto), innovation and productionization. https://www.dataopsmanifesto.org/ pic.twitter.com/hqQo27zBMB

          1 reply 3 retweets 12 likes
          Show this thread
        3. Katie Bauer!‏ @imightbemary 16 Aug 2020

          Innovation code evaluates whether transformations of raw data are useful, and productionization code makes those transformations widely available.

          1 reply 0 retweets 4 likes
          Show this thread
        4. Katie Bauer!‏ @imightbemary 16 Aug 2020

          (And as a side note, since I'm a shameless @LocalOptimistic stan, this article by @AyRenay and Caitlin Moorman captures the same innovation/productionization split under the names 'circular' and 'linear'! https://locallyoptimistic.com/post/linear-and-circular-projects-part-1/ …)

          1 reply 0 retweets 10 likes
          Show this thread
        5. Katie Bauer!‏ @imightbemary 16 Aug 2020

          At any rate, both workflows are similar to software engineering since they're based around writing code, and when it comes to the literal process of writing code, DSes should copy SWE by checking their work into version control, writing unit tests, etc.

          1 reply 0 retweets 3 likes
          Show this thread
        6. Katie Bauer!‏ @imightbemary 16 Aug 2020

          The departure point is the code's input. Rather than working with well-understood production database, DS works with the wild world of log data. And as Heraclitus said, no one ever steps in the same data lake twice, for it's not the same data lake and they are not the same person

          1 reply 0 retweets 11 likes
          Show this thread
        7. Katie Bauer!‏ @imightbemary 16 Aug 2020

          The fact that the shape and volume of input data can change so quickly is what makes writing DS code hard. Statistical (and data) modeling is all about encoding assumptions, and the logs of a fast changing product can upend your assumptions in real time.

          1 reply 1 retweet 4 likes
          Show this thread
        8. Katie Bauer!‏ @imightbemary 16 Aug 2020

          Rather than focus on changing the raw data, DSes are better suited to making their data transformations more robust to variation in said raw data. This can be hard to wrap your head around if you think data should always perfectly reflect what happened in your product.

          1 reply 0 retweets 4 likes
          Show this thread
        9. Katie Bauer!‏ @imightbemary 16 Aug 2020

          But it's also why it's called data SCIENCE. It's about finding signals in noise. It uses similar tools to SWE, but it's a fundamentally different craft. Being crisp about this distinction saves you the grief of looking like an amateurish engineer

          2 replies 0 retweets 10 likes
          Show this thread
        10. End of conversation

      Loading seems to be taking a while.

      Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

        Promoted Tweet

        false

        • © 2021 Twitter
        • About
        • Help Center
        • Terms
        • Privacy policy
        • Cookies
        • Ads info