Skip to content
By using Twitter’s services you agree to our Cookies Use. We and our partners operate globally and use cookies, including for analytics, personalisation, and ads.
  • Home Home Home, current page.
  • About

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @
  • Language: English
    • Bahasa Indonesia
    • Bahasa Melayu
    • Català
    • Čeština
    • Dansk
    • Deutsch
    • English UK
    • Español
    • Filipino
    • Français
    • Hrvatski
    • Italiano
    • Magyar
    • Nederlands
    • Norsk
    • Polski
    • Português
    • Română
    • Slovenčina
    • Suomi
    • Svenska
    • Tiếng Việt
    • Türkçe
    • Ελληνικά
    • Български език
    • Русский
    • Српски
    • Українська мова
    • עִבְרִית
    • العربية
    • فارسی
    • मराठी
    • हिन्दी
    • বাংলা
    • ગુજરાતી
    • தமிழ்
    • ಕನ್ನಡ
    • ภาษาไทย
    • 한국어
    • 日本語
    • 简体中文
    • 繁體中文
  • Have an account? Log in
    Have an account?
    · Forgot password?

    New to Twitter?
    Sign up
bgrant0607's profile
Brian Grant
Brian Grant
Brian Grant
@bgrant0607

Tweets

Brian Grant

@bgrant0607

Google Cloud. Kubernetes Steering Committee emeritus, K8s SIG Architecture co-Chair emeritus, CNCF TOC member emeritus

Google
github.com/bgrant0607
Joined June 2014

Tweets

  • © 2020 Twitter
  • About
  • Help Center
  • Terms
  • Privacy policy
  • Imprint
  • Cookies
  • Ads info
Dismiss
Previous
Next

Go to a person's profile

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @

Promote this Tweet

Block

  • Tweet with a location

    You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more

    Your lists

    Create a new list


    Under 100 characters, optional

    Privacy

    Copy link to Tweet

    Embed this Tweet

    Embed this Video

    Add this Tweet to your website by copying the code below. Learn more

    Add this video to your website by copying the code below. Learn more

    Hmm, there was a problem reaching the server.

    By embedding Twitter content in your website or app, you are agreeing to the Twitter Developer Agreement and Developer Policy.

    Preview

    Why you're seeing this ad

    Log in to Twitter

    · Forgot password?
    Don't have an account? Sign up »

    Sign up for Twitter

    Not on Twitter? Sign up, tune into the things you care about, and get updates as they happen.

    Sign up
    Have an account? Log in »

    Two-way (sending and receiving) short codes:

    Country Code For customers of
    United States 40404 (any)
    Canada 21212 (any)
    United Kingdom 86444 Vodafone, Orange, 3, O2
    Brazil 40404 Nextel, TIM
    Haiti 40404 Digicel, Voila
    Ireland 51210 Vodafone, O2
    India 53000 Bharti Airtel, Videocon, Reliance
    Indonesia 89887 AXIS, 3, Telkomsel, Indosat, XL Axiata
    Italy 4880804 Wind
    3424486444 Vodafone
    » See SMS short codes for other countries

    Confirmation

     

    Welcome home!

    This timeline is where you’ll spend most of your time, getting instant updates about what matters to you.

    Tweets not working for you?

    Hover over the profile pic and click the Following button to unfollow any account.

    Say a lot with a little

    When you see a Tweet you love, tap the heart — it lets the person who wrote it know you shared the love.

    Spread the word

    The fastest way to share someone else’s Tweet with your followers is with a Retweet. Tap the icon to send it instantly.

    Join the conversation

    Add your thoughts about any Tweet with a Reply. Find a topic you’re passionate about, and jump right in.

    Learn the latest

    Get instant insight into what people are talking about now.

    Get more of what you love

    Follow more accounts to get instant updates about topics you care about.

    Find what's happening

    See the latest conversations about any topic instantly.

    Never miss a Moment

    Catch up instantly on the best stories happening as they unfold.

    1. Brian Grant‏ @bgrant0607 22 Jul 2019
      • Report Tweet
      • Report NetzDG Violation

      Kubernetes Borg/Omega history topic 14: Computational Quality of Service (QoS) and oversubscription. What are they, why would you want them, and how is QoS different than priority? On the last point, it's distinguishing importance and urgency.

      2 replies 21 retweets 70 likes
      Show this thread
    2. Brian Grant‏ @bgrant0607 22 Jul 2019
      • Report Tweet
      • Report NetzDG Violation

      QoS is something that wouldn't matter if just one process were running per host system or if all the processes steadily used a constant amount of cpu, memory, and other resources. Because they vary, reserving max capacity needed for each would leave systems poorly utilized

      1 reply 1 retweet 1 like
      Show this thread
    3. Brian Grant‏ @bgrant0607 22 Jul 2019
      • Report Tweet
      • Report NetzDG Violation

      Oversubscription mitigates that by packing more applications onto a system than could all fit at their peak requirements. It's kind of like a bank: not everyone can withdraw at the same time. The question is then: what happens when apps demand more resources than they can get?

      1 reply 1 retweet 1 like
      Show this thread
    4. Brian Grant‏ @bgrant0607 22 Jul 2019
      • Report Tweet
      • Report NetzDG Violation

      With time-division multiplexing, many CPU threads can be interleaved. They can be blocked and queued by the OS, typically at the cost of context switches and waiting a few time slices. Thus there is no fixed limit to how many can be packed onto a machine. CPU is compressible

      1 reply 1 retweet 1 like
      Show this thread
      Brian Grant‏ @bgrant0607 22 Jul 2019
      • Report Tweet
      • Report NetzDG Violation

      OTOH, swapping memory pages, even to local SSD, is painfully expensive. This is why systems hosting services that need to respond with subsecond latency disable swap. Memory is considered an incompressible resource. For simplicity, I'll ignore resources other than CPU and memory

      9:30 AM - 22 Jul 2019
      • 1 Retweet
      • 3 Likes
      • Niels van Brecht Timothy Chen Blaize D'souza
      1 reply 1 retweet 3 likes
        1. New conversation
        2. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Compressible resources like CPU can be made available quickly by the kernel with low impact to the threads that were interrupted, provided it knows which threads urgently need the resources and which ones don't. We call this latency sensitive and latency tolerant respectively

          1 reply 1 retweet 3 likes
          Show this thread
        3. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Borg used an explicit attribute to indicate this, called appclass, which is described by the Borg paper: https://ai.google/research/pubs/pub43438 …. This was translated to scheduling latency in LMCTFY: https://github.com/google/lmctfy/blob/master/include/lmctfy.proto#L142 …. In Kubernetes, it's inferred from resource requests and limits

          1 reply 1 retweet 4 likes
          Show this thread
        4. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          In order to reallocate incompressible resources quickly, threads need to be killed, which is obviously not low impact. (For memory, in Linux this is done by the OOM killer.) This was why Borg used priority (production priority vs not) to make memory oversubscription decisions

          1 reply 1 retweet 1 like
          Show this thread
        5. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Borg's resource reclamation approach is described by the paper: reservations based on observed usage were computed and oversubscribed resources (latency-tolerant cpu and non-production memory) were tallied against reservations whereas guaranteed ones used limits. Complicated.

          1 reply 1 retweet 1 like
          Show this thread
        6. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Vertical autoscaling (VA) added even more complexity. VA changed limits, but left its own padding to provide slack for reaction time and observation of demand. Ad hoc mechanisms were added to disable limit enforcement for each resource, creating a notion similar to request in K8s

          1 reply 1 retweet 2 likes
          Show this thread
        7. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          In K8s, I wanted something simpler, to directly convey the desire for oversubscription and bursting flexibility. The discussion started way back in http://issues.k8s.io/147  and http://issues.k8s.io/168 . The model we settled on was determined by looking at limits and requests

          2 replies 1 retweet 3 likes
          Show this thread
        8. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Request==Limit implies guaranteed resources (not oversubscribed). Request<Limit implies burstable (oversubscribed). Zero request implies best effort. Borg scheduled best effort using reservation, but no throughput guarantees could be made in practice

          1 reply 1 retweet 3 likes
          Show this thread
        9. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          This is described in the resource model design (https://github.com/kubernetes/community/blob/master/contributors/design-proposals/scheduling/resources.md …) and the QoS proposal (https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/resource-qos.md …), including the mapping to OOM scores. The mapping to cgroup cpu shares is described in the pod resource design (https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/pod-resource-management.md …).

          1 reply 1 retweet 1 like
          Show this thread
        10. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Some work on Vertical Pod Autoscaling for Kubernetes has started: https://github.com/kubernetes/community/blob/master/contributors/design-proposals/autoscaling/vertical-pod-autoscaler.md …. There have been proposals to implement oversubscription also (https://github.com/kubernetes/enhancements/issues/355 …). As for horizontal scaling, resource monitoring infrastructure is a prerequisite

          1 reply 2 retweets 3 likes
          Show this thread
        11. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          If managing cluster-level sharing using ResourceQuota and LimitRange, oversubscription can be done at that level also. The original designs were described by https://github.com/kubernetes/community/blob/master/contributors/design-proposals/resource-management/admission_control_limit_range.md … and https://github.com/kubernetes/community/blob/master/contributors/design-proposals/resource-management/admission_control_resource_quota.md …, with improvements in https://github.com/kubernetes/community/blob/master/contributors/design-proposals/resource-management/resource-quota-scoping.md …

          1 reply 1 retweet 4 likes
          Show this thread
        12. Brian Grant‏ @bgrant0607 22 Jul 2019
          • Report Tweet
          • Report NetzDG Violation

          Ok, this topic doesn't fit into a Twitter form factor very well. Maybe some day I'll get around to writing this up more in long form. For now, that's about all I have time for, but questions are welcome

          3 replies 0 retweets 7 likes
          Show this thread
        13. End of conversation

      Loading seems to be taking a while.

      Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

        Promoted Tweet

        false

        • © 2020 Twitter
        • About
        • Help Center
        • Terms
        • Privacy policy
        • Imprint
        • Cookies
        • Ads info