Skip to content
  • Home Home Home, current page.
  • Moments Moments Moments, current page.

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @
  • Language: English
    • Bahasa Indonesia
    • Bahasa Melayu
    • Català
    • Čeština
    • Dansk
    • Deutsch
    • English UK
    • Español
    • Filipino
    • Français
    • Hrvatski
    • Italiano
    • Magyar
    • Nederlands
    • Norsk
    • Polski
    • Português
    • Română
    • Slovenčina
    • Suomi
    • Svenska
    • Tiếng Việt
    • Türkçe
    • Ελληνικά
    • Български език
    • Русский
    • Српски
    • Українська мова
    • עִבְרִית
    • العربية
    • فارسی
    • मराठी
    • हिन्दी
    • বাংলা
    • ગુજરાતી
    • தமிழ்
    • ಕನ್ನಡ
    • ภาษาไทย
    • 한국어
    • 日本語
    • 简体中文
    • 繁體中文
  • Have an account? Log in
    Have an account?
    · Forgot password?

    New to Twitter?
    Sign up
ESYudkowsky's profile
Eliezer Yudkowsky
Eliezer Yudkowsky
Eliezer Yudkowsky
Verified account
@ESYudkowsky

Tweets

Eliezer YudkowskyVerified account

@ESYudkowsky

Ours is the era of inadequate AI alignment theory. Any other facts about this era are relatively unimportant, but sometimes I tweet about them anyway.

Joined June 2014

Tweets

  • © 2018 Twitter
  • About
  • Help Center
  • Terms
  • Privacy policy
  • Cookies
  • Ads info
Dismiss
Previous
Next

Go to a person's profile

Saved searches

  • Remove
  • In this conversation
    Verified accountProtected Tweets @
Suggested users
  • Verified accountProtected Tweets @
  • Verified accountProtected Tweets @

Promote this Tweet

Block

  • Tweet with a location

    You can add location information to your Tweets, such as your city or precise location, from the web and via third-party applications. You always have the option to delete your Tweet location history. Learn more

    Your lists

    Create a new list


    Under 100 characters, optional

    Privacy

    Copy link to Tweet

    Embed this Tweet

    Embed this Video

    Add this Tweet to your website by copying the code below. Learn more

    Add this video to your website by copying the code below. Learn more

    Hmm, there was a problem reaching the server.

    By embedding Twitter content in your website or app, you are agreeing to the Twitter Developer Agreement and Developer Policy.

    Preview

    Why you're seeing this ad

    Log in to Twitter

    · Forgot password?
    Don't have an account? Sign up »

    Sign up for Twitter

    Not on Twitter? Sign up, tune into the things you care about, and get updates as they happen.

    Sign up
    Have an account? Log in »

    Two-way (sending and receiving) short codes:

    Country Code For customers of
    United States 40404 (any)
    Canada 21212 (any)
    United Kingdom 86444 Vodafone, Orange, 3, O2
    Brazil 40404 Nextel, TIM
    Haiti 40404 Digicel, Voila
    Ireland 51210 Vodafone, O2
    India 53000 Bharti Airtel, Videocon, Reliance
    Indonesia 89887 AXIS, 3, Telkomsel, Indosat, XL Axiata
    Italy 4880804 Wind
    3424486444 Vodafone
    » See SMS short codes for other countries

    Confirmation

     

    Welcome home!

    This timeline is where you’ll spend most of your time, getting instant updates about what matters to you.

    Tweets not working for you?

    Hover over the profile pic and click the Following button to unfollow any account.

    Say a lot with a little

    When you see a Tweet you love, tap the heart — it lets the person who wrote it know you shared the love.

    Spread the word

    The fastest way to share someone else’s Tweet with your followers is with a Retweet. Tap the icon to send it instantly.

    Join the conversation

    Add your thoughts about any Tweet with a Reply. Find a topic you’re passionate about, and jump right in.

    Learn the latest

    Get instant insight into what people are talking about now.

    Get more of what you love

    Follow more accounts to get instant updates about topics you care about.

    Find what's happening

    See the latest conversations about any topic instantly.

    Never miss a Moment

    Catch up instantly on the best stories happening as they unfold.

    1. gwern‏ @gwern 26 Nov 2017

      I know there are multi-scale or patch-specific GANs, but the losses there still seem to be averaged out. A D is 'looking' at all pixels already, so it doesn't necessarily seem like that much extra work to produce additional labels/losses?

      2 replies 0 retweets 1 like
      Show this thread
    2. gwern‏ @gwern 26 Nov 2017

      Like, when I look at GAN samples, usually there's some specific region which looks bad, typically much less than 1/4th of the total pixels, and the rest are fine. GANs aren't *really* RL so why provide them impoverished RL-style global losses?

      1 reply 0 retweets 3 likes
      Show this thread
    3. Eliezer Yudkowsky‏Verified account @ESYudkowsky 26 Nov 2017
      Replying to @gwern

      I must not understand this proposal. Why wouldn't you get the same probability at all pixels? What distinguishes one pixel from another in the loss function for D?

      1 reply 0 retweets 1 like
    4. gwern‏ @gwern 26 Nov 2017
      Replying to @ESYudkowsky

      The D usually emits a single scalar loss for the whole image, eg 0-3 for a WGAN. My suggestion is that it could instead emit for, say, a 64x64px image, a 64x64 array filled with 0-3 scalars, explaining how each pixel contributes to the global average loss of 2.2234.

      2 replies 0 retweets 0 likes
    5. Eliezer Yudkowsky‏Verified account @ESYudkowsky 26 Nov 2017
      Replying to @gwern

      Why isn't that already automatically implicit in doing backprop from D's output to D's input pixels / G's output? How would you train D to do that in any other way?

      1 reply 0 retweets 0 likes
    6. gwern‏ @gwern 26 Nov 2017
      Replying to @ESYudkowsky

      It is implicit but that doesn't mean equivalent. The signal could be a lot noisier. You can train AG with MCTS supervision, all of which feedback is 'implicit' in win/loss feedback, but the latter is still much slower/unstabler.

      1 reply 0 retweets 0 likes
    7. Eliezer Yudkowsky‏Verified account @ESYudkowsky 26 Nov 2017
      Replying to @gwern

      Did you have a particular proposal in mind? What came to my mind after 30sec was training G to complete arbitrary fractions of images including 100%, with both contiguous and discontiguous regions replaced, and then training D to judge pixel probabilities.

      1 reply 0 retweets 0 likes
    8. gwern‏ @gwern 26 Nov 2017
      Replying to @ESYudkowsky

      I hadn't thought about how exactly you'd train the D itself (rather than using it to train the G). Yes, inpainting regions of a real image would be one way... or noise/shuffle random pixels in just real images. Or assign all pixels in fake/real images arbitrary low/high constants

      1 reply 0 retweets 0 likes
    9. Eliezer Yudkowsky‏Verified account @ESYudkowsky 26 Nov 2017
      Replying to @gwern

      The last idea gets you constant predictions for all pixels, I expect. Training D this way seems like the obvious hard part of the problem, requiring cleverness and probably not working.

      1 reply 0 retweets 0 likes
    10. gwern‏ @gwern 26 Nov 2017
      Replying to @ESYudkowsky

      Possibly, yeah. On the other hand, REINFORCE and GANs don't seem like they should work either with crude average global feedback (and often don't). But it seems like getting richer supervision out of Ds is something obvious which someone should've tried but no one has...

      2 replies 0 retweets 0 likes
      Eliezer Yudkowsky‏Verified account @ESYudkowsky 26 Nov 2017
      Replying to @gwern

      I've seen "train D to output probabilities, but train G to match activations in pre-final layer(s) of D". Played myself with training D to estimate distance from image to reconstructed image in a VAEGAN, didn't help (when I clumsily tried) even though it's a more pixelwise idiom.

      9:02 PM - 26 Nov 2017
      0 replies 0 retweets 0 likes

      Loading seems to be taking a while.

      Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.

        Promoted Tweet

        false

        • © 2018 Twitter
        • About
        • Help Center
        • Terms
        • Privacy policy
        • Cookies
        • Ads info