gwern

@gwern

Writer, independent researcher, Internet besserwisser. 𝘞𝘢𝘵𝘢𝘴𝘩𝘪 𝘬𝘪𝘯𝘪𝘯𝘢𝘳𝘪𝘮𝘢𝘴𝘶! Links: https://www.reddit.com/r/gwern/ 

Present day. Present time. (Ahahaha!)
gwern.net
Joined November 2008

  • © 2019 Twitter

    gwern @gwern 5 Jul 2018

    One interesting thing about deep learning is that even as ever better results surface, everything we know about NNs is probably wrong. A short list (in rough chronological order):
    - "you need to pretrain a NN"
    - "NNs require thousands of datapoints to train"
    (1/10)

    1:51 PM - 5 Jul 2018

      1. gwern @gwern 5 Jul 2018

        - "NNs must be trained by backpropagation"
        - "hybrid approaches like SVMs on top of NN features will always work better"
        - "backpropagation in any form is biologically implausible"
        - "CNNs are nothing like the human visual cortex & certainly don't predict its activations"

      2. gwern @gwern 5 Jul 2018

        - "small NNs can't be trained directly, so NNs must need to be big"
        - "NNs only learn task-specific features, certainly no kind of hidden or latent 'dark knowledge'"
        - [style transfer arrives] "Who ordered that?"
        - "simple SGD is the worst update rule"

      3. gwern @gwern 5 Jul 2018

        - "simple SGD is the worst update rule"
        - "simple self-supervision like next-frame prediction can't learn semantics"
        - "adversarial examples're easy to fix & won't transfer, well, won't blackbox transfer, well, won't transfer to realworld, well..."
        - [batchnorm arrives] "Oops."

      4. gwern @gwern 5 Jul 2018

        - "big NNs overfit by memorizing data"
        - "you can't train 1000-layer NNs but that's OK, that wouldn't be useful anyway"
        - "big minibatches don't generalize"
        - "NNs aren't Bayesian at all"

      5. gwern @gwern 5 Jul 2018

        - "convolutions are only good for images; only LSTM RNNs can do translation/seq2seq/generation/meta-learning"
        - "you need small learning rates, not superhigh ones, to get fast training" (superconvergence)
        - "memory/discrete choices aren't differentiable"

      6. gwern @gwern 5 Jul 2018

        - [CycleGAN arrives] "Who ordered that?"
        - "you can't learn to generate raw audio, it's too low-level"
        - "you need bilingual corpuses to learn translation"
        - "you need shortcut connections, not new activations or initializations to train 1000-layer nets"

      7. gwern @gwern 5 Jul 2018

        - "NNs can't do zero-shot or few-shot learning"
        - "NNs can't do planning, symbolic reasoning, or deductive logic"
        - "NNs can't do causal reasoning"
        - "pure self-play is unstable and won't work"

      8. gwern @gwern 5 Jul 2018

        - "only convolutions and LSTM RNNs can do (translation/...), not feedforward NNs with attention"
        - "learning deep environment models is unstable and won't work"
        - "OK maybe we do need pretraining, er, 'warmup', after all"

      9. gwern @gwern 5 Jul 2018

        - "we need hierarchical RL to learn long-term strategies" (not bruteforce PPO)
        - "you can't reuse minibatches for faster training, it never works" (https://arxiv.org/abs/1806.07353)
        - ...

      10. gwern @gwern 5 Jul 2018

        gwern Retweeted gwern

        See also https://twitter.com/gwern/status/945019813365272576 … ...

        gwern added,

        NN layer meme explaining advantage of Zero over AG Lee in being easily trainable to deep depths
        gwern @gwern
        .@Miles_Brundage I made you an attempt at explaining the importance of AlphaGo Zero in a simple way. pic.twitter.com/AGHclgwDjh

      11. gwern @gwern 5 Jul 2018

        (As @karpathy says, 'neural networks want to work' - and they are very patient with us as we figure out every possible way to train them orders of magnitude worse, slower, and bigger than necessary...)
