Tweetovi
- Tweetovi, trenutna stranica.
- Tweetovi i odgovori
- Medijski sadržaj
Blokirali ste korisnika/cu @danijarh
Jeste li sigurni da želite vidjeti te tweetove? Time nećete deblokirati korisnika/cu @danijarh
-
Prikvačeni tweet
We introduce Dreamer, an RL agent that solves long-horizon tasks from images purely by latent imagination inside a world model. Dreamer improves over existing methods across 20 tasks. paper https://arxiv.org/pdf/1912.01603.pdf … code https://github.com/google-research/dreamer … Thread
pic.twitter.com/K5DnooVIUHPrikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
If something in the forward pass needs more precision (e.g. numerically unstable ops), cast to float32 before and back to the original dtype after.
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Tried mixed precision yet? Took 10 min to set up and my model runs almost 2x faster with same results. Vars and grads are still 32 bits so it usually doesn't affect predictive performance. E.g. in TF2, set option and make all input to your layers float16 (data, RNN states, ..):pic.twitter.com/Dk7berTCGF
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Beautiful quantum physics animations from the basics like • Wave-Particle duality • How lasers work • Tunneling effect to research level stuff like • Bose-Einstein condensate • Pump-probe technique All freely available on http://QuantumMadeSimple.com pic.twitter.com/H3DK5pAZXF
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Training Neural SDEs: We worked out how to do scalable reverse-mode autodiff for stochastic differential equations. This lets us fit SDEs defined by neural nets with black-box adaptive higher-order solvers. https://arxiv.org/pdf/2001.01328.pdf … With
@lxuechen,@rtqichen and@wongtkleonard.pic.twitter.com/qlUwMxezjOPrikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
I'm excited to share that I have joined Imperial College London as a lecturer (asst prof)! I'm convinced it will be a great environment to continue working on GPs, Bayesian Deep Learning, and model-based RL. Do get in touch if you're interested joining to do a PhD!pic.twitter.com/Rrm0Bjv3Qq
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
RL shifts the question of what intelligent behavior is to finding a reward function. I think we should focus more on what environment and reward function rather than on what RL algorithm to use. Is there theory for how properties of env and reward affect the resulting behavior?https://twitter.com/kaixhin/status/1213526645438648320 …
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Bayesian methods are *especially* compelling for deep neural networks. The key distinguishing property of a Bayesian approach is marginalization instead of optimization, not the prior, or Bayes rule. This difference will be greatest for underspecified models like DNNs. 1/18https://twitter.com/carlesgelada/status/1208618401729568768 …
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
As asked by
@danijarh and others#ICLR2020 authors with >= 5 submissions sorted by acceptance rate Zhiyuan Li, Mingyuan Zhou, Deva Ramanan 4/5 80.0% Le Song 7/9 77.8% Jimmy Ba 6/8 75.0% Martin Jaggi 5/7 71.4% Abhinav Gupta 5/7 71.4% Pushmeet Kohli 6/9 66.7% Max Welling 5/8 62.5%pic.twitter.com/XyFYJPDuEV
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Trade talks, a prediction: UK - We don't like our deal EU - Why not? UK - We only get 95% of what we want EU - It only gives us 95% too UK - We want a new deal that gives us 100% of what we want EU - But that means we only get 90% of what we want 1/13
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
A
#NeurIPS2019 highlight for me was@DavidDuvenaud's refreshingly honest talk about the Neural ODEs paper, part of the retrospectives workshop. Check it out https://youtu.be/YZ-_E7A3V2w !Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
i have been laughing at this since yesterday. please turn your volume up
pic.twitter.com/SlTduFF2ffPrikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
8:30am Sunday;
#NeurIPS2019 starts with a bang!@ylecun admits to the world that he’s a Bayesian! (meme-ready short version: https://youtu.be/sRHP_WQpr1k ). :-)pic.twitter.com/ReLoLJhefQHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Dream to Control: Learning Behaviors by Latent Imagination The agent learns a latent world model via interactions, and backprops thru imagined latent trajectories of this model to learn useful behaviors.
@danijarh et al. pdf https://arxiv.org/abs/1912.01603 code https://danijar.com/project/dreamer/ …pic.twitter.com/67ePHW3Z2THvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Thanks to my advisors on the project:
@Mo_Norouzi, Tim Lillicrap, and Jimmy Ba. Let me know if you have any questions!
https://danijar.com/dreamer Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
We evaluate Dreamer across 20 challenging visual control tasks with image inputs, where it exceeds previous methods in terms of final performance, sample-efficiency, and wall-clock time. Dreamer is also applicable to discrete actions and episodes with early termination.pic.twitter.com/JsfxKPoza3
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Naturally, the value function enables longsighted behavior and lets Dreamer be robust to the imagination horizon. This lets us solve new tasks that a policy without value function or online planning with PlaNet could not solve.pic.twitter.com/fee40JWADV
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Dreamer learns a world model from experience. Inside the compact latent space of the model, it predicts actions and state values. The policy is optimized efficiently by propagating analytic value gradients back through imagined trajectories.pic.twitter.com/61JMdSV76d
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
Robo-PlaNet: Learning to Poke in a Day - a robotics project to learn a simple task from pixels on a single robot using model-based RL and real data only - this is a collaboration I worked on with Guillaume Alain &
@shoddy_robots at@MILAMontreal http://arxiv.org/abs/1911.03594Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Danijar Hafner proslijedio/la je Tweet
A mental test of mine when I write a paper, is to see whether the paper is also suitable as a “blog post” intended for a general audience, without much modifications to the text. Most of my papers are written this way, and some of them have also been published at ML conferences.https://twitter.com/ericjang11/status/1193931669956218880 …
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.
I wish academic writing/publishing conventions could be dismantled.