Tweetovi
- Tweetovi, trenutna stranica.
- Tweetovi i odgovori
- Medijski sadržaj
Blokirali ste korisnika/cu @seungjaeryanlee
Jeste li sigurni da želite vidjeti te tweetove? Time nećete deblokirati korisnika/cu @seungjaeryanlee
-
My first paper got published! But I still need 1-2 ML papers published before I apply to graduate school
https://www.tandfonline.com/doi/full/10.1080/10586458.2019.1702123 …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Seungjae Ryan Lee proslijedio/la je Tweet
How to Read a Paper, by Srinivasan Keshav. http://blizzard.cs.uwaterloo.ca/keshav/home/Papers/data/07/paper-reading.pdf …pic.twitter.com/hyZnqBLAjd
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Great papers accepted to ICLR 2020! Here's a summary of the two I particularly enjoyed reading:https://www.endtoend.ai/rl-weekly/38
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Seungjae Ryan Lee proslijedio/la je Tweet
What novel techniques in deep RL fundamentally advanced the state of the art this year, in ways that are very likely to be attributable to the novel techniques and not other features of the codebase / optimization tricks / "tuning" by hyperparameter or architecture fiddling?
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
ICLR 2020 workshops are out:
@iclr_conf https://iclr.cc/Conferences/2020/Schedule …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
A set of environments with procedurally generated content (level layout, game assets, entity spawn location and timing, etc.) to encourage agents robust to variations. By
@karlcobbe,@christophrhesse, Jacob Hilton, and John Schulmanhttps://twitter.com/OpenAI/status/1201909508009627649 …
0:29Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
It is possible capture credit assignment explicitly with "hindsight distribution" and use it to estimate value function and policy gradient. By
@aharutyu et al.https://twitter.com/aharutyu/status/1202594577842085888 …
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Overparametrized networks can help with agent CNNs focusing on background rather than important objects. By Xingyou Song,
@yidingjiang, Yilun Du, and@bneyshabur https://arxiv.org/abs/1912.02975Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
A lot of exciting papers last/this week, probably due to NeurIPS! I summarized a few of them in my weekly newsletter:https://www.endtoend.ai/rl-weekly/37
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Seungjae Ryan Lee proslijedio/la je Tweet
Also at our
#NeurIPS2019 booth,@sguada will demo@TensorFlow-Agents (http://goo.gle/367vB9w ).#TFAgents is a reliable, scalable and easy to use RL library. It targets distributed and large-scale RL, and can be easily integrated into your projects and deployed to production.Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Seungjae Ryan Lee proslijedio/la je Tweet
Postdoc positions in theoretical machine learning at Princeton CS Dept. Relevant faculty include Elad Hazan, Ryan Adams, Yoram Singer, and me. Mention in cover letter which faculty you are interested in. Best to apply by Dec 15; latest by Jan 10. https://mltheory.cs.princeton.edu/positions/
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Seungjae Ryan Lee proslijedio/la je Tweet
Postdoc positions available at COS and EE Princeton, in the theoretical machine learning group, pls see application details below by
@prfsanjeevarora.https://twitter.com/prfsanjeevarora/status/1201902884163592194 …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Got the stickers today! Thank you
@_inesmontani and@honnibal
https://twitter.com/spacy_io/status/1174708066585067525 …pic.twitter.com/F3LaHQI0NQ
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
I just realized that everything in my TODO list is inspired by people I follow on Twitter
1. Swift for TensorFlow (inspired by @DynamicWebPaige) 2. Private AI (inspired by@iamtrask) 3. Unity ML-Agents (inspired by@awjuliani) 4. Robot Control (inspired by@svlevine)Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Given raw pixel observations, GAIL discriminators use irrelevant details to discern agent and expert. Let's add a constraint so that the discriminator can't discern agent and expert if there is no meaningful behavior! By
@konradzolna and@DeepMindAIhttps://twitter.com/konradzolna/status/1179847352087191552 …
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Using tanh() function immediately after additive noise doesn't work well... let's normalize them first! This replicates the effect and performance of MaxEnt (SAC) on TD3. By C. Wang, Y. Wu, Q. Vuong, and K. Ross https://arxiv.org/abs/1910.02208
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Train off-policy RL, define new macro actions from common action sequences and add them to action space. Repeat! Improves performance of DDPG and SAC on Atari envs: By P. Christodoulou,
@RobertTLange,@ali_shafti, and@analogaldo https://arxiv.org/abs/1910.02876Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Here are three papers I really liked published in arXiv last two weeks:https://www.endtoend.ai/rl-weekly/33
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Thoroughly enjoying the RL day! There is a lot to learnhttps://twitter.com/MSFTResearch/status/1179791591046832128 …
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
The Appendix of the new Attraction-Repulsion Actor Critic paper is a great example of using this checklist! https://arxiv.org/abs/1909.07543 (Paper by Thang Doan,
@bogdan_mazoure,@audurand Joelle Pineau (who authored the checklist), and@devon_hjelm)https://twitter.com/benhamner/status/1176941629015379969 …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.