Tweetovi
- Tweetovi, trenutna stranica.
- Tweetovi i odgovori
Blokirali ste korisnika/cu @UMassScott
Jeste li sigurni da želite vidjeti te tweetove? Time nećete deblokirati korisnika/cu @UMassScott
-
I agree completely. The practice of making claims, particularly in regards to performance, without sufficient support needs to end.https://twitter.com/pyoudeyer/status/1222911050028285954 …
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
I have a favor to ask. For 20+ years I've been working on a dream: to make all science funded by US taxpayers freely available to all. We are on the verge of achieving this. But we need to show that people care. So please, if you can, sign this letter: http://oaintheusa.com
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Delighted to share our Science article on making it easier ensure AI systems satisfy societal values. https://news.stanford.edu/2019/11/21/stanford-helps-train-ai-not-misbehave/?sf112920043=1 … Lead by former postdoc Phil Thomas, w/Castro da Silvam, Barto, Giguere, Brun.
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
In research published today in
@sciencemag,#UMassCICS researchers Philip Thomas,@YuriyBrun & Andy Barto + colleagues from@ufrgsnoticias,@Stanford introduce a new “Seldonian” framework for fairer, safer#ML algorithms: http://bit.ly/3384Csn#computing4commongood#isaacasimovpic.twitter.com/j9VoDrou3AHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Pretty good article discussing
@OpenAI's Rubik's Cube result: https://www.skynettoday.com/briefs/openai-rubiks-cube … This is what good criticism looks like, unlike most of the critical tweets mentioned in this article. Good work@jackyliang42 and@andrey_kurenkov.Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Philip Thomas is the Asst Professor and co-director of the Autonomous Learning Lab at
@UMassAmherst. He shared some of his upcoming work on a new framework for designing ML algorithms. View slides from his "Safe and Fair Reinforcement Learning" talk here: https://aka.ms/AA66ama pic.twitter.com/ZjcDo33JAz
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Really clear and thoughtful review of an amazing decade of thinking by
@yael_nivhttps://www.nature.com/articles/s41593-019-0470-8 …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
We are pleased to announce the 3rd edition of the reproducibility challenge at #NeurIPS2019!
Researchers and students at all levels are encouraged to participate.
Registration Deadline: Nov. 1 2019
Report Submission Deadline: Dec. 1 2019
https://reproducibility-challenge.github.io/neurips2019/ Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Is the Policy Gradient a Gradient?. Chris Nota and Philip S. Thomas http://arxiv.org/abs/1906.07073
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je TweetHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
-
Scott Jordan proslijedio/la je Tweet
Reinforcement Learning When All Actions are Not Always Available. Yash Chandak, Georgios Theocharous, Blossom Metevier, and Philip S. Thomas http://arxiv.org/abs/1906.01772
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Lifelong Learning with a Changing Action Set. Yash Chandak, Georgios Theocharous, Chris Nota, and Philip S. Thomas http://arxiv.org/abs/1906.01770
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
RL folks: are there any theoretical results for policy gradient methods that prove that lower variance via a baseline actually improves convergence rates (in certain settings)? All the arguments I've ever seen are super hand-wavy and basically just say "variance is bad".
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
if the high-point of your @emnlp2019 submission is "we sota'ed" then consider arxiving instead, because your paper probably isn't very interesting for curious reviewers who want to learn (a.k.a. scientists who provide blind peer reviews as a service)
#scienceisnotaleaderboardHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
My first ICLR was a blast! Notes for
@iclr2019 available here: https://david-abel.github.io/notes/iclr_2019.pdf …#ICLR2019pic.twitter.com/BH0W80w8B4
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments. Kaleigh Clary, Emma Tosch, John Foley, and David Jensen http://arxiv.org/abs/1904.06312
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Reinforcement Learning Without Backpropagation or a Clock. James Kostas, Chris Nota, and Philip S. Thomas http://arxiv.org/abs/1902.05650
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning. Francisco M. Garcia and Philip S. Thomas http://arxiv.org/abs/1902.00843
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
Learning Action Representations for Reinforcement Learning. Yash Chandak, Georgios Theocharous, James Kostas, Scott Jordan, and Philip S. Thomas http://arxiv.org/abs/1902.00183
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Scott Jordan proslijedio/la je Tweet
ToyBox: Better Atari Environments for Testing Reinforcement Learning Agents. John Foley, Emma Tosch, and Kaleigh Clary http://arxiv.org/abs/1812.02850
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.