Tweets
-
Stephen McAleer retweeted
This semester I'm teaching a new PhD course "Economics, AI, and Optimization." I'll be covering how AI/Opt methods enable large-scale economic solution concepts. http://www.columbia.edu/~ck2945/courses/s20_8100/ … I'm planning to share lecture notes that I hope will be of broader interest.
-
Stephen McAleer retweeted
Neural Replicator Dynamics: bringing replicator dynamics to a new level in reinforcement learning - this will unlock some more exciting work in the future! "Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients" #AAMAS2020 @sharky6000 @DustinRMorrill
-
Stephen McAleer retweeted
Meta Reinforcement Learning is good at adaptation to very similar environments. But can we meta-learn general RL algorithms? Our new approach MetaGenRL is able to. With @vansteenkiste_s and @SchmidhuberAI. Paper: https://arxiv.org/abs/1910.04098 Blog: http://louiskirsch.com/metagenrl
-
Stephen McAleer retweeted
Excited to share that we are organizing the AAAI Workshop on Reinforcement Learning in Games (AAAI-RLG), to be held February 7th or 8th in New York City! Submissions due November 15th. http://aaai-rlg.mlanctot.info with Julien Perolat and Marc Lanctot @sharky6000
-
Stephen McAleer retweeted
RLlib callbacks are a gamechanger: storing custom env metrics directly in tensorboard is invaluable.
-
Stephen McAleer retweeted
Someone is obviously really close to solving AGI: http://adamoptimizer.com
-
Stephen McAleer retweeted
Our paper on the #Pluribus poker AI is now in the print edition of @sciencemagazine, and we're on the front cover! https://science.sciencemag.org/content/365/6456/885 … pic.twitter.com/75BgMnScSO
-
Stephen McAleer retweeted
We're excited to release OpenSpiel: a framework for reinforcement learning in games. It contains over 25 games, and 20 algorithms, including tools for visualisation and evaluation. GitHub: http://github.com/deepmind/open_spiel … Paper: https://arxiv.org/abs/1908.09453 pic.twitter.com/9atJDrpHHw
-
This is a great article about the problems with the attention economy: What is the Price of our Attention? by Quentin LE GARREC https://medium.com/p/what-is-the-price-of-our-attention-607bafb7dcf3 …
-
This is what happens when your objective is to maximize users' screen time. YouTube needs to drastically change their recommendation system. https://twitter.com/Max_Fisher/status/1160950447156473856 …
-
Our paper "Solving the Rubik's Cube with Deep Reinforcement Learning and Search" has been published in Nature Machine Intelligence. You can check it out here: https://go.nature.com/2XNUF0M
-
Stephen McAleer retweeted
We've compiled a meta-reading list for our meta-learning tutorial: http://tinyurl.com/meta-reading Short list of the main papers we covered in our meta-learning tutorial: https://sites.google.com/view/icml19metalearning … https://twitter.com/chelseabfinn/status/1138334966612238336 …
-
Excited to be at #iclr2019 to present our paper on solving the Rubik's cube with reinforcement learning. https://openreview.net/forum?id=Hyfn2jCcKm …
-
Stephen McAleer retweeted
PEARL: Meta-RL that is 20-100x faster than prior methods, with better final performance, using soft actor-critic and order-invariant context embedding: https://arxiv.org/abs/1903.08254 w/ K. Rakelly, A. Zhou, D. Quillen, @chelseabfinn (ours is the blue one) pic.twitter.com/ZmpAFrsZyQ
-
Stephen McAleer retweeted
Feedback loops in recommendation systems can give rise to “echo chambers” and “filter bubbles” which can narrow a user’s content exposure, and ultimately shift their world view.
-
Stephen McAleer retweeted
We introduce α-Rank, a principled method to evaluate multi-agent strategies, grounded in a new game-theoretic solution concept, Markov-Conley Chains, unique & tractable to compute. Joint work @karltuyls, S. Omidshafiei, C. Papadimitriou and G. Piliouras: https://arxiv.org/abs/1903.01373
-
Stephen McAleer retweeted
For folks looking for a thorough intro to the mathematical foundations of reinforcement learning: Video lectures for Bertsekas’ course on RL and control are now available here: http://web.mit.edu/dimitrib/www/RLbook.html …
-
For example, the main goal of AI research seems to be to create human-level intelligence. Is that really something that we want? Although #GPT2 seems relatively harmless, it's good to start thinking about what is worth researching in the first place.
-
I agree with many of these points. However, instead of doing research on potentially harmful technology and then not releasing it, the research community needs to have a discussion about what research is not worth doing in the first place. #GPT2 https://twitter.com/jachiam0/status/1097030712945831937 …