Tweetovi
- Tweetovi, trenutna stranica.
- Tweetovi i odgovori
- Medijski sadržaj
Blokirali ste korisnika/cu @ryancareyai
Jeste li sigurni da želite vidjeti te tweetove? Time nećete deblokirati korisnika/cu @ryancareyai
-
The proposal: get AI companies to promise (ahead of time) that if AI succeeds beyond their wildest dreams, they share the benefits. Then, AI companies have a greater incentive to cooperate if vast success becomes likely. Details in the report!https://twitter.com/Cullen_OK/status/1225174676986425344 …
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Ryan Carey proslijedio/la je Tweet
Nice example of how SCM can serve as a laboratory to test various interpretations of familiar and colloquially used terms, in this case "incentives".
#Bookofwhyhttps://twitter.com/tom4everitt/status/1220657129264177152 …Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Ryan Carey proslijedio/la je Tweet
Thanks to structural causal models, we now a more precise understanding of incentives in causal influence diagrams blog post: https://medium.com/@RyanCarey/d6d8bb77d2e4 … arXiv: https://arxiv.org/abs/2001.07118
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
What if technological risks were so acute that we had to mass-surveil or suffer an inevitable catastrophe? A provocative new TED conversation (and paper) from the boss (Bostrom):https://www.youtube.com/watch?v=JrjjOGI6YB4&t …
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
-
Hvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
-
- Further discussion is here: https://medium.com/@RyanCarey/new-paper-when-is-truth-telling-favored-in-ai-debate-8f58f14562e5 … - Our paper: https://arxiv.org/abs/1911.04266 - Irving's original: https://arxiv.org/abs/1805.00899 6/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
"the tail is here", etc. Then the opposing debater would endorse and explain these features, or challenge them by zooming into smaller ones. By zooming up and down the ladder of abstraction, we argue much more efficiently. But analyzing this is a problem for further work. 5/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
e.g. if you know arguments get steadily less informative as the debate goes on, you can get a better guarantee on debate outcomes. What we didn't yet look into is abstraction. If you're arguing about a picture of a dog, you'll usually make claims about mid-sized aspects: 4/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
We analyzed what happens in such debates when there are some finite number of facts, and only can be revealed at a time. We also showed that in the general case, the outcomes are fairly sensitive to the setup of the debate -- the number of turns available, etc. 3/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
A big question for this approach is whether AI systems win more often by defending the truth, than defending some lie. When he visited FHI, Vojta Kovarik started to model these kinds of debates as extensive form games with an answering phase and an argumentation phase. 2/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi -
One proposal for getting AI systems to answer difficult questions is to have them debate alternative answers, and for a human judge to reward the AI who defended their answer more persuasively (AI Safety via Debate by Irving et al.). 1/6
Prikaži ovu nitHvala. Twitter će to iskoristiti za poboljšanje vaše vremenske crte. PoništiPoništi
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.