A 'powerful' AGI is one with a sufficiently huge potential impact that we have to align it. Operationally, let's say an AGI is 'powerful' if it can invent and deploy biotechnology at least 10 years in advance of the human state of the art.
-
-
Prikaži ovu nit
-
An 'aligned' powerful AGI is one that can be pointed in any direction at all, even what seems like a simple task that isn't morally fascinating. E.g. "Place, onto this particular plate here, two strawberries identical down to the cellular but not molecular level."
Prikaži ovu nit -
A 'safely' aligned powerful AI is one that doesn't kill everyone on Earth as a side effect of its operation; or as a somewhat more stringent requirement, one that has less than a 50% chance of killing more than a billion people.
Prikaži ovu nit -
Safely aligning a powerful AI will be said to be 'difficult' if that work takes two years longer or 50% more serial time, whichever is less, compared to the work of building a powerful AI without trying to safely align it.
Prikaži ovu nit
Kraj razgovora
Novi razgovor -
-
-
I don't think it's "difficult"--it's logically impossible. And undesirable. Isn't it? In what sense would it be a "powerful AGI" if it was constrained (i.e., doomed) by the goal we programmed into it?
-
Canonical reply at:https://arbital.com/p/orthogonality
- Još 6 drugih odgovora
Novi razgovor -
-
-
But isn't it obvious by now that Friendly AI is a less tractable problem of Friendly Humans? (Some) Humans are Friendly because of game theoretic checks we call society. Wouldn't a Friendly AGI be one living in a society of other AGIs, checked by quorum, incentivized by AI money?
-
So the simple solution to building a Friendly AI is... building a society of Friendly AIs. Easy!
- Još 2 druga odgovora
Novi razgovor -
-
-
We can't even safely align humans, and humans do have human values by birth right.
- Još 1 odgovor
Novi razgovor -
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.