Science & Technology
The film is science fiction but a computer scientist and entrepreneur Steven Omohundro says that "anti-social" artificial intelligence in the future is not only possible, but probable, unless we start designing AI systems very differently today.
Omohundro's most recent paper, published in the Journal of Experimental & Theoretical Artificial Intelligence, lays out the case.
We think of artificial intelligence programs as somewhat humanlike. In fact, computer systems perceive the world through a narrow lens, the job they were designed to perform.
Microsoft Excel understands the world in terms of numbers entered into cells and rows; autonomous drone pilot systems perceive reality as a bunch calculations and actions that must be performed for the machine to stay in the air and to keep on target. Computer programs think of every decision in terms of how the outcome will help them do more of whatever they are supposed to do. It's a cost vs. benefit calculation that happens all the time. Economists call it a utility function, but Omohundro says it's not that different from the sort of math problem going in the human brain whenever we think about how to get more of what we want at the least amount of cost and risk.
For the most part, we want machines to operate exactly this way. The problem, by Omohundro's logic, is that we can't appreciate the obsessive devotion of a computer program to the thing it's programed to do.
Put simply, robots are utility function junkies.
Even the smallest input that indicates that they're performing their primary function better, faster, and at greater scale is enough to prompt them to keep doing more of that regardless of virtually every other consideration. That's fine when you are talking about a simple program like Excel but becomes a problem when AI entities capable of rudimentary logic take over weapons, utilities or other dangerous or valuable assets.
In such situations, better performance will bring more resources and power to fulfill that primary function more fully, faster, and at greater scale. More importantly, these systems don't worry about costs in terms of relationships, discomfort to others, etc., unless those costs present clear barriers to more primary function. This sort of computer behavior is anti-social, not fully logical, but not entirely illogical either.
Omohundro calls this approximate rationality and argues that it's a faulty notion of design at the core of much contemporary AI development.
"We show that these systems are likely to behave in anti-social and harmful ways unless they are very carefully designed. Designers will be motivated to create systems that act approximately rationally and rational systems exhibit universal drives towards self-protection, resource acquisition, replication and efficiency. The current computing infrastructure would be vulnerable to unconstrained systems with these drives," he writes.
The math that explains why that is Omohundro calls the formula for optimal rational decision making. It speaks to the way that any rational being will make decisions in order to maximize rewards and lowest possible cost. It looks like this:
In the above model, A is an action and S is a stimulus that results from that action. In the case of utility function, action and stimulus form a sort of feedback loop. Actions that produce stimuli consistent with fulfilling the program's primary goal will result in more of that sort of behavior. That will include gaining more resources to do it.
For a sufficiently complex or empowered system, that decision-making would include not allowing itself to be turned off, take, for example, a robot with the primary goal of playing chess.
"When roboticists are asked by nervous onlookers about safety, a common answer is 'We can always unplug it!' But imagine this outcome from the chess robot's point of view," writes Omohundro. "A future in which it is unplugged is a future in which it cannot play or win any games of chess. This has very low utility and so expected utility maximisation will cause the creation of the instrumental subgoal of preventing itself from being unplugged. If the system believes the roboticist will persist in trying to unplug it, it will be motivated to develop the subgoal of permanently stopping the roboticist," he writes.
In other words, the more logical the robot, the more likely it is to fight you to the death.
The problem of an artificial intelligence relentlessly pursuing its own goals to the obvious exclusion of every human consideration is sometimes called runaway AI.
The best solution, he says, is to slow down in our building and designing of AI systems, take a layered approach, similar to the way that ancient builders used wood scaffolds to support arches under construction and only remove the scaffold when the arch is complete.
That approach is not characteristic of the one we are taking today, putting more and more resources and responsibility under the control of increasingly autonomous systems. That's especially true of the U.S. military, which is looking to deploy larger numbers of lethal autonomous systems, or L.A.Rs into more contested environments. Without better safeguards to prevent these sorts of systems from, one day, acting rationally, we are going to have an increasingly difficult time turning them off.
Reader Comments
When was the last time, tell me, when systems built by those living ==> in their heads <== where designed, overall, to be wholly rational solution generators.
The waste from nuclear power creation springs right to mind.
The same humanity created since the age of "Enlightenment" and rationality and science shoved God off the scene.
"We show that these systems are likely to behave in anti-social and harmful ways unless they are very carefully designed. Designers will be (are!!) motivated to create systems that act approximately rationally and rational systems exhibit universal drives towards self-protection, resource acquisition, replication and efficiency. The current computing infrastructure would be vulnerable to unconstrained systems with these drives," he writes.
'Tis the story of Frankenstein right under our nose - the Frankenstein is us! It was all along; the narrative re-framed with object and subject reversed in time to pre-empt anyone getting a whiff. It's called brainwashing.
On one level the USA is already being governed by an AI program which is incentivised to provide support for a chosen few to enhance and increase their levels of power and income streams. The concept of humanity and rational thinking is not even a consideration to those steering the system. It is simply a matter of increasing wealth and power. We are living in a corrupted program.
There was movie a by this name released many years ago, probably 30 years or so with precisely this theme. Two computers, one American and one USSR conspired to prevent the human goverments from committng suicide. If anyone is interested it is worth the cost of a rental if its available..







many things can happen if you feed your quantum computer with a dangerous neural network. And this guy has a point. There may be abstractly speaking dangerous 'intersections' in the meaning of the code i.e. there is a goal to persist in existance and performing tasks and the goal of removing obstacles, which could combine in theory into a nasty situation. But that would be the fault of the programmers for discovering these venn-logical overlapse possibilities and situations....
But back to reality. As it has been mentioned here on sott and on the forum: in life, you get exactly what you put in. ESPECIALLY in programming. If you make a calculator in C++ with a neural network to get better calculations, do you think that the code will write itself that would allow it to force people to help it work better?