Learning Without Thinking

As machine learning continues to develop, the intuition that thinking necessarily precedes learning — much less that humans alone can learn — should wane.

Artwork by Noah Campeau for Noema Magazine

Jacob Browning is a postdoc in NYU’s Computer Science Department working on the philosophy of AI.

“Mindless learning.” The phrase looks incoherent — how could there be learning without a learner?

Learning — broadly defined as improving at solving a given task over time — seems to require some conscious agent reflecting on what is happening, drawing out connections and deciding what strategies will work better. Yet, contrary to that intuition, learning is possible in the absence of any thought or even the capacity to think — agentless, mindless learning. In fact, this kind of learning is central to contemporary artificial intelligence.

Tracing a history of the idea of mindless learning can replace our anthropocentric intuitions about learning and thinking with an awareness of the different ways — both natural and artificial — that problems can be solved. It can also reshape our sense of what is capable of learning, and of the benefits attached to non-human kinds of learning.

As it is commonly understood, thinking is a matter of consciously trying to connect the dots between ideas. It’s only a short step for us to assume that thinking must precede learning, that we need to consciously think something through in order to solve a problem, understand a topic, acquire a new skill or design a new tool. This assumption — an assumption shared by early AI researchers — suggests that thinking is the mechanism that drives learning. Learning depends on reasoning, our capacity to detect the necessary connections — causal, logical and mathematical — between things.

Think of how someone learns to grasp a few geometric proofs about the length of lines and then the area of squares, moving and turning imaginary shapes in their head until they discern how the pieces relate. Identifying the essential features of lines and squares allows them to draw out necessary connections between other shapes and their interrelations — using old rules to generate novel inferences about circles, triangles and a host of irregular shapes.

“Learning is possible in the absence of any thought or even the capacity to think.”

Our capacity to reason so impressed Enlightenment philosophers that they took this as the distinctive character of thought — and one exclusive to humans. The Enlightenment approach often simply identified the human with its impressive reasoning capacities — a person understood as synonymous with their mind.

This led to the Enlightenment view that took the mind as the motor of history: Where other species toil blindly, humans decide their own destiny. Each human being strives to learn more than their parents and, over time, the overall species is perfected through the accumulation of knowledge. This picture of ourselves held that our minds made us substantively different and better than mere nature — that our thinking explains all learning, and thus our brilliant minds explain “progress.”

This picture, though flattering to our vanity as a species, was met with skepticism by less anthropocentric thinkers like David Hartley, David Hume and James Mill. These skeptics instead argued that human learning is better understood as similar to the stimulus-response learning seen in animals, which hinges on creating associations between arbitrary actions that become lifelong patterns of behavior.

Training an animal often involves getting it to associate a sign (like snapping fingers) with an action (like sitting) by rewarding it with a treat for success. The same associations work equally well among humans: A teacher can reward correct answers with treats or gold stars to promote more studying, and a manager can praise good work to encourage similar behavior in the future.

This approach — later popularized by the behaviorists — held that rewarded associations can account not just for trained behavior but for any aspect of animal behavior, even seemingly “thoughtful” behavior. My cat seems to understand how can openers work and that the cans they open contain cat food, but there isn’t necessarily any reasoning involved. Anything that sounds like the can opener, or even looks like a can, will result in the same behavior: running into the kitchen, meowing expectantly, scratching at the empty food bowl. And what works for animals works in humans as well. Humans can accomplish many tasks through repetition without understanding what they are doing, as when children learn to multiply by memorizing multiplication tables through tedious practice and recitation.

“Where other species toil blindly, humans decide their own destiny.”

Many philosophers and scientists have argued that associative learning need not be limited to explaining how an individual animal learns. They contend that arbitrary interactions among diverse, non-cooperating agents could still lead to problem-solving behavior — a spontaneous organization of things without any organizer.

The 18th-century economist Adam Smith, for example, formulated the concept of the “invisible hand of the market,” which revealed the capacity for millions of localized interactions between strangers to push a market toward a dynamic, efficient and thoroughly amoral deployment of resources over time. This perspective makes it possible to cast off the rationalist conviction that the mind is the acting force of history and that all progress is the result of “geniuses” and “great men” (because it was always men in these male philosophers’ telling). Rather, “progress” — if it is appropriate to use the term — in politics, language, law and science results not from any grand plan but instead from countless, undirected interactions over time that adaptively shape groups toward some stable equilibrium amongst themselves and their environment.

As Charles Darwin saw, this kind of adaptation was not unique to humans or their societies. The development of species emerges from chance events that snowball over time into increasingly complex capacities and organs. This doesn’t involve progress or the appearance of “better” species, as some assumed at the time. Rather, it suggests that members of a species will eventually develop an adequate fit with other species in a given environment. As such, mindless learning is more natural and commonplace than the minded variety we value so highly.

This history helps us recognize contemporary artificial intelligence as deploying a familiar, well-used technique for solving problems: many trials, a mechanism for varying responses on each trial and some indication of whether a solution is a better or worse fit for the task at hand. The mathematical sophistication and sheer computing power behind contemporary AI systems can deceive us into thinking the process is a kind of technological wizardry only possible through human ingenuity.
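To see just how short that recipe is, here is a minimal sketch in Python of trial-and-error learning built from those three ingredients: repeated trials, random variation and a score indicating better or worse fit. The bit-string target and the fitness function are invented purely for illustration; no actual AI system is being reproduced here.

```python
import random

# A toy task: blindly recover a hidden bit string. The target and the
# fitness score stand in for whatever problem a system is adapting to.
TARGET = [1, 0, 1, 1, 0, 0, 1, 0]

def fitness(candidate):
    # Indication of better or worse fit: how many bits match the target.
    return sum(c == t for c, t in zip(candidate, TARGET))

def vary(candidate):
    # Mechanism for varying the response: flip one bit at random.
    mutant = list(candidate)
    i = random.randrange(len(mutant))
    mutant[i] = 1 - mutant[i]
    return mutant

# Many trials: keep whichever variant fits at least as well, discard the rest.
solution = [random.randint(0, 1) for _ in range(len(TARGET))]
for _ in range(1000):
    variant = vary(solution)
    if fitness(variant) >= fitness(solution):
        solution = variant

print(solution, fitness(solution))  # converges on the target
```

Nothing in that loop reflects, imagines or understands; it only varies, scores and keeps.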

But it’s possible for us to think about machine learning otherwise. The artist Philipp Schmitt, for example, built his physical neural network out of simple parts, wooden boards and very dumb sensors to highlight the underlying simplicity of machine learning that’s lost when it gets gussied up in complex diagrams and mathematical formalisms. The machine arrives at appropriate answers without consciousness, imagination or reasoning; these are all irrelevant for the kind of learning that Schmitt’s neural network undertakes.

As with evolution, there are far more trials ending in failure than in success. However, these failures are a feature, not a bug. Repeated failure is essential for mindless learning to test out many possible solutions — as in the 30 million trials leading up to AlphaGo’s famous move 37 in its game against the second-best Go player in the world, Lee Sedol. Going through so many trials meant AlphaGo played many moves no human had ever made, principally because these moves usually result in losing. Move 37 was just such a long shot, and it was so counterintuitive that it made Lee visibly uncomfortable.

“‘Progress’ in politics, language, law and science results not from any grand plan but instead from countless, undirected interactions over time.”

New mindless learning machines — like the headline-grabbing GPT-3, a tool that uses deep learning to create human-readable text — help point toward the expansive possibilities of non-human intelligence. GPT-3 is trained by hiding a handful of words in millions upon millions of texts, forcing the machine to guess each hidden word, and then telling it how different its answers are from the original. In the process, it learns when to use a noun instead of a verb, what adjective works and when a preposition is needed, and so on.
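As a rough illustration of that hide-and-guess cycle, the toy sketch below swaps GPT-3’s vast neural network for a simple table of word counts. The three-sentence corpus, the two-word context and the count-based guessing are all invented for illustration; they show only the shape of the objective, not how GPT-3 is actually built.

```python
from collections import Counter, defaultdict

# Toy corpus standing in for "millions upon millions of texts".
corpus = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "the cat slept on the rug",
]

# The "model": for each pair of preceding words, counts of what came next.
model = defaultdict(Counter)

def guess(ctx):
    best = model[ctx].most_common(1)
    return best[0][0] if best else None

for epoch in range(2):
    errors = 0
    for sentence in corpus:
        words = sentence.split()
        for i, hidden in enumerate(words):
            ctx = tuple(words[max(0, i - 2):i])  # two preceding words
            # Hide the word, guess it, and compare the guess to the original.
            if guess(ctx) != hidden:
                errors += 1
            # Adjust the counts so this context points at this word next time.
            model[ctx][hidden] += 1
    print(f"pass {epoch + 1}: {errors} wrong guesses")
# The second pass makes fewer wrong guesses: the table has adapted to the
# statistically relevant ways these few texts deploy their symbols.
```

The point of the toy is only the loop itself: hide, guess, compare, adjust, repeated on a vastly larger scale.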

Many critics rightly note that GPT-3 doesn’t “understand” the world it is talking about, but this shouldn’t be seen as a criticism. The algorithm adapts to the world of text itself — the statistically relevant ways humans deploy symbols — not to the world as such. It does not occupy our niche, the niche of social beings who use language for diverse reasons, but its own: the regular interplay of signs.

It was a failure of imagination that led to the Turing Test, which assumed language couldn’t be dissociated from human-like intelligence. That same limited viewpoint rejected the idea that language might form its own ecosystem, into which a being might fit without human-like intelligence.

Undermining the anthropocentrism of earlier assumptions, GPT-3 displays a wildly diverse facility with language. It can write poetry in many styles, code in many programming languages, create immersive interactive adventures, design websites and prescribe medicines, conduct interviews in the guise of famous people, explain jokes and generate surprisingly plausible undergraduate philosophy papers. To mock its failure to be human is to overlook how distinctly successful it is at being itself: an impossibly well-adapted machine for generating plausible scripts in almost any text-based scenario.

As machine learning continues to develop, the intuition that thinking necessarily precedes learning — much less that humans alone learn — should wane. We will eventually read surprising headlines, such as the recent finding that a machine is a champion poker player or that slime molds learn, without immediately asking: “Does it have a mental life? Is it conscious?” Mindless learning proves that many of our ideas about what a mind is can be broken into distinct mindless capacities. It pushes us to ask more insightful questions: If learning doesn’t need a mind, why are there minds? Why is any creature conscious? How did consciousness evolve in the first place? These questions help clarify both how minded learning works in humans, and why it would be parochial to treat this as the only possible kind of learning AI should aspire to.  

“Much human learning has itself been mindless.”

We should be clear that much human learning has itself been, and still is, mindless. The history of human tools and technologies — from the prehistoric hammer to the current search for effective medicines — reveals that conscious deliberation plays a much less prominent role than trial and error. And there are plenty of gradations between the mindless learning at work in bacteria and the minded learning seen in a college classroom. It would be needlessly reductive to claim, as some have, that human learning is the only “real learning” or “genuine cognition,” with all other kinds — like association, evolution and machine learning — as mere imitations.

Rather than singling out the human, we need to identify those traits essential for learning to solve problems without exhaustive trial and error. The task is figuring out how minded learning plays an essential role in minimizing failures. Simulations sidestep fatal trials; discovering necessary connections rules out pointless efforts; communicating discards erroneous solutions; and teaching passes on success. Identifying these features helps us come up with machines capable of similar skills, such as those with “internal models” able to simulate trials and grasp necessary connections, or systems capable of being taught by others.

But there is a second, broader insight. While there are good reasons to make machines that engage in human-like learning, artificial intelligence need not — and should not — be confined to simply imitating human intelligence. Evolution is fundamentally limited because it can only build on solutions it has already found, permitting only small changes in DNA from one individual to the next before a variant becomes unviable. The result is (somewhat) clear paths in evolutionary space from dinosaurs to birds, but no plausible path from dinosaurs to cephalopods. Too many design choices, and their corresponding trade-offs, are already built in.

If we imagine a map of all possible kinds of learning, the living beings that have popped up on Earth take up only a small territory, and came into being along connected (if erratic) lines. In this scenario, humans occupy only a tiny dot at the end of one of a multitude of strands. Our peculiar mental capacities could only arise in a line from the brains, physical bodies and sense-modalities of primates. Constraining machines to retrace our steps — or the steps of any other organism — would squander AI’s true potential: leaping to strange new regions and exploiting dimensions of intelligence unavailable to other beings. There are even efforts to pull human engineering out of the loop, allowing machines to evolve their own kinds of learning altogether.

The upshot is that mindless learning makes room for learning without a learner, for rational behavior without any “reasoner” directing things. This helps us better understand what is distinctive about the human mind, at the same time that it underscores why the human mind isn’t the key to understanding the natural universe, as the rationalists believed. The existence of learning without consciousness permits us to cast off the anthropomorphizing of problem-solving and, with it, our assumptions about intelligence.