Skip to main content

Voice recognition will always be stupid

By David R. Wheeler, Special to CNN
August 20, 2013 -- Updated 1144 GMT (1944 HKT)
STORY HIGHLIGHTS
  • David Wheeler: We all know futility or trying to make automated systems understand us
  • We put up with it because we think the technology will soon be perfected -- it won't, he says
  • He says computers can't get human logic, ambiguity; some firms put people back on phones
  • Wheeler: Unless companies design systems based on human intelligence, they'll never work

Editor's note: David R. Wheeler lives in Lexington, Kentucky, where he is a freelance writer and a journalism professor at Asbury University. Follow him on Twitter @David_R_Wheeler

(CNN) -- "Please tell me your name," a robotic female voice says to the caller.

"Larry Valentine," the caller responds.

"You said, 'Barry Shmalenpine.' Is that right?"

So begins the exchange between actor Kevin James and the automated phone system in the 2007 film "I Now Pronounce You Chuck and Larry" -- a scene that correctly assumes moviegoers have had personal experience with the absurdity of non-human customer service.

Who hasn't? We've all waited through the recitation of menu items, none of which were related to our actual question. We've all hit zero repeatedly, hoping to be transferred to a real person. When that didn't work, maybe we even lost our temper, shouting "Representative!" over and over, whether the robot had given us that option or not.

Why hasn't there been a mass revolt against automated systems? The answer is simple: We believe that this nonsense is temporary. We believe that computers are on the cusp of being able to understand human language. And that belief, according to many linguists and cognitive scientists, is completely wrong.

David Wheeler
David Wheeler

First, there's the problem of voice recognition itself. Julie Sedivy, a professor of linguistics and psychology at the University of Calgary, told me that simply recognizing speech sounds and matching them up with specific words is much more complicated than most people realize.

"The way I say 'dog' will depend on my age, gender, geographic dialect, the particular anatomy of my vocal tract, and how quickly or formally I'm speaking," she said. "Humans are able to calibrate their perception after hearing just a couple of seconds of someone's speech, but really good speech recognition is still a problem for many programs."

Although this technology has steadily improved for the past 20 years, "speech recognition systems are still markedly inferior to human beings in understanding spoken language," said John Nerbonne, a linguist and information sciences professor based in The Netherlands. "Telephones are a particularly difficult medium because they limit the signal a good deal."

Furthermore, studies show that people almost universally hate automated phone systems, and that most customers are even willing to pay more to speak to an actual person.

"Ally Bank, Discover Card and TD Bank all have ads on television right now that brag about the fact that if you call the phone number, a real live person will answer," said Adam Goldkamp, spokesperson for GetHuman, an organization dedicated to improving customer service. Such is the sad state of the current customer service world. In other words, only now are companies starting to wake up to the lost revenue potential of frustrated customers who give up on the automated system and take their business elsewhere.

"When you see companies launching ads like these, it shows they understand that there are things they can do to increase their future revenue by giving customers what they want: an actual person to speak to when they have an issue," Goldkamp said.

Maybe one day in the future, automated systems will be able to identify our words with perfect accuracy. Even then, there is still an insurmountable problem: the ability to understand what we mean by those words.

2011 Siri: Apple's new voice recognition

"We're a long way from being able to communicate with computers in real language," said Suzanne Kemmer, director of Cognitive Sciences and associate professor of Linguistics at Rice University. "Human language has a powerful design feature that works great for normal person-to-person interactions, but is completely at odds with the way computers work."

Computers are based on formal logic and fixed categories, she explained. Human language is flexible and dynamic, and follows a cognitive logic that differs fundamentally from computers. In short, human words and grammatical structures don't have fixed meanings. Instead, they have a certain amount of vagueness and ambiguity built in, so that their meaning is highly affected by context.

Actually understanding meaning is a very different problem from voice recognition or from the auto-correct on your computer or phone, Kemmer said.

When I brought up this topic with Harvard professor Steven Pinker, one of the world's most influential linguists, he noted that major companies, by looking for statistical patterns in large datasets and applying them to user input, have largely dropped the ball when it comes to real artificial intelligence: "The stupidity of a lot of computer language understanding systems comes from the fact that they've turned their backs on genuine intelligence and satisfied themselves with statistics."

In other words, computers are still very bad at trying to guess what we mean when we say something.

They also don't get our social and emotional psychology. "Often the automated phone systems were developed with a tin ear to the way people interact with each other," Pinker told me. "They sound like people, but if you think of them as such, they are the most infuriating people in the world. When I hit '0' to get a human being, and a voice dripping with a combination of mock concern and mock confusion says, 'I'm sorry, but I did not understand your answer,' I am apt to go into a rage."

"If this were a real person," Pinker added, "she would be simultaneously stupid, mendacious, and condescending."

It's time to stop the madness. We are not on the cusp of inventing computers that understand human language. Silicon Valley can, and will, continue to strive for this goal.

In the meantime, let's stop kidding ourselves. Let's admit that computers, by themselves, are terrible at customer service. Let's admit that, at a time of economic uncertainty and job losses, we should be supporting companies that employ real people to answer our questions. Let's admit that, unless we demand change, we will be forced, forever, to deal with an automated system that thinks our name is Barry Shmalenpine.

Follow us on Twitter @CNNOpinion.

Join us on Facebook/CNNOpinion.

The opinions expressed in this commentary are solely those of David R. Wheeler.

ADVERTISEMENT
Part of complete coverage on
July 30, 2014 -- Updated 1941 GMT (0341 HKT)
Stuart Gitlow says pot is addictive and those who smoke it can experience long-term psychiatric disease.
July 30, 2014 -- Updated 1645 GMT (0045 HKT)
Gabby Giffords and Katie Ray-Jones say "Between 2001 and 2012, more women were shot to death by an intimate partner in our country than the total number of American troops killed in the Iraq and Afghanistan wars combined."
July 29, 2014 -- Updated 2357 GMT (0757 HKT)
Alan Elsner says Secretary Kerry's early cease-fire draft was leaked and presented as a final document, which served the interests of hard-liners on both sides who don't want the Gaza war to stop.
July 30, 2014 -- Updated 1158 GMT (1958 HKT)
Vijay Das says Medicare is a success story that could provide health care for everybody, not just seniors
July 30, 2014 -- Updated 1818 GMT (0218 HKT)
Rick Francona says Israel seems determined to render Hamas militarily ineffective.
July 30, 2014 -- Updated 1743 GMT (0143 HKT)
S.E. Cupp says the entrepreneur and Dallas Mavericks owner thinks for himself and refuses to be confined to an ideological box.
July 30, 2014 -- Updated 1311 GMT (2111 HKT)
A Christian group's anger over the trailer for "Black Jesus," an upcoming TV show, seems out of place, Jay Parini says
July 30, 2014 -- Updated 2028 GMT (0428 HKT)
LZ Granderson says the cyber-standing ovation given to Robyn Lawley, an Australian plus-size model who posted unretouched photos, shows how crazy Americans' notions of beauty have become
July 30, 2014 -- Updated 1939 GMT (0339 HKT)
Carol Dweck and Rachel Simmons: Girls tend to have a "fixed mindset" but they should have a "growth mindset."
July 28, 2014 -- Updated 1156 GMT (1956 HKT)
A crisis like the Gaza conflict or the surge of immigrants can be an opportunity for a lame duck president, writes Julian Zelizer
July 26, 2014 -- Updated 1822 GMT (0222 HKT)
Carol Costello says the league's light punishment sent the message that it didn't consider domestic violence a serious offense
July 28, 2014 -- Updated 1251 GMT (2051 HKT)
Danny Cevallos says saggy pants aren't the kind of fashion statement protected by the First Amendment.
July 28, 2014 -- Updated 1852 GMT (0252 HKT)
Margaret Hoover says some GOP legislators support a state's right to allow same-sex marriage and the right of churches, synagogues and mosques not to perform the sacrament
July 28, 2014 -- Updated 1831 GMT (0231 HKT)
Megan McCracken and Jennifer Moreno say it's unacceptable for states to experiment with new execution procedures without full disclosure
July 30, 2014 -- Updated 1744 GMT (0144 HKT)
Priya Satia says today's drones for bombardment and surveillance have their roots in the deadly history of Western aerial control of the Middle East that began in World War One
July 28, 2014 -- Updated 1635 GMT (0035 HKT)
Jeff Yang says it's great to see the comics make an effort at diversifying the halls of justice
July 26, 2014 -- Updated 1555 GMT (2355 HKT)
Rick Francona says the reported artillery firing from Russian territory is a sign Vladimir Putin has escalated the Ukraine battle
July 27, 2014 -- Updated 1822 GMT (0222 HKT)
Paul Callan says the fact that appeals delay the death penalty doesn't make it an unconstitutional punishment, as one judge ruled
July 24, 2014 -- Updated 2225 GMT (0625 HKT)
Pilot Robert Mark says it's been tough for the airline industry after the plane crashes in Ukraine and Taiwan.
July 25, 2014 -- Updated 1510 GMT (2310 HKT)
Jennifer DeVoe laments efforts to end subsidies that allow working Americans to finally afford health insurance.
July 26, 2014 -- Updated 1533 GMT (2333 HKT)
Ruti Teitel says assigning a costly and humiliating "collective guilt" to Germany after WWI would end up teaching the global community hard lessons about who to blame for war crimes
July 25, 2014 -- Updated 1245 GMT (2045 HKT)
John Sutter responds to criticism of his column on the ethics of eating dog.
July 25, 2014 -- Updated 1302 GMT (2102 HKT)
Frida Ghitis says it's tempting to ignore North Korea's antics as bluster but the cruel regime is dangerous.
July 25, 2014 -- Updated 1850 GMT (0250 HKT)
To the question "Is Putin evil?" Alexander Motyl says he is evil enough for condemnation by people of good will.
July 24, 2014 -- Updated 1803 GMT (0203 HKT)
Laurie Garrett: Poor governance, ignorance, hysteria worsen the Ebola epidemic in Sierra Leone, Guinea, Liberia.
July 24, 2014 -- Updated 1349 GMT (2149 HKT)
Patrick Cronin and Kelley Sayler say the world is seeing nonstate groups such as Ukraine's rebels wielding more power to do harm than ever before
July 23, 2014 -- Updated 2205 GMT (0605 HKT)
Ukraine ambassador Olexander Motsyk places blame for the MH17 tragedy squarely at the door of Russia
July 24, 2014 -- Updated 1142 GMT (1942 HKT)
Mark Kramer says Russia and its proxies have a history of shooting down civilian aircraft, often with few repercussions
July 24, 2014 -- Updated 1853 GMT (0253 HKT)
Les Abend says, with rockets flying over Tel Aviv and missiles shooting down MH17 over Ukraine, a commercial pilot's pre-flight checklist just got much more complicated
July 24, 2014 -- Updated 1317 GMT (2117 HKT)
Mark Kramer says Russia and its proxies have a history of shooting down civilian aircraft, often with few repercussions
July 24, 2014 -- Updated 1637 GMT (0037 HKT)
Gerard Jacobs says grieving families and nations need the comfort of traditional rituals to honor the remains of loved ones, particularly in a mass disaster
July 24, 2014 -- Updated 1413 GMT (2213 HKT)
The idea is difficult to stomach, but John Sutter writes that eating dog is morally equivalent to eating pig, another intelligent animal. If Americans oppose it, they should question their own eating habits as well.
July 23, 2014 -- Updated 1630 GMT (0030 HKT)
Bill van Esveld says under the laws of war, civilians who do not join in the fight are always to be protected. An International Criminal Court could rule on whether Israeli airstrikes and Hamas rocketing are war crimes.
July 30, 2014 -- Updated 1209 GMT (2009 HKT)
Gordon Brown says the kidnapped Nigerian girls have been in captivity for 100 days, but the world has not forgotten them.
ADVERTISEMENT