Skip to main content

Voice recognition will always be stupid

By David R. Wheeler, Special to CNN
August 20, 2013 -- Updated 1144 GMT (1944 HKT)
STORY HIGHLIGHTS
  • David Wheeler: We all know futility or trying to make automated systems understand us
  • We put up with it because we think the technology will soon be perfected -- it won't, he says
  • He says computers can't get human logic, ambiguity; some firms put people back on phones
  • Wheeler: Unless companies design systems based on human intelligence, they'll never work

Editor's note: David R. Wheeler lives in Lexington, Kentucky, where he is a freelance writer and a journalism professor at Asbury University. Follow him on Twitter @David_R_Wheeler

(CNN) -- "Please tell me your name," a robotic female voice says to the caller.

"Larry Valentine," the caller responds.

"You said, 'Barry Shmalenpine.' Is that right?"

So begins the exchange between actor Kevin James and the automated phone system in the 2007 film "I Now Pronounce You Chuck and Larry" -- a scene that correctly assumes moviegoers have had personal experience with the absurdity of non-human customer service.

Who hasn't? We've all waited through the recitation of menu items, none of which were related to our actual question. We've all hit zero repeatedly, hoping to be transferred to a real person. When that didn't work, maybe we even lost our temper, shouting "Representative!" over and over, whether the robot had given us that option or not.

Why hasn't there been a mass revolt against automated systems? The answer is simple: We believe that this nonsense is temporary. We believe that computers are on the cusp of being able to understand human language. And that belief, according to many linguists and cognitive scientists, is completely wrong.

David Wheeler
David Wheeler

First, there's the problem of voice recognition itself. Julie Sedivy, a professor of linguistics and psychology at the University of Calgary, told me that simply recognizing speech sounds and matching them up with specific words is much more complicated than most people realize.

"The way I say 'dog' will depend on my age, gender, geographic dialect, the particular anatomy of my vocal tract, and how quickly or formally I'm speaking," she said. "Humans are able to calibrate their perception after hearing just a couple of seconds of someone's speech, but really good speech recognition is still a problem for many programs."

Although this technology has steadily improved for the past 20 years, "speech recognition systems are still markedly inferior to human beings in understanding spoken language," said John Nerbonne, a linguist and information sciences professor based in The Netherlands. "Telephones are a particularly difficult medium because they limit the signal a good deal."

Furthermore, studies show that people almost universally hate automated phone systems, and that most customers are even willing to pay more to speak to an actual person.

"Ally Bank, Discover Card and TD Bank all have ads on television right now that brag about the fact that if you call the phone number, a real live person will answer," said Adam Goldkamp, spokesperson for GetHuman, an organization dedicated to improving customer service. Such is the sad state of the current customer service world. In other words, only now are companies starting to wake up to the lost revenue potential of frustrated customers who give up on the automated system and take their business elsewhere.

"When you see companies launching ads like these, it shows they understand that there are things they can do to increase their future revenue by giving customers what they want: an actual person to speak to when they have an issue," Goldkamp said.

Maybe one day in the future, automated systems will be able to identify our words with perfect accuracy. Even then, there is still an insurmountable problem: the ability to understand what we mean by those words.

2011 Siri: Apple's new voice recognition

"We're a long way from being able to communicate with computers in real language," said Suzanne Kemmer, director of Cognitive Sciences and associate professor of Linguistics at Rice University. "Human language has a powerful design feature that works great for normal person-to-person interactions, but is completely at odds with the way computers work."

Computers are based on formal logic and fixed categories, she explained. Human language is flexible and dynamic, and follows a cognitive logic that differs fundamentally from computers. In short, human words and grammatical structures don't have fixed meanings. Instead, they have a certain amount of vagueness and ambiguity built in, so that their meaning is highly affected by context.

Actually understanding meaning is a very different problem from voice recognition or from the auto-correct on your computer or phone, Kemmer said.

When I brought up this topic with Harvard professor Steven Pinker, one of the world's most influential linguists, he noted that major companies, by looking for statistical patterns in large datasets and applying them to user input, have largely dropped the ball when it comes to real artificial intelligence: "The stupidity of a lot of computer language understanding systems comes from the fact that they've turned their backs on genuine intelligence and satisfied themselves with statistics."

In other words, computers are still very bad at trying to guess what we mean when we say something.

They also don't get our social and emotional psychology. "Often the automated phone systems were developed with a tin ear to the way people interact with each other," Pinker told me. "They sound like people, but if you think of them as such, they are the most infuriating people in the world. When I hit '0' to get a human being, and a voice dripping with a combination of mock concern and mock confusion says, 'I'm sorry, but I did not understand your answer,' I am apt to go into a rage."

"If this were a real person," Pinker added, "she would be simultaneously stupid, mendacious, and condescending."

It's time to stop the madness. We are not on the cusp of inventing computers that understand human language. Silicon Valley can, and will, continue to strive for this goal.

In the meantime, let's stop kidding ourselves. Let's admit that computers, by themselves, are terrible at customer service. Let's admit that, at a time of economic uncertainty and job losses, we should be supporting companies that employ real people to answer our questions. Let's admit that, unless we demand change, we will be forced, forever, to deal with an automated system that thinks our name is Barry Shmalenpine.

Follow us on Twitter @CNNOpinion.

Join us on Facebook/CNNOpinion.

The opinions expressed in this commentary are solely those of David R. Wheeler.

ADVERTISEMENT
Part of complete coverage on
April 16, 2014 -- Updated 1642 GMT (0042 HKT)
Rick McGahey says Rep. Paul Ryan is signaling his presidential ambitions by appealing to hard core Republican values
April 16, 2014 -- Updated 1539 GMT (2339 HKT)
Paul Saffo says current Google Glasses are doomed to become eBay collectibles, but they are only the leading edge of a surge in wearable tech that will change our lives
April 15, 2014 -- Updated 1849 GMT (0249 HKT)
Kathleen Blee says the KKK and white power or neo-Nazi groups give haters the purpose and urgency to use violence.
April 16, 2014 -- Updated 1156 GMT (1956 HKT)
Sen. Sheldon Whitehouse and Rep. Henry Waxman say read deep, and you'll see the federal Keystone pipeline report spells out the pipeline is bad news
April 16, 2014 -- Updated 1153 GMT (1953 HKT)
Frida Ghitis says President Obama needs to stop making empty threats against Russia and consider other options
April 15, 2014 -- Updated 2129 GMT (0529 HKT)
Peter Bergen and David Sterman say the Kansas Jewish Center killings are part of a string of lethal violence in the U.S. that outstrips al Qaeda-influenced attacks. Why don't we pay more attention?
April 15, 2014 -- Updated 1641 GMT (0041 HKT)
Danny Cevallos says families of the passengers on Malaysian Airlines Flight 370 need legal counsel
April 14, 2014 -- Updated 1523 GMT (2323 HKT)
David Frum says Russia is on a rampage of mischief while Western leaders and Western alliances charged with keeping the peace hem and haw
April 14, 2014 -- Updated 1156 GMT (1956 HKT)
Most adults make the mistakes of hitting the snooze button and of checking emails first thing in the morning, writes Mel Robbins
April 14, 2014 -- Updated 1754 GMT (0154 HKT)
David Wheeler says as middle-class careers continue to disappear, we need a monthly cash payment to everyone
April 14, 2014 -- Updated 1155 GMT (1955 HKT)
Democrats need to show more political spine when it comes to the issue of taxes.
April 14, 2014 -- Updated 1555 GMT (2355 HKT)
Donna Brazile recalls the 50th Anniversary of the Civil Rights Act as four presidents honored the heroes of the movement and Lyndon Johnson, who signed the law
April 14, 2014 -- Updated 1317 GMT (2117 HKT)
Elmer Smith remembers Chuck Stone, the legendary journalist from Philadelphia who was known as a thorn in the side of police and an advocate for the little guy
April 13, 2014 -- Updated 1856 GMT (0256 HKT)
Al Franken says Comcast, the nation's largest cable provider, wants to acquire Time Warner Cable, the nation's second-largest cable provider. Should we be concerned?
April 11, 2014 -- Updated 1522 GMT (2322 HKT)
Philip Cook and Kristin Goss says the Pennsylvania stabbing attack, which caused grave injury -- but not death, carries a lesson on guns for policymakers
April 11, 2014 -- Updated 1906 GMT (0306 HKT)
Wikipedia lists 105 football movies, but all too many of them are forgettable, writes Mike Downey
April 11, 2014 -- Updated 1432 GMT (2232 HKT)
John Sutter and hundreds of iReporters set out to run marathons after the bombings -- and learned a lot about the culture of running
April 11, 2014 -- Updated 1649 GMT (0049 HKT)
Timothy Stanley says it was cowardly to withdraw the offer of an honorary degree to Ayaan Hirsi Ali. The university should have done its homework on her narrow views and not made the offer
April 11, 2014 -- Updated 1416 GMT (2216 HKT)
Al Awlaki
Almost three years after his death in a 2011 CIA drone strike in Yemen, Anwar al-Awlaki continues to inspire violent jihadist extremists in the U.S, writes Peter Bergen
April 12, 2014 -- Updated 0121 GMT (0921 HKT)
David Bianculli says Colbert is a smart, funny interviewer, but ditching his blowhard persona to take over the mainstream late-night role may cost him fans
April 10, 2014 -- Updated 1731 GMT (0131 HKT)
Rep. Paul Ryan says the Republican budget places its trust in the people, not in Washington
April 10, 2014 -- Updated 2128 GMT (0528 HKT)
Aaron David Miller says Obama isn't to blame for Kerry's lack of progress in resolving Mideast talks
April 14, 2014 -- Updated 1522 GMT (2322 HKT)
David Weinberger says beyond focusing on the horrors of the attack a year ago, it's worth remembering the lessons it taught about strength, the dangers of idle speculation and Boston's solidarity
April 10, 2014 -- Updated 1632 GMT (0032 HKT)
Katherine Newman says the motive for the school stabbing attack in Pennsylvania is not yet known, but research on such rampages turns up similarities in suspects and circumstances
April 11, 2014 -- Updated 1103 GMT (1903 HKT)
Simon Tisdall: Has John Kerry's recent track record left Russia's wily leader ever more convinced of U.S. weakness?
April 10, 2014 -- Updated 1640 GMT (0040 HKT)
Mel Robbins says Nate Scimio deserves credit for acting bravely in a frightening attack and shouldn't be criticized for posting a selfie afterward
April 9, 2014 -- Updated 1839 GMT (0239 HKT)
Wendy Townsend says the Rattlesnake Roundup -- where thousands of pounds of snakes are killed and tormented -- is barbaric
April 10, 2014 -- Updated 1345 GMT (2145 HKT)
Dr. Mary Mulcahy says doctors who tell their patients the truth risk getting bad ratings from them
April 9, 2014 -- Updated 1328 GMT (2128 HKT)
Peggy Drexler says the married Rep. McAllister, caught on video making out with a staffer, won't get a pass from voters who elected him as a Christian conservative with family values
April 9, 2014 -- Updated 1143 GMT (1943 HKT)
David Frum says the president has failed to react strongly to crises in Iran, Syria, Ukraine and Venezuela, encouraging others to act out
April 9, 2014 -- Updated 2057 GMT (0457 HKT)
Eric Liu says Paul Ryan gets it very wrong: The U.S.'s problem is not a culture of poverty, it is a culture of wealth that is destroying the American value linking work and reward
April 9, 2014 -- Updated 1151 GMT (1951 HKT)
Frida Ghitis writes: "We are still seeing the world mostly through men's eyes. We are still hearing it explained to us mostly by men."
April 10, 2014 -- Updated 1408 GMT (2208 HKT)
Chester Wisniewski says the Heartbleed bug shows how we're all tangled together, relying on each other for Internet security
April 9, 2014 -- Updated 1926 GMT (0326 HKT)
Danny Cevallos says an Ohio school that suspended a little kid for pointing his finger at another kid and pretending to shoot shows the growth in "zero tolerance" policies at school run amok
ADVERTISEMENT