Speech technology: 'Hey, I'm talkin' to you!'

It was one of the first promises made at the beginning of the PC revolution - computers that understand human speech and could speak back.

Nevertheless, voice recognition has had problems making the leap from Hollywood fantasy to the real world. ZDNet Australia takes a look at the current state of play.

Most of us take the gift of speech for granted, but what about talking to computers, or talking through a telephone to a recorded voice on a computer? Do people want to be able to talk to computers? How do they want to talk to them? Even stranger is the notion of computers talking back!

Although speech interfaces have been around for some time, it is only now that they are actually being used in a wide variety of applications and delivered through different technologies to provide different services.

In the following article, we take a look at the ways in which people communicate with each other in order to arrive at a better understanding of how we can communicate with computers.

We then provide a brief overview of speech interfaces currently in use, their differences, and the issues that surround each system.

The three main areas of speech interface examined are:

  • Text to Speech - whereby the computer converts text on a computer screen to sound,
  • IVR systems - which allow customers to navigate information and conduct services by pressing buttons on the telephone in response to computer-given instructions,
  • Speech Recognition - whereby the customer talks to the computer and it understands or interprets and may even respond vocally.

Whether it is dictating to a PC, an over-the-phone credit card payment system, a taxi booking system, or a natural language banking service, speech offers a method of interaction with computers and intelligent devices that can save us significant time, effort and even physical pain(such as RSI).

Finally this article looks at the usability of speech interfaces and what needs to be considered when developing Speech IVRs and Natural Language Speech Recognition interfaces.

Advertisement

Talkback 1 comments

    Don't forget those are hearing impaired people who cannot hear!!! I'm not impressed of this article about voice communication. Those techology will leave hearing impaired without jobs that require voice communication and can affect them in maAnonymous -- 08/09/04

    Don't forget those are hearing impaired people who cannot hear!!! I'm not impressed of this article about voice communication.

    Those techology will leave hearing impaired without jobs that require voice communication and can affect them in many ways.

    Hearing impaired user!


Latest Videos

Blogs

  • David Braue Will Rudd's bush backhaul bonanza deliver?
    Rural areas will be welcoming the government's decision to put its money where its politicising is, funnelling $250m into a regional fibre upgrade to six rural centres. Remedying over a decade of near-neglect at the hands of telecoms privatisation, the investment could be the firmest step yet for Labor's NBN dream — but with inevitable political questions and a looming election, Rudd and Conroy need to deliver, and quickly, to preserve the NBN's credibility.
  • Array Doing for AV what VoIP did for telephony
    Sydney-based start-up Audinate is making traditional analog cabling obsolete in favour of TCP/IP-based networking technology. And it's doing a pretty good job so far, with its technology used by World Youth Day and the Sydney Opera House.
  • Array WiMax in Australia: Part two
    WiMax could be the standard that drives the next phase of mobile broadband, it provides an opportunity for players wanting to establish a pure IP network to carry voice and data effectively — but is this what operators want?
  • More blogs »

Tags

Back to top

Featured