IBM Talks Up Natural Speech Software
Page 1 of 1
NEW YORK -- IBM today upgraded its embedded speech software to allow users of automobiles and mobile devices to issue requests as they arise.
Officials from the systems vendor convened at IBM's offices today to discuss the new "freeform command recognition." This is a technology that uses statistical language modeling and semantic interpretation to ensure natural communication between the voice recognition system and the user.
To this point, speech-activated electronics devices relied on preset commands to performing such tasks as calling up specific songs, selecting a specific radio station, or requesting directions in a car. Consumers of this technology had to memorize exact commands to do what they wanted to do.
Now IBM's Embedded ViaVoice 4.4 software allows consumers to communicate with the software using natural speech.
Aside from the obvious convenience of doing away with memorizing commands, IBM argues the natural speech approach can help keep drivers focused on the road in case they forget the commands and get distracted trying to remember them or dig up a reference sheet.
Igor Jablokov, program director of multimodal and voice portals at IBM, said many of the 2 billion telephone users in the world are becoming Internet users if they aren't already.
Increasingly, these technology users want converged devices that allow them to both access the Internet and communicate via keystrokes, as well as voice-activate their handheld devices or other machines that act on speech. It's these type of user behaviors that IBM wants to target with its information on demand and information-as-a-service strategies.
"Our goal is to deliver information on demand for people who want one-button access to the Web," Jablokov said during his turn as speaker for the event.
IBM engineers also provided demos of how the Embedded ViaVoice speech recognition engine works.
In one demo, IBM officials showed how the free-form technology can be used to control automobile functions, such as radios, CD players and climate control systems. In another, Websphere Voice Server was used to let consumers access a bank account and complete transactions.
Big Blue's speech technology improvements are receiving endorsements from partners.
Mike Kennewick, CEO of speech recognition specialist VoiceBox, today said his company would integrate the Embedded ViaVoice 4.4 speech recognition engine into VoiceBox's Navigator voice-search software. Like ViaVoice, Navigator lets users search form information using free-form speech.
VoiceBox, which already has a deal in place to provide speech recognition to XM Satellite Radio users, determines the "intent" of the user from the context of the request.
In other related speech technology news, mobile software maker Openstream has agreed to embed IBM's Websphere Everyplace Multimodal software into its software platform so that users can call up information by talking to a device or by typing on phones and other handheld devices.