NEW YORK — IBM today upgraded its embedded speech software to allow users
of automobiles and mobile devices to issue requests as they arise.
Officials from the systems vendor convened at IBM’s offices today to discuss
the new “freeform command recognition.” This is a technology that uses
statistical language modeling and semantic interpretation to ensure natural
communication between the voice recognition system and the user.
To this point, speech-activated electronics devices relied on preset
commands to performing such tasks as calling up specific songs, selecting a
specific radio station, or requesting directions in a car. Consumers of this
technology had to memorize exact commands to do what they wanted to do.
Now IBM’s Embedded ViaVoice 4.4 software allows consumers to communicate
with the software using natural speech.
Aside from the obvious convenience of doing away with memorizing commands,
IBM argues the natural speech approach can help keep drivers focused on the
road in case they forget the commands and get distracted trying to remember
them or dig up a reference sheet.
Igor Jablokov, program director of multimodal and voice portals at IBM, said
many of the 2 billion telephone users in the world are becoming Internet
users if they aren’t already.
Increasingly, these technology users want converged devices that allow them
to both access the Internet and communicate via keystrokes, as well as voice-activate their handheld devices or other machines that act on speech. It’s these type of user behaviors that IBM wants to target with its
information on demand and information-as-a-service strategies.
“Our goal is to deliver information on demand for people who want one-button
access to the Web,” Jablokov said during his turn as speaker for the event.
IBM engineers also provided demos of how the Embedded ViaVoice speech
recognition engine works.
In one demo, IBM officials showed how the free-form technology can be used
to control automobile functions, such as radios, CD players and climate
control systems. In another, Websphere Voice Server was used to let
consumers access a bank account and complete transactions.
Big Blue’s speech technology improvements are receiving endorsements from
partners.
Mike Kennewick, CEO of speech recognition specialist VoiceBox, today said his company
would integrate the Embedded ViaVoice 4.4 speech recognition engine into
VoiceBox’s Navigator voice-search software. Like ViaVoice, Navigator lets
users search form information using free-form speech.
VoiceBox, which already has a deal in
place to provide speech recognition to XM Satellite Radio users, determines
the “intent” of the user from the context of the request.
In other related speech technology news, mobile software maker Openstream
has agreed to embed IBM’s Websphere Everyplace Multimodal software into
its software platform so that users can call up information by talking to a
device or by typing on phones and other handheld devices.