Speech input gets a boost with Microsoft SDK, SALT spec

[PROGRAMMERS REPORT -Sept 24, 2002] - It has been a long wait, but it may be happening. The speech technology market is set for a bounce. In fact, according to In-Stat/MDR analyst Brian Strachman, we're on the 'cusp of a speech recognition revolution.' The Kelsey Group thinks so, too, and expects spending on speech recognition to reach $41 billion by 2005.

One of the most important drivers in this market is the emergence of developer tools, as well as a resulting increase in application production speed coupled with lower development costs, wrote Strachman in a recently published report. Microsoft's .NET Speech SDK, in particular, is likely to push speech app development.

Microsoft announced the beta release of its SDK, a Web developer tool the Redmond software maker bills as the first product based on the new Speech Application Language Tags (SALT) specification, in May. According to James Mastan, group product manager for Microsoft's .NET speech technologies group, the toolset is intended to 'help jumpstart the industry' for building speech-enabled Web applications. Coupled with Microsoft's Visual Studio .NET developer tools, the SDK is designed to add voice to the list of methods for inputting data, which includes the mouse, keyboard and stylus.

Microsoft's entrance into this market has received mixed reviews, but is generally considered a positive development.

'We view Microsoft's involvement here as a good thing,' said Steve Chambers, chief marketing officer at SpeechWorks. 'The company's presence tends to legitimize the technology and draw a lot of attention. And it doesn't hurt that Microsoft is seeding the developer community, so millions of Visual Studio .NET developers will now be able to code speech apps.'

The 1.0 version of the .NET Speech SDK can be used to retool an existing Web application created with Microsoft developer tools. Features include workspaces for programming applications and for creating the spoken questions and answers that a voice-enabled application would need to understand.

However, Meta Group analyst Earl Perkins views Microsoft's speech-tech activities as a double-edged sword. 'I think it has had both a positive and negative effect,' he said. 'It drew attention to a growing market, because Microsoft never enters a market unless they realize there's money to be made. But on the other hand, they introduced another standard, so there may be a bit of a delay while vendors sort out how they're going to support both of them.'

The older standard, Voice eXtensible Markup Language (VoiceXML), has been around for about two-and-a-half years, and there are more than 600 vendors and service providers who currently adhere to this particular standard for development.

SALT was created by the SALT Forum, a group of technology companies working together to accelerate the development of speech technologies in telephony and so-called multimodal systems. Founding members of the group include SpeechWorks, Intel, Cisco Systems, Philips, Comverse and Microsoft.

The SALT Forum has offered the SALT spec to the World Wide Web Consortium (W3C), the same standards body working on VoiceXML. The Forum has asked the W3C Multimodal Interaction Working Group and Voice Browser Working Group to review the SALT specification as part of their development of standards for promoting multimodal interaction and voice-enabling the Web.

Links:
For other Programmer Report articles, please go to http://www.adtmag.com/article.asp?id=6265

About the Author

John K. Waters is a freelance writer based in Silicon Valley. He can be reached at [email protected].