TMA Associates

Reprinted from Speech Strategy News, November 2009

Microsoft integrates its efforts in speech technology--the "new touch"

The Speech at Microsoft Group exemplifies a strong company commitment

Microsoft execs have indicated to the press that part of the problem with the Vista version of Windows was too many silos, a lack of communication between groups developing aspects of the operating system. The company is seeking to avoid that problem with speech technology. Zig Serafin is General Manager of the "Speech at Microsoft Group" and is responsible for both integrating the groups doing speech R&D at Microsoft and its Tellme subsidiary plus evangelizing and delivering speech solutions for some of Microsoft's most critical products.

Two years ago, Microsoft acquired Tellme Networks and has subsequently merged Microsoft?s speech development team (formerly the Speech Components Group) with Tellme to form the Speech at Microsoft group. Recent examples of the importance of speech technology at Microsoft are its inclusion in Windows 7, with apps for end users and access for outside developers (p. 10), its inclusion in Microsoft Exchange 2010 (p. 12), and Tellme's release of new interactive outbound calling solutions for customer service (p. 8).

"Voice is the new touch," says Serafin. "It's the natural evolution from keyboards and touch screens. Today, speech is rapidly becoming an expected part of our everyday experience across a variety of devices. Bill Gates articulated this vision a decade ago, and we're seeing it happen today."

Microsoft cites examples such as the Ford Sync, with in-dash voice-activated navigation and search supported by Microsoft and Tellme. Bing for Mobile and new Windows Mobile phones such as the Samsung Intrepid from Sprint are all voice-enabled. The Intrepid is the first Windows phone to use Microsoft?s Tellme voice user interface. Intrepid users simply press the Tellme button on the phone and say what they want?speak a search query, dictate a text message, or dial/text a friend by speaking their name.

The Bing for Mobile application is a free version of Bing with voice-enabled search. Using this application, people simply speak their search query to retrieve results on their Windows phone. The Bing 411 service works for any phone. People call 1-800-Bing-411, speak their search (or request other information, SSN, September 2009, p. 6), and hear the results or get a text message for later access.

Yusuf Mehdi, senior vice president of the Online Services Division at Microsoft, noted, "When you're on the go, using only keystrokes to search can be cumbersome, especially if you?re multi-tasking. It takes over 20 strokes of the keypad to find a restaurant on the Web. With Bing for Mobile or Bing 411, you simply speak your query to get results quickly, easily and safely. Using your voice to simply "say what you want and get it:" helps you do more when you're in a mobile scenario."

Larry Heck, the new chief scientist for the Speech at Microsoft group (p. 49), said, "Speech belongs in the cloud. Only there can you reach the scale, the enormous volume of interactions required to create a speech system capable of rivaling human understanding. With the formation of the Speech at Microsoft group, the unrivaled breadth of our platform today, and our cloud-based approach, this future is within sight."

 "For perhaps the first time in the history of Microsoft, we have our world-class speech scientists and highly respected software-plus-services experts under one roof, and I believe the resulting collaboration will lead to path-breaking innovation," emphasized Serafin. "The climate in our R&D environment is optimally charged to accelerate advances, leverage the power of software plus services, and revolutionize the ways customers interact with a wide range of Microsoft products."