Broadcasting IM Text Based on Speech Recognition in a UCMA Application: Introduction (Part 1 of 3)

Summary:   Combine speech recognition, the Microsoft Lync 2010 Automation API, and the Microsoft Unified Communications Managed API (UCMA) 3.0 to broadcast urgent text to clients and customers.

Applies to:   Microsoft Unified Communications Managed API (UCMA) 3.0 Core SDK | Microsoft Lync 2010 SDK | Microsoft Speech Platform SDK

Published:   November 2011 | Provided by:   John Clarkson, Microsoft | About the Author


This article is the first in a three-part series of articles about how to use speech recognition to broadcast instant messaging text from a Microsoft Unified Communications Managed API (UCMA) 3.0 application.

Imagine a typical day for Kate Berger, a stockbroker for Northwind Traders. She spends part of every day visiting clients and businesses. During one of these visits, she discovers an excellent and highly urgent investment opportunity for her clients. She has to notify them immediately to buy stock in Fabrikam Inc. as long as the price is under fifty dollars. She leaves her meeting, steps into the lobby, dials a number on her mobile phone, and says “Buy Fabrikam, maximum fifty, send”. Within minutes, each of her customers receives an instant message (IM) with the following text: “Hi. This is your broker Kate Berger with an urgent recommendation to BUY Fabrikam with a maximum value of fifty dollars.”

This article describes the steps to create this application, which uses UCMA 3.0, the Lync SDK, and the Microsoft Speech Platform SDK.

To build the application, you need the following software:

  • One of the following editions of the Microsoft Visual Studio development system: Visual Studio 2008 SP1 Standard Edition, Professional Edition, or Team Suite; Visual Basic 2008 Express Edition; Visual C# 2008 Express Edition; Visual Studio 2010 Professional, Premium, or Ultimate; Visual Basic 2010 Express; or Visual C# 2010 Express.

  • Microsoft Speech Platform SDK v10.2.

  • UCMA 3.0 Core SDK.

  • Lync SDK.

  • Microsoft .NET Framework 3.5 SP1 or a later version of .NET Framework 3.5.

The UCMA 3.0 application runs on the application server, where it waits for a call. The user places a voice call to the application by using Microsoft Lync 2010. When the UCMA 3.0 application answers, the user speaks words that the application grammar recognizes. The application uses the recognized text from the voice call to compose the text of an IM. Finally, a List object that contains the recipients' SIP URIs is passed as a parameter to the BeginStartConversation method of the Lync SDK Automation API, and the IM is sent to each URI in the list.

Figure 1. Scenario description
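
The last two steps of the scenario (composing the IM text from the recognized words, then sending it to every SIP URI in a list) can be sketched without any of the SDKs. The following is a minimal illustration in Python, not the application's actual code. UCMA and the Lync SDK are .NET libraries, and nothing here calls them. The function names compose_im and broadcast, the send callback, and the example SIP URIs are all hypothetical. The sketch also assumes that the recognized text arrives as one comma-separated phrase, that "buy" is the only command, and that one message is sent per recipient.

```python
from typing import Callable, Iterable

# Broker name taken from the scenario; hard-coded here only for illustration.
BROKER_NAME = "Kate Berger"


def compose_im(recognized: str, broker: str = BROKER_NAME) -> str:
    """Build the IM text from a phrase such as 'Buy Fabrikam, maximum fifty, send'.

    Assumption: the phrase has exactly three comma-separated parts:
    '<action> <company>', 'maximum <amount>', and the word 'send'.
    """
    # Split on commas, trim whitespace, and drop empty pieces.
    parts = [p.strip() for p in recognized.split(",") if p.strip()]
    if len(parts) != 3 or parts[2].lower() != "send":
        raise ValueError(f"unrecognized command: {recognized!r}")

    action, _, company = parts[0].partition(" ")
    limit_word, _, amount = parts[1].partition(" ")

    # The scenario shows only a "buy" command, so that is all this sketch accepts.
    if action.lower() != "buy" or not company:
        raise ValueError(f"unrecognized action: {parts[0]!r}")
    if limit_word.lower() != "maximum" or not amount:
        raise ValueError(f"unrecognized limit: {parts[1]!r}")

    return (
        f"Hi. This is your broker {broker} with an urgent recommendation "
        f"to {action.upper()} {company} with a maximum value of {amount} dollars."
    )


def broadcast(message: str, sip_uris: Iterable[str],
              send: Callable[[str, str], None]) -> int:
    """Call send(uri, message) once per recipient and return the number sent.

    The send callback is a hypothetical stand-in for whatever actually
    delivers the IM (the Lync Automation API in the scenario).
    """
    count = 0
    for uri in sip_uris:
        send(uri, message)
        count += 1
    return count


if __name__ == "__main__":
    outbox = []  # collects (uri, message) pairs instead of sending anything
    text = compose_im("Buy Fabrikam, maximum fifty, send")
    clients = ["sip:client1@example.com", "sip:client2@example.com"]
    broadcast(text, clients, lambda uri, msg: outbox.append((uri, msg)))
    for uri, msg in outbox:
        print(uri, "<-", msg)
```

Passing the delivery step in as a callback keeps the text composition separate from the messaging API, so the composition logic can be checked without a Lync client.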


John Clarkson is a programming writer with the Microsoft Lync product team.