Responding to speech interactions (XAML)
Incorporate voice commands, speech recognition, and text-to-speech (TTS) into the user interaction experience of your Windows Runtime app.
Speech can be a very compelling and enjoyable way for users to interact with your app. Use it as a primary or complementary input option that can increase usability and broaden the appeal of your app in the Store. Speech integration can be particularly useful where physical manipulation or eye contact is difficult or undesirable.
Prerequisites: Have a look through these topics to get familiar with the technologies discussed here.
Learn about events with Events and routed events overview
User experience guidelines:
See Speech design guidelines for helpful tips on designing a useful and engaging speech-enabled app.
Use speech recognition to provide input, specify an action or command, and accomplish tasks in your universal Windows app.
Learn how to define and use custom constraints for speech recognition.
Learn how to manage issues with speech-recognition accuracy caused by audio-input quality and condition.
Speech functionality is composed of three modes: Cortana voice commands, app-enabled speech recognition, and TTS. This illustration shows how these modes work together.
Voice commands are supported through Cortana, extended in your app, and accessed by the user from outside your app.
Once an app is installed, it can be launched through voice commands such as "open" or "start", followed by the app name. By extending voice command functionality in your app, you can link to a specific page in the app, perform a task, or initiate an action using a phrase such as "Start Contoso Search" or "Contoso Show Me My Favorites."
When you extend and customize voice commands, users can discover what phrases your app is listening for through system help and the What can I say screen.
For more info, see Quickstart: Voice commands.
Speech recognition is implemented in your app and accessed by the user from your app.
Users can provide input or accomplish tasks with speech recognition. The feature includes support for pre-defined grammars for free-text dictation and web search, and support for custom grammars authored using Speech Recognition Grammar Specification (SRGS) Version 1.0.
You can use the default system UI for speech recognition that supports disambiguation and provides visual feedback to users, or you can create your own UI.
Text-to-speech (TTS), also known as speech synthesis, is implemented in your app and accessed by the user from your app.
TTS enables your app to read aloud a basic text string, or a more complex one declared in Speech Synthesis Markup Language (SSML).
SSML provides a standard way to control characteristics of speech output such as pronunciation, volume, pitch, rate or speed, and emphasis.
Voice commands and speech recognition are not supported by Windows Store apps in Windows 8 and Windows 8.1.
- Responding to user interaction