Text To Speech
Glossary Item Box
The Text to Speech service is designed to provide your applications with a verbal interface. It can be used in conjunction with the Speech Recognizer service for two-way communication with the computer.
When the service is running you can inspect it by typing the following Uniform Resource Identifier (URI) into your web browser's address bar:
This displays the Text To Speech web page, as shown in the screenshot below. It will show you the state of the speech object (which includes the last text spoken), as well as various other speech parameters.
You can make changes using the web page, and also test out the TextToSpeech service by entering some text and pressing the Say button.
The available operations for the Text To Speech service include the following:
|SayText||Says the specified text|
|SayTextSynch||Says the specified text but does not send a response until the speaking has finished. If you are using the service from VPL you might find this operation useful because program execution can be suspended until the response is received from the Text To Speech service.|
|SetRate||Sets the speed at which text is spoken|
|SetVolume||Sets the sound volume|
|SetVoice||Selects a particular voice|
The Text To Speech service allows other services to subscribe to it. Subscribers will be notified of state updates as well as of viseme states as speech is synthesized. Whereas phonemes are the basic acoustic unit of speech, visemes are the basic visual unit of speech. Thus, a viseme describes a particular facial and oral expression that occurs alongside the voicing of phonemes.
The VisemeNotify messages can be used to assist with displaying a facial representation during speech. The service does not provide a simulated face, so you will have to write your own code for this.
As with most services, the speech parameters can be set in a configuration file. (Use the Set Configuration option in the Properties panel). Alternatively, the speech parameters can be modified through messages. The parameters include:
Changes the volume of the voice.
Valid range is 0 through 100.
Changes the speed at which the text is read.
Valid range is from -10 to 10.
Changes the speaker's voice.
Possible voices may include:
- LH Michael
- LH Michelle
- Microsoft Anna
- Microsoft Mary
- Microsoft Mike
- Microsoft Sam
- Sample TTS Voice
Short names may be used. For example, Mike may be used to select Microsoft Mike.
Note: You might only have one of these voices on your system.
VPL User Interface Services: Speech Recognition
© 2012 Microsoft Corporation. All Rights Reserved.