TtsEngineAction Enumeration

Specifies the Speech Synthesis Markup Language (SSML) action to be taken in rendering a given TextFragment.

Namespace: System.Speech.Synthesis.TtsEngine
Assembly: System.Speech (in system.speech.dll)

public enum TtsEngineAction
public enum TtsEngineAction
public enum TtsEngineAction

 Member nameDescription
BookmarkIndicates that TextFragment is to be used as the contents of a bookmark. 

Corresponds to the <mark> XML tag in the SSML specification.

ParseUnknownTagIndicates that no action has been determined from SSML input. 

The associated TextFragment is an unknown XML tag not part of the SSML standard.

This input that may be interpreted or ignored by at the discretion of a synthesis engine.

PronounceRequest that input TextFragment text be interpreted as phonemes.  

Exact pronunciation is specified by the Phoneme member of the FragmentState object returned by the State property on TextFragment instances.

Corresponds to the <Phoneme> XML tag in the SSML specification.

SilenceIndicates that a TextFragment contains no text to be rendered as speech.  

Duration of the silence is specified by the Duration member of the he FragmentState object returned by the State property on TextFragment instances.

Corresponds to the <Silence> XML tag in the SSML specification.

SpeakRequests that the associated TextFragment should be processed and spoken. 

This is the default value of TtsEngineAction for a TextFragment,

Corresponds to the <speak> XML tag in the SSML specification

SpellOutIndicates that text values provided by a TextFragment through its TextToSpeak property are to be synthesize as individual characters.  

This rendering includes punctuation, other than white space, as well as alphanumeric text.

For example thee associated text fragment "word!" should be synthsized into "w o r d exclamation point".

StartParagraphIndicates state of paragraph. 

Corresponds to the <p> XML tag in the SSML specification

StartSentenceIndicates start of sentence. 

Corresponds to the <s> XML tag in the SSML specification

TtsEngineAction represents requests for servicing a TextFragment. The actions correspond closely to elements in the SSML specification and are implemented on the text returned by the TextToSpeak property on a TextFragment.

The TtsEngineAction value associated with a TextFragment is returned by the State property.

Processing of the TtsEngineAction value returned by the State property is handled by a speech synthesizes implementation of the Speak method on the a class derived from TtsEngineSsml.

The example below is part of a custom speech synthesis implementation inheriting from TtsEngineSsml, and using the use of TextFragment, FragmentState, and TtsEventID

The implementation of Speak

  1. Receives an array of TextFragment instances and creates a new array of TextFragment instances to be passed to the Speak method on an underlying synthesis engine.

  2. If the TtsEngineAction enumeration value by found from the Action property on the FragmentState returned by the State property of each TextFragment instance is Speak, the implementation

    • Translates Americanism to Britishisms in the text to be spoken.

    • If the EventInterest property on the ITtsEngineSite interfaces provided to the implementation support the WordBoundary event type, an event to drive a synthesizer progress meter is created.

  3. A speech rendering engine is then called with the modified TextFragment array.

private const int WordBoundaryFlag = 1 << (int)TtsEventId.WordBoundary;
private readonly char[] spaces = new char[] { ' ', '\t', '\r', '\n' };
internal struct UsVsUk
  internal string UK;
  internal string US;

override public void Speak (TextFragment [] frags, IntPtr wfx, ITtsEngineSite site)
  TextFragment [] newFrags=new TextFragment[frags.Length];
  for (int i=0;i<frags.Length;i++){
    newFrags[i].TextToSpeak = frags[i].TextToSpeak.Substring(frags[i].TextOffset,
    newFrags[i].TextLength = newFrags[i].TextToSpeak.Length;
    newFrags[i].TextOffset = 0;
    if (newFrags[i].State.Action == TtsEngineAction.Speak) {
      //Us to UK conversion
      foreach (UsVsUk term in TransList) {
      newFrags[i].TextToSpeak.Replace(term.US, term.UK);
      //Generate progress meter events if supported
      if ((site.EventInterest & WordBoundaryFlag) != 0) {
      string[] subs = newFrags[i].TextToSpeak.Split(spaces);
      foreach (string s in subs) {
        int offset = newFrags[i].TextOffset;
        SpeechEventInfo spEvent = new SpeechEventInfo((Int16)TtsEventId.WordBoundary, 
                 s.Length, new IntPtr(offset));
        offset += s.Length;
        if (s.Trim().Length > 0) {
          SpeechEventInfo[] events = new SpeechEventInfo[1];
          events[0] = spEvent;
          site.AddEvents(events, 1);
  _baseSynthesize.Speak(newFrags, wfx, site);

Windows 98, Windows Server 2000 SP4, Windows CE, Windows Millennium Edition, Windows Mobile for Pocket PC, Windows Mobile for Smartphone, Windows Server 2003, Windows XP Media Center Edition, Windows XP Professional x64 Edition, Windows XP SP2, Windows XP Starter Edition

The Microsoft .NET Framework 3.0 is supported on Windows Vista, Microsoft Windows XP SP2, and Windows Server 2003 SP1.

.NET Framework

Supported in: 3.0

Community Additions