Share via


SAPI Interfaces

Other versions of this page are also available for the following:

Windows Mobile Not SupportedWindows Embedded CE Supported

8/28/2008

The following table shows the Speech API (SAPI) interfaces with a description of the purpose of each.

Programming element Description

_ISpPrivateEngineCall

This interface enables private calls to a speech recognition (SR) engine.

IEnumSpObjectTokens

This interface enumerates speech object tokens.

ISpAudio

This interface represents a real-time audio stream, such as those connected to a live microphone or telephone line.

ISpCFGInterpreter

This interface interprets a context-free grammar (CFG).

ISpCFGInterpreterSite

This interface sets properties and text replacements in a phrase.

ISpContainerLexicon

This interface defines a container lexicon that automatically loads the user lexicon and all available application lexicons when created.

ISpDataKey

This interface provides a mechanism for storing and retrieving string and other data.

ISpErrorLog

This interface prepares a log file of compilation error messages.

ISpEventSink

This interface allows event sources to send events directly to an event sink through a free-threaded call.

ISpEventSource

This interface provides the mechanism to filter and queue events.

ISpGramCompBackend

This interface represents the grammar compiler back end that works with a binary grammar.

ISpGrammarBuilder

This interface exposes the back end of the SAPI context-free grammar (CFG) compiler.

ISpGrammarCompiler

This interface represents the SAPI 5.0 text grammar compiler.

ISpITNProcessor

This interface performs inverse text normalization (ITN) for an ITN grammar.

ISpLexicon

This interface is used to access lexicons, which contain information about words that can be recognized or spoken.

ISpMMSysAudio

This interface represents the audio implementation for the standard Windows multimedia layer (wave in and wave out).

ISpNotifyCallback

This interface allows the application to receive notifications.

ISpNotifySink

This interface specifies an event sink to receive notifications.

ISpNotifySource

This interface allows the application to configure the events that trigger notifications.

ISpNotifyTranslator

This interface converts a call to the ISpNotifySink::Notify method into the appropriate notification.

ISpObjectTokenCategory

This interface defines an object token category

ISpObjectTokenEnumBuilder

This interface maintains an object token enumerator (ISpObjectToken).

ISpObjectTokenInit

This interface initializes an object token to use a data key.

ISpObjectToken

This interface handles object token entries.

ISpObjectWithToken

This interface represents any object that is associated with an object token.

ISpPhoneConverter

This interface enables the client to convert from the SAPI character phone set to the identifier phone set.

ISpPhraseAlt

This interface enables applications to retrieve alternate phrase information from an SR engine.

ISpPhraseBuilder

This interface builds a phrase from the provided elements.

ISpPhrase

This interface accesses information contained in a phrase.

ISpRecoContext

This interface enables applications to create different functional views or contexts of the SR engine.

ISpRecognizer

This interface enables applications to control aspects of an SR engine and its audio input.

ISpRecoGrammar

This interface enables applications to manage the words and phrases that the SR engine will recognize.

ISpRecoResult

This interface enables applications to retrieve information about hypotheses, recognitions, and false recognitions of the SR engine.

ISpRegDataKey

This interface creates a new data key using a specific key in the registry for storage.

ISpResourceManager

This interface provides access to resources shared by different speech components in the same process.

ISpSRAlternates

This interface allows alternate word selection and implementation for speech recognition.

ISpSREngineSite

This interface gets audio data, retrieves grammar information, sends events, and returns recognition information to SAPI.

ISpSREngine

This interface defines an SR engine.

ISpStreamFormatConverter

This interface is the primary interface implemented by the SAPI audio data format converter.

ISpStreamFormat

This interface is the minimum extra interface required by SAPI on top of the IStream interface.

ISpStream

This interface provides two distinct pieces of functionality:

  • It allows the application to wrap up an existing stream by providing both an IStream and its format so that the underlying ISpStreamFormat object can provide this data to SAPI when required.
  • It allows the creation of an object suitable for SAPI use from a file using the BindToFile method.

ISpTaskManager

This interface implements a task management service provider to optimize thread use.

ISpTask

This interface is a C++ pure virtual interface, and not a COM interface. It is used by objects that want to perform atomic operations that are optimized for a multiprocessor computer.

ISpThreadControl

This interface controls a thread.

ISpThreadTask

This interface supports initializing and processing of an allocated thread, and is used to simplify thread-based operations.

ISpTokenUI

This interface provides programmatic management of a UI associated with an object implementing ISpObjectToken.

ISpTranscript

This interface represents the transcript for wave audio files supported by the SAPI SpStream object.

ISpTTSEngineSite

This interface writes audio data and queues events for the TTS engine.

ISpTTSEngine

This interface defines the SAPI 5.0 text-to-speech (TTS) engine, a driver that is used for speech synthesis.

ISpVoice

This interface enables an application to perform text synthesis operations.

See Also

Reference

SAPI Reference

Other Resources

Speech API (SAPI) 5.0