IWordBreaker

The IWordBreaker interface is a language-specific language resource component. The word breaker parses text and identifies individual words and phrases. The word breaker is used in background processes and must be optimized for both throughput and minimal use of resources.

When to Implement

Implement this interface to create a custom word breaker for a language. Indexing Service calls the methods of this interface when it builds content indexes and runs queries.

Methods in Vtable Order
IWordBreaker method   Description
Init                  Initializes the IWordBreaker implementation and indicates the mode in which the component operates.
BreakText             Breaks text to identify words and phrases and provides the results to the WordSink and PhraseSink objects.
ComposePhrase         Not supported.
GetLicenseToUse       Gets the license information for this IWordBreaker implementation.
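
The core job of BreakText, finding word boundaries and reporting each word to a sink, can be illustrated with a minimal portable C sketch. It does not use the COM interfaces declared in indexsrv.h: the word_sink_fn callback, the break_text and sum_lengths_sink functions, and the rule that a word is a run of alphanumeric characters are all assumptions made for illustration. A real word breaker applies language-specific rules.

```c
#include <assert.h>
#include <stddef.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical stand-in for a word sink. This is NOT the WordSink
   interface from indexsrv.h; it only receives each word that is found. */
typedef void (*word_sink_fn)(const wchar_t *word, size_t cwcWord,
                             size_t offset, void *ctx);

/* Scans cwcText characters, treats each run of alphanumeric characters
   as one word, and reports it to the sink. Words longer than
   maxTokenSize are reported truncated. Returns the number of words. */
static size_t break_text(const wchar_t *text, size_t cwcText,
                         size_t maxTokenSize, word_sink_fn sink, void *ctx)
{
    size_t count = 0;
    size_t i = 0;

    assert(sink != NULL);
    assert(text != NULL || cwcText == 0);

    while (i < cwcText) {
        /* Skip separators (anything that is not a letter or digit). */
        while (i < cwcText && !iswalnum((wint_t)text[i]))
            i++;

        size_t start = i;
        while (i < cwcText && iswalnum((wint_t)text[i]))
            i++;

        if (i > start) {
            size_t len = i - start;
            if (len > maxTokenSize)
                len = maxTokenSize;   /* never report more than the limit */
            sink(text + start, len, start, ctx);
            count++;
        }
    }
    return count;
}

/* Example sink: adds up the length of every reported word. */
static void sum_lengths_sink(const wchar_t *word, size_t cwcWord,
                             size_t offset, void *ctx)
{
    (void)word;
    (void)offset;
    *(size_t *)ctx += cwcWord;
}
```

Passing the text length explicitly, rather than relying on a terminating null, keeps the scan inside the caller's buffer even when the text is not null-terminated.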

Requirements

  Windows NT or later: Requires Windows NT 4.0 with the Windows NT 4.0 Option Pack or Windows 2000 or later.

  Header: Declared in indexsrv.h.

  Import Library: User defined.

Remarks

Word breaker components for Indexing Service run in the Local System security context. Write them so that they manage buffers and the stack correctly. Every string copy must include an explicit check that guards against buffer overruns: always verify the allocated size of the destination buffer and test the size of the data against it before copying.
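
The buffer check described above might look like the following C sketch. The copy_token helper and its return convention (0 on success, -1 on failure) are hypothetical and are not part of indexsrv.h; the sketch only shows one way to test the data size against the buffer size before copying.

```c
#include <assert.h>
#include <stddef.h>
#include <wchar.h>

/* Hypothetical helper: copies cwcSrc characters from src into dst, which
   holds cwcDst characters, and null-terminates the result. Returns 0 on
   success and -1 if the arguments are invalid or the data does not fit.
   On failure, dst (when usable) is left as an empty string. */
static int copy_token(wchar_t *dst, size_t cwcDst,
                      const wchar_t *src, size_t cwcSrc)
{
    if (dst == NULL || cwcDst == 0)
        return -1;

    dst[0] = L'\0';               /* defined contents on every failure path */

    if (src == NULL)
        return -1;

    /* The buffer must hold cwcSrc characters plus the terminator.
       Comparing this way avoids computing cwcSrc + 1, which could wrap. */
    if (cwcSrc >= cwcDst)
        return -1;

    wmemcpy(dst, src, cwcSrc);

    assert(cwcSrc < cwcDst);      /* invariant: terminator index is in bounds */
    dst[cwcSrc] = L'\0';
    return 0;
}
```

The caller passes the allocated size of the destination in characters, not bytes, so the check and the copy use the same unit.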

See Also

Implementing a Word Breaker, Language Resource Samples, PhraseSink, Secure Code Practices, WordSink

 
