Constructing Language Resource Components
This section describes how to construct a custom language resource component. Word breakers and stemmers need a DLL wrapper that implements and exports the standard DLL entry points. Implementing a word breaker entails implementing the IWordBreaker interface. Implementing a stemmer entails implementing the IStemmer interface. The IWordBreaker and IStemmer interfaces allow flexible implementation of tokenization and morphological analysis. These interfaces can be implemented in the same DLL or in separate DLLs, depending on the approach and whether the implementations share code.
This section includes the following topics:
Implementing the DLL Entry Points provides instructions for implementing DLLs for word breakers and stemmers.
Implementing a Word Breaker provides instructions for implementing a custom word breaker by using the IWordBreaker interface.
Implementing a Stemmer provides instructions for implementing a custom stemmer by using the IStemmer interface.