
Registering a Word Breaker and the Related Linguistic Components
To register a word breaker, you must do the following:
-
Add the COM ClassID(s) for the word breaker and stemmer interfaces for the language being registered as a key to the <InstanceRoot>\MSSearch\CLSID node of the registry.
-
Add a key to the <InstanceRoot>\MSSearch\Language node for the language.
-
Add configuration values that specify the location of the lexicon, noise word, and thesaurus files for the language.
Caution: |
|---|
|
Incorrectly editing the registry can severely damage your system. Before making changes to the registry, you should back up any valued data on the computer.
|
The installation procedure for third-party word breakers licensed by Microsoft consists of three stages.
Note: |
|---|
|
The Danish word breaker is used as an example in the steps below. Values to install other language word breakers are provided in the tables later in this topic.
|
Stage 1: Add the COM ClassID(s) for the Word Breaker and Stemmer Interfaces for the Language Being Registered
To add COM Class ID(s) for these components for the Danish language:
-
Open the Registry Editor, by:
-
Clicking Start, and clicking Run.
-
In the Run dialog box, in the Open box, type Regedit.
-
In Registry Editor, select the following registry key for the first instance of SQL Server: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSearch\CLSID
-
On the menu bar, click Edit, click New, and click Key.
-
Type {16BC5CE4-2C78-4CB9-80D5-386A68CC2B2D}.
-
Press ENTER.
-
In the right pane, right-click the Default registry value, and then click Modify.
-
In the Edit String dialog box, in the Value data box, type danlr.dll, and then click OK.
-
Repeat steps 3 through 7, replacing the value in step 4 with {83BC7EF7-D27B-4950-A743-0F8E5CA928F8}.
For a given language, follow the steps above, replacing the key values in steps 4 and 8 with the key values for the language you want. These values are listed below. In step 7, replace danlr.dll with the .dll name for the language you want.
|
Language
|
Key value for step 4
|
.DLL name for step 7
|
Key value for step 8
|
|---|
|
Danish
|
{16BC5CE4-2C78-4CB9-80D5-386A68CC2B2D}
|
danlr.dll
|
{83BC7EF7-D27B-4950-A743-0F8E5CA928F8}
|
|
Polish
|
{B8713269-2D9D-4BF5-BF40-2615D75723D8}
|
lrpolish.dll
|
{CA665B09-4642-4C84-A9B7-9B8F3CD7C3F6}
|
|
Turkish
|
{23A9C1C3-3C7A-4D2C-B894-4F286459DAD6}
|
trklr.dll
|
{8DF412D1-62C7-4667-BBEC-38756576C21B}
|
Stage 2: Add a Key to the <InstanceRoot>\MSSearch\Language Node for the Language
To add a key to this node for the Danish language:
-
Select the following registry key for the first instance of SQL Server:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSearch\Language
-
Repeat steps 3 through 5 in the preceding procedure, replacing the key name in step 4 with dan.
For a given language, follow the preceding steps, replacing the key name in step 4 with the value listed below for the specific language.
|
Language
|
Key name for step 4
|
|---|
|
Danish
|
dan
|
|
Polish
|
plk
|
|
Turkish
|
trk
|
Stage 3: Add Configuration Values That Give the Location of Each Linguistic Component for a Language
To add configuration values for these components for the Danish language:
-
Select the registry key you entered in Stage 2 above. For the first instance of SQL Server this would be:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSearch\Language\dan
-
On the menu bar, click Edit, click New, and click String Value.
-
Type NoiseFile.
-
Press ENTER.
-
Right-click the NoiseFile registry value you just added, and then click Modify.
-
In the Edit String dialog box, in the Value data box, type <Instance_specific_FTData_path>\noisedan.txt, where <Instance_specific_FTData_path> is the path retrieved in the section described earlier ("Retrieving the instance specific FTData folder path").
-
Click OK.
Repeat steps 2 through 7 for the remaining linguistic components of the language (thesaurus file, language (locale), word breaker and stemmer). The values to register those components for the Danish, Polish, or Turkish language are provided below.
Values for Danish
Repeat steps 2 through 7 to add each set of values listed below, replacing the language-specific value type (step 2), value name (steps 3 and 5), and value data (step 6) for each value.
|
Value type for step 2
|
Value names for steps 3 and 5
|
Value type for step 6
|
|---|
|
String value
|
TsaurusFile
|
<Instance_specific_FTData_path>\tsdan.xml
|
|
DWORD value
|
Locale
|
00000406
|
|
String value
|
WBreakerClass
|
{16BC5CE4-2C78-4CB9-80D5-386A68CC2B2D}
|
|
string value
|
StemmerClass
|
{83BC7EF7-D27B-4950-A743-0F8E5CA928F8}
|
Values for Polish
For the Polish language, follow the steps outlined above, using the values listed below. Select the registry key you entered for Polish in Stage 2 above. For the first instance of the SQL Server, this would be: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSearch\Language\plk
Complete steps 2 through 7 to add each set of values listed below, replacing the language-specific value type (step 2), value name (steps 3 and 5), and value data (step 6) for each value.
|
Value type for step 2
|
Value names for steps 3 and 5
|
Value data for step 6
|
|---|
|
String value
|
NoiseFile
|
<Instance_specific_FTData_path>\noiseplk.txt
|
|
String value
|
TsaurusFile
|
<Instance_specific_FTData_path>\tsplk.xml
|
|
DWORD value
|
Locale
|
00000415
|
|
String value
|
WBreakerClass
|
{CA665B09-4642-4C84-A9B7-9B8F3CD7C3F6}
|
|
String value
|
StemmerClass
|
{B8713269-2D9D-4BF5-BF40-2615D75723D8}
|
Values for Turkish
For the Turkish language, follow the steps outlined above, using the values listed below. Select the registry key you entered for Turkish in Stage 2 above. For the first instance of SQL Server, this would be: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Microsoft SQL Server\MSSQL10.MSSQLSERVER\MSSearch\Language\trk
Complete steps 2 through 7 to add each set of values listed below, replacing the language-specific value type (step 2), value name (steps 3 and 5), and value data (step 6) for each value.
|
Value type for step 2
|
Value names for steps 3 and 5
|
Value data for step 6
|
|---|
|
String value
|
NoiseFile
|
<Instance_specific_FTData_path>\noisetrk.txt
|
|
String value
|
TsaurusFile
|
<Instance_specific_FTData_path>\tstrk.xml
|
|
DWORD value
|
Locale
|
0000041f
|
|
String value
|
WBreakerClass
|
{8DF412D1-62C7-4667-BBEC-38756576C21B}
|
|
String value
|
StemmerClass
|
{23A9C1C3-3C7A-4D2C-B894-4F286459DAD6}
|