Microsoft KB Archive/271818

From BetaArchive Wiki
Knowledge Base


How to configure Windows 2000 Indexing Service to use the Neutral word breaker

Article ID: 271818

Article Last Modified on 2/20/2007



APPLIES TO

  • Microsoft Windows 2000 Server
  • Microsoft Windows 2000 Advanced Server
  • Microsoft Windows 2000 Professional Edition
  • Microsoft Index Server 2.0
  • Microsoft Windows Indexing Service 2.0



This article was previously published under Q271818

IMPORTANT: This article contains information about modifying the registry. Before you modify the registry, make sure to back it up and make sure that you understand how to restore the registry if a problem occurs. For information about how to back up, restore, and edit the registry, click the following article number to view the article in the Microsoft Knowledge Base:

256986 Description of the Microsoft Windows Registry


SUMMARY

Some searches against Indexing Service may fail when you use a language-specific word breaker; however, you can use the Neutral word breaker to work around this issue. This "More Information" section of this article explains how to remove the language-specific word breaker so that the Neutral word breaker is used.

MORE INFORMATION

WARNING: If you use Registry Editor incorrectly, you may cause serious problems that may require you to reinstall your operating system. Microsoft cannot guarantee that you can solve problems that result from using Registry Editor incorrectly. Use Registry Editor at your own risk.

To remove all word breakers except the Neutral word breaker, follow these steps:

  1. Start Registry Editor (Regedt32.exe).
  2. Locate the following key in the registry:

    HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control\ContentIndex\Language

  3. Select the Language key, and then save this key to a backup file.
  4. Delete all registry keys except Neutral.
  5. Locate the following key in the registry:

    HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control\ContentIndex

  6. Select the DLLsToRegister registry key, and then save this key to a backup file.
  7. Double-click the DLLsToRegister value, and then remove the line with the Infosoft.dll file to prevent re-registering the other languages. Click OK.
  8. Quit Registry Editor.
  9. Stop Indexing Service, and then restart Indexing Service.
  10. Rebuild the catalog by doing a full build.

Please note the following differences that occcur when you use only the Neutral word breaker:

  • In all languages, "advanced" normalization acts differently:
    • Dates: "9/9/99" and "9-9-1999" do not match.
    • Numbers: "1,000" and "1000" do not match.
    • Hyphens: "database" and "data-base" do not match.
  • In German, compound decomposition does not work.
  • In other European languages, clitic stripping is lost. Loss of clitic stripping may reduce the chances of matching.
  • Word stemming no longer works. In word stemming, a query for "flew**" returns documents that match words like "flew," "fly," and "flown."


Keywords: kbhowto KB271818