Surface Form Normalization

Note

Indexing Service is no longer supported as of Windows XP and is unavailable for use as of Windows 8. Instead, use Windows Search for client-side search and Microsoft Search Server Express for server-side search.

This section lists considerations for parsing and normalizing the surface forms of a language. It covers the following topics:

  • Hyphenation describes how word breakers handle hyphenation.
  • Possessives describes how word breakers handle possessives.
  • Diacritics describes how word breakers and stemmers might handle diacritics.
  • Clitics describes how word breakers and stemmers might handle clitics.
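To make the idea of surface form normalization concrete, here is a minimal Python sketch of three of the considerations listed above: possessives, diacritics, and hyphenation. It does not reproduce the behavior of any Indexing Service word breaker or stemmer. All function names are hypothetical, the rules assume English-like text, and clitics are left out because their handling depends heavily on the language.

```python
import unicodedata
from typing import List


def strip_diacritics(word: str) -> str:
    """Remove combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def strip_possessive(word: str) -> str:
    """Drop an English-style possessive ending (assumed rule, not a spec)."""
    # Handle both the ASCII apostrophe and the typographic one (U+2019).
    for suffix in ("'s", "\u2019s"):
        if word.lower().endswith(suffix):
            return word[: -len(suffix)]
    # Plural possessive such as "dogs'": drop the trailing apostrophe.
    if word.endswith(("'", "\u2019")):
        return word[:-1]
    return word


def hyphen_variants(word: str) -> List[str]:
    """Return the hyphen-separated parts plus the joined form."""
    parts = [p for p in word.split("-") if p]
    if len(parts) < 2:
        return [word]
    return parts + ["".join(parts)]


def normalize_surface_form(token: str) -> List[str]:
    """Hypothetical pipeline: possessive, then diacritics, case, hyphens."""
    base = strip_diacritics(strip_possessive(token)).lower()
    return hyphen_variants(base)


if __name__ == "__main__":
    for token in ("Café's", "data-base", "naïve", "dogs'"):
        print(token, normalize_surface_form(token))
```

Emitting both the parts and the joined form of a hyphenated word is one possible design choice; it lets a query for either spelling match the same document. An actual word breaker may make a different choice for each language.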