Extending Language Resources
Windows Search uses language resources, such as word breakers and stemmers, to process text according to the rules of its language and locale during index creation and query processing. Microsoft provides word breakers and stemmers for several languages. This section describes how to implement and use custom word breakers and stemmers for languages and locales beyond those provided by Microsoft.
- Understanding Language Resource Components
- Implementing a Word Breaker and Stemmer
- Linguistic and Unicode Considerations
- Troubleshooting Language Resources and Best Practices
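As a rough illustration of the two roles named above, the following Python sketch splits text into word tokens and then reduces each token to a crude stem. It is a conceptual toy only, not the Windows Search API: the function names (`break_words`, `stem`, `index_terms`) and the English suffix list are hypothetical, and a real stemmer may instead expand a term into its inflected forms rather than strip suffixes.

```python
import re

# Hypothetical toy suffix rules for English; real stemmers are
# language-specific components with far richer linguistic rules.
_SUFFIXES = ("ing", "ed", "s")


def break_words(text):
    """Word-breaker role: split text into lowercase word tokens."""
    return [w.lower() for w in re.findall(r"\w+", text)]


def stem(word):
    """Stemmer role: strip one known suffix, keeping at least 3 characters."""
    for suffix in _SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def index_terms(text):
    """Apply word breaking, then stemming, as an indexer or query parser might."""
    return [stem(w) for w in break_words(text)]


if __name__ == "__main__":
    print(index_terms("Indexing indexed files"))
```

Applying the same pipeline to both indexed content and query text is what lets a query for one word form match documents that contain another.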
Additional Resources
- For a list of languages supported by word breakers, see Languages Supported by Windows Search.
- If you need to identify the language of a piece of text, you can use Language Auto-Detection (LAD), which is available in Windows 7 and later. For more information, see Extended Linguistic Services (ELS).
- For applicable reference documentation, see Data Add-in Interfaces.