Internationalization support for AI Search
Summarize
Summary of Internationalization support for AI Search
AI Search supports indexing and searching content in multiple languages offered by the ServiceNow AI Platform®. This internationalization support is automatic and covers over twenty languages including Arabic, Chinese (Simplified and Traditional), French, German, Japanese, Korean, Portuguese, Spanish, and Turkish among others. Language-specific tokenization and linguistic processing enable accurate content indexing and search query interpretation across these languages.
Show less
Indexing and Search Behavior
- Language Detection and Tokenization: AI Search identifies the language of content from Task table records, non-Task records, and external documents to apply language-specific tokenization. This ensures proper separation of words and phrases, which varies by language—for example, contextual tokenization for Chinese and Japanese.
- Reindexing Requirement: After activating a new language plugin, all relevant content must be reindexed to make it searchable in the new language. Reindexing is also recommended following certain updates to improve tokenization or feature support on specific languages.
- Multi-language Content Handling: AI Search can recognize and index embedded text regions in Japanese and Chinese within content of other languages, applying appropriate tokenization per segment.
- Search Query Processing: Search queries are tokenized based on the user’s session language, with special handling for Japanese and Chinese terms even if the session language differs, enabling accurate multi-language search queries.
Language-Dependent Features
Certain search capabilities depend on language support and alignment between the query and indexed content languages:
- Genius Results: Configurations triggered by natural language understanding (NLU) models require the search query and model to share the same language. Models using large language models (LLMs) or heuristic approaches support a subset of languages.
- Normalization and Lemmatization: AI Search performs language-specific lemma and Unicode normalization for terms, with additional processing such as term decompounding for Germanic and Nordic languages, and algorithmic stemming for Finnish.
- Result Improvement Rules, Stop Words, and Synonyms: These are applied only if their language matches the search query or is set to all languages.
- Typo Handling: Auto-correction is available for most supported languages except Japanese, Korean, and Chinese variants.
Handling Translated Content and Multilingual Search
- AI Search supports searching translated fields and documents, improving recall in multilingual environments by allowing users to find knowledge articles in multiple languages associated with their country or session.
- Administrators can configure globally searchable knowledge articles and control translation behavior of reference field values in search facets for better user experience across languages.
Fallback Languages and Filtering
- When fallback languages are set for a user session, AI Search searches translated documents in both the session and fallback languages, applying appropriate language dictionaries for stop words, synonyms, and typo handling.
- A global fallback locale can be configured to further enhance translated content search coverage.
- Search results from external content connectors are filtered to include only content in the user's session language, fallback language, or global fallback locale language, ensuring relevant multilingual search results.
AI Search supports indexing and search in all languages offered by the ServiceNow AI Platform®. Search linguistic features are supported in Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish, Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish.
Internationalization support is automatically enabled and isn't configurable.
| Date | Languages to reindex |
|---|---|
| August 2024 | Brazilian Portuguese, Dutch, Italian, Japanese, Korean, Portuguese, Swedish |
| August 2025 | Finnish |
| December 2025 | Polish |
| June 2026 | Arabic, Czech, Danish, Hebrew, Hungarian, Norwegian (Bokmål), Russian, Turkish |
Indexing behavior in supported languages
When indexing content and metadata from a ServiceNow AI Platform source record or an external document, AI Search uses tokenization settings for the language of the record or document, as shown in the following table.
| Record or Document | Tokenization Settings |
|---|---|
| Source record from the Task [task] table or one of its child tables | AI Search performs language identification and uses tokenization settings for the detected language to index the record's content and metadata. Note: Language identification only identifies Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish,
Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish. Content in other languages is identified and treated as English. |
| Source record from a non-Task table | AI Search uses tokenization settings for the record's language to index its content and metadata.
If the record has no language specified, the ServiceNow AI Platform treats it as being in the instance's default language. In an English instance, for example, AI Search indexes records without specified languages using tokenization settings for English. |
| External document | AI Search performs language identification and uses tokenization settings for the detected language to index the document's content and metadata. Note: Language identification only identifies Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish,
Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish. Content in other languages is identified and treated as English. |
Search query behavior in supported languages
AI Search compares your search query terms with terms from indexed content and metadata, returning search results for indexed records or documents that contain matches. When your search terms are in the same language as the indexed terms, AI Search processes both sets of terms with the same tokenization settings, producing predictable matches and search results. If your search terms aren't in the same language as the indexed terms, AI Search processes the two sets of terms with different tokenization settings and matching may be unpredictable.
Language dependence for search features
The following search features are language-dependent and supported only for the listed languages.
| Feature | Language dependence and supported languages |
|---|---|
| Genius Results | AI Search only evaluates Genius Result configurations with NLU triggers if the linked NLU model has the same language as the search query. Genius Results configurations that use an LLM or a heuristic model, such as Now Assist Multi-Content Response or Q&A, are limited to languages supported by the model. Supported languages: Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish, Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish. |
| Language identification and tokenization | During indexing, AI Search identifies supported languages in Task table records and external documents. Text processing for the indexed content uses tokenization settings for the identified language.
Supported languages: Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish, Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish. |
| Lemma and Unicode normalization | AI Search performs language-specific lemma normalization for terms in indexed content and search queries.
Supported languages: Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French, French - Canada, German, Hebrew, Hungarian, Italian, Japanese, Korean, Norwegian (Bokmål), Polish,
Portuguese, Russian, Simplified Chinese, Spanish, Swedish, Traditional Chinese, and Turkish.
Note: For German, Danish, Hungarian, Korean, Norwegian (Bokmål), and Swedish, AI Search performs term decompounding in addition to lemma normalization. For Finnish, AI Search uses algorithmic stemming to identify lemmas. AI Search performs Unicode normalization for all terms in indexed content and search queries. For more information on normalization of lemmas and Unicode forms in indexed content and search queries, see Lemma and Unicode normalization. |
| Result improvement rules | AI Search only evaluates activation for result improvement rules that have the same language as the search query or that have All Languages specified.
Supported languages: All languages activated in your instance. For the list of languages you can activate, see Activate a language. |
| Stop words | AI Search only considers stop words from dictionaries that have the same language as the search query.
Supported languages: All languages activated in your instance. For the list of languages you can activate, see Activate a language. |
| Synonyms | AI Search only considers synonyms from dictionaries that have the same language as the search query.
Supported languages: All languages activated in your instance. For the list of languages you can activate, see Activate a language. |
| Typo handling | AI Search derives a separate list of auto-correction terms for each supported language found in search source indexed content. Auto-correction only replaces search query terms with terms from
the list that has the same language as the search query.
Supported languages: Arabic, Brazilian Portuguese, Czech, Danish, Dutch, English, Finnish, French - Canada, French, German, Hebrew, Hungarian, Italian, Norwegian (Bokmål), Polish, Portuguese, Russian, Spanish, Swedish, and Turkish. Typo handling isn't supported for Japanese, Korean, Simplified Chinese, or Traditional Chinese. |