Configuring an external or custom embedding model

Release version: Yokohama

Updated July 31, 2025

1 minute to read

You can connect and configure an external or custom embedding model in the AI Search Retrieval Augmented Generation (RAG) application to generate embeddings.

Embedding model overview

The AI Search Retrieval Augmented Generation (RAG) application uses embedding models to retrieve relevant information from the indexed sources and generate embeddings. These embeddings are a way to transform complex objects—like words or images—into numerical forms that capture their meanings and relationships. This transformation helps machine learning (ML) models analyze and understand data more effectively, improving tasks like natural language processing (NLP), recommendation systems, and image recognition.

With the AI Search RAG application, you can create your own custom embedding model by using the Bring Your Own Model (BYOM) capability. The application also supports third-party embedding models, such as the Azure OpenAI Embedding model and Google Gemini Embedding model. You can configure the default connection and credential aliases that are provided for these third-party embedding models to authenticate and enable the integration between your instance and the selected embedding model.

Configuring embedding models

Configure your custom and third-party embedding models in the AI Search RAG application:

Configuring third-party embedding models

You can integrate a third-party embedding model with your system:

Set up a connection and credentials for a third-party embedding model. For more information, see Setting up connection and credentials for third-party embedding model.
Activate a third-party embedding model. For more information, see Activating the third-party embedding model.

Configuring custom embedding models

You can configure a custom embedding model end-to-end by leveraging the Bring Your Own Embedding Model (BYOM) capability. For more information, see Configuring bring your own model (BYOM).