Search query SLA minimum replicas

March 21, 2025

SYNOPSIS

Use a minimum of 2 replicas to receive an SLA for index queries.

DESCRIPTION

AI Search (previously known as Cognitive Search) services support indexing and querying. Indexing is the process of loading content into the service to make it searchable. Querying is the process where a client searches for content by sending queries to the index.

AI Search supports a configurable number of replicas. When a service has more than one replica, queries and index updates are load balanced across them.

To receive a Service Level Agreement (SLA) for search index queries, a minimum of 2 replicas is required.

RECOMMENDATION

Consider increasing the number of replicas to a minimum of 2 to receive an SLA on index query requests.

EXAMPLES

Configure with Azure template

To deploy AI Search services that pass this rule:

  • Set the properties.replicaCount property to a minimum of 2.

For example:

{
  "type": "Microsoft.Search/searchServices",
  "apiVersion": "2022-09-01",
  "name": "[parameters('name')]",
  "location": "[parameters('location')]",
  "identity": {
    "type": "SystemAssigned"
  },
  "sku": {
    "name": "standard"
  },
  "properties": {
    "replicaCount": 3,
    "partitionCount": 1,
    "hostingMode": "default"
  }
}

Configure with Bicep

To deploy AI Search services that pass this rule:

  • Set the properties.replicaCount property to a minimum of 2.

For example:

resource search 'Microsoft.Search/searchServices@2022-09-01' = {
  name: name
  location: location
  identity: {
    type: 'SystemAssigned'
  }
  sku: {
    name: 'standard'
  }
  properties: {
    replicaCount: 3
    partitionCount: 1
    hostingMode: 'default'
  }
}
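
When the replica count is supplied as a parameter, one way to keep deployments from dropping below the SLA threshold is to constrain that parameter. The following is a minimal sketch, not part of the rule itself. The parameter names (name, location, replicaCount) and the default value are assumptions chosen for illustration. It relies on the built-in Bicep @minValue decorator to reject values below 2 at deployment time.

```bicep
// Illustrative sketch: parameter names and defaults are assumptions.
@description('The name of the AI Search service.')
param name string

@description('The location to deploy the service to.')
param location string = resourceGroup().location

// Values below 2 are rejected, so the service keeps
// the minimum replica count needed for the query SLA.
@description('The number of replicas for the service.')
@minValue(2)
param replicaCount int = 2

resource search 'Microsoft.Search/searchServices@2022-09-01' = {
  name: name
  location: location
  identity: {
    type: 'SystemAssigned'
  }
  sku: {
    name: 'standard'
  }
  properties: {
    replicaCount: replicaCount
    partitionCount: 1
    hostingMode: 'default'
  }
}
```

With this constraint in place, a deployment that passes a replicaCount of 1 fails parameter validation instead of creating a service without a query SLA.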