Question 1

What is Azure AI Search?

Accepted Answer

Azure AI Search provides a dedicated search engine and persistent storage of your searchable content for full text and vector search scenarios. It also includes optional, integrated AI to extract more text and structure from raw content, and to chunk and vectorize content for vector search.

Question 2

How do I work with Azure AI Search?

Accepted Answer

The primary workflow is create, load, and query an index. Although you can use the portal for most tasks, Azure AI Search is intended to be used programmatically, handling requests from client code. Programmatic support is provided through REST APIs and client libraries in .NET, Python, Java, and JavaScript SDKs for Azure.

Question 3

Are "Azure Search" and "Azure Cognitive Search" and "Azure AI Search" the same product?

Accepted Answer

Azure Search was renamed to Azure Cognitive Search in October 2019 to reflect the expanded (yet optional) use of cognitive skills and AI processing in service operations. Azure Cognitive Search was renamed to Azure AI Search in October 2023 to align with Azure AI services.

Question 4

What languages are supported?

Accepted Answer

The default analyzer used for tokenization is standard Lucene and it is language agnostic. Otherwise, language support is expressed through language analyzers that apply linguistic rules to inbound (indexing) and outbound (queries) content. Some features, such as speller, are limited to a subset of languages.

Question 5

How do I integrate search into my solution?

Accepted Answer

Client code should call the Azure SDK client libraries or REST APIs to connect to a search index, formulate queries, and handle responses. You can also write code that builds and refreshes an index, or runs indexers programmatically or by script.

Question 6

Is there functional parity across the various APIs?

Accepted Answer

Not always. The REST API is always the first to implement new features in preview API versions. The client libraries in Azure SDKs will pick up new features over time, but are released on their own schedule.

Although the REST APIs are first out with newest features, the Azure SDKs provide more coding support, and are recommended over REST unless a required feature is unavailable.

Question 7

Can I pause the service and stop billing?

Accepted Answer

You can't pause a search service. In Azure AI Search, computing resources are allocated when the service is created. It's not possible to release and reclaim those resources on-demand.

Question 8

Can I upgrade, downgrade, rename or move the service?

Accepted Answer

Service tier, name, and region are fixed for the lifetime of the service.

Question 9

If I migrate my search service to another subscription or resource group, should I expect any downtime?

Accepted Answer

As long as you follow the checklist before moving resources and make sure each step is completed, there shouldn't be any downtime.

Question 10

What does "indexing" mean in Azure AI Search?

Accepted Answer

It refers to the ingestion, parsing, and storing of textual content and tokens that populate a search index. Indexing creates inverted indexes and other physical data structures that support information retrieval.

It creates vector indexes if the schema includes vector fields.

Question 11

Can I move, backup, and restore indexes?

Accepted Answer

There's no native support for porting indexes. Search indexes are considered downstream data structures, accepting content from other data sources that collect operational data. As such, there's no built-in support for backing up and restoring indexes because the expectation is that you would rebuild an index from source data if you deleted it, or wanted to move it.

However, if you want to move an index between search services, you can try the index-backup-restore sample code in this Azure AI Search .NET sample repo. There's also a Python version of backup and restore.

Question 12

Can I restore my index or service once it's deleted?

Accepted Answer

No, if you delete an Azure AI Search index or service, it can't be recovered. When you delete a search service, all indexes in the service are deleted permanently.

Question 13

Can I index from SQL Database replicas?

Accepted Answer

If you're using the search indexer for Azure SQL Database, there are no restrictions on the use of primary or secondary replicas as a data source when building an index from scratch. However, refreshing an index with incremental updates (based on changed records) requires the primary replica. This requirement comes from SQL Database, which guarantees change tracking on primary replicas only. If you try using secondary replicas for an index refresh workload, there's no guarantee you get all of the data.

Question 14

What is vector search?

Accepted Answer

Vector search is a technique that finds the most similar documents by comparing their vector representations. Since the goal of a vector representation is to capture the essential characteristics of an item in a numerical format, vector queries can identify similar content even if there are no explicit matches based on keywords or tags. When a user performs a search, the query is summarized into a vector representation and the vector search engine identifies the most similar documents. To improve efficiency on large databases, vector search often provides the approximate nearest neighbors for a query vector. See Vector search overview for the specifics of Azure AI Search's vector offering.

Question 15

Does Azure AI Search support vector search?

Accepted Answer

Azure AI Search supports vector indexing and retrieval. It can vectorize query strings and content if you use the preview and beta libraries.

Question 16

How does vector search work in Azure AI Search?

Accepted Answer

With standalone vector search, you first use an embedding model to transform content into a vector representation within an embedding space. You can then provide these vectors in a document payload to the search index for indexing. To serve search requests, you use the same DNN from indexing to transform the search query into a vector representation, and vector search finds the most similar vectors and return the corresponding documents.

In Azure AI Search, you can index vector data as fields in documents alongside textual and other types of content. The data type for a vector field is Collection(Edm.Single).

Vector queries can be issued standalone or in combination with other query types, including term queries and filters in the same search request.

Question 17

Can Azure AI Search vectorize my content or queries?

Accepted Answer

Built-in integrated vectorization is now in public preview.

Question 18

Does my search service support vector search?

Accepted Answer

Most existing services support vector search. If you're using a package or API that supports vector search and index creation fails, the underlying search service doesn't support vector search, and a new service must be created. This can occur for a small subset of services created prior to January 1, 2019.

Question 19

Can I add vector search to an existing index?

Accepted Answer

If your search service supports vector search, both existing and new indexes can accommodate vector fields.

Question 20

Why do I see different vector index size limits between my new search services and existing search services?

Accepted Answer

We're rolling out improved vector index size limits worldwide for new search services, but we're still building out infrastructure capacity in certain regions. New search services created in supported regions will see increased vector index size limits. Unfortunately, we can't migrate existing services to the new limits.

Question 21

How do I enable vector search on a search index?

Accepted Answer

To enable vector search in an index, you should:

Add one or more vector fields to a field collection.
Add a "vectorSearch" section to the index schema specifying the configuration used by vector search fields, including the parameters of the Approximate Nearest Neighbor algorithm used, like HNSW.
Use 2023-11-01 or an Azure SDK to create or update the index, load documents, and issue queries.

Question 22

Where does query execution occur?

Accepted Answer

Queries execute over a single search index that's hosted on your search service. You can't join multiple indexes to search content in two or more indexes, but you can query same-name indexes in multiple search services.

Question 23

Why are there zero matches on terms I know to be valid?

Accepted Answer

The most common case isn't knowing that each query type supports different search behaviors and levels of linguistic analyses. Full text search, which is the predominant workload, includes a language analysis phase that breaks down terms to root forms. This aspect of query parsing casts a broader net over possible matches, because the tokenized term matches a greater number of variants.

Wildcard, fuzzy and regex queries, however, aren't analyzed like regular term or phrase queries and can lead to poor recall if the query doesn't match the analyzed form of the word in the search index. For more information on query parsing and analysis, see query architecture.

Question 24

Why are my wildcard searches slow?

Accepted Answer

Most wildcard search queries, like prefix, fuzzy and regex, are rewritten internally with matching terms in the search index. This extra processing adds to latency. Further, broad search queries, like a* for example, are likely to be rewritten with many terms, which can be slow. For performant wildcard searches, consider defining a custom analyzer.

Question 25

Can I search across multiple indexes?

Accepted Answer

No, a query is always scoped to a single index.

Question 26

Why is the search score a constant 1.0 for every match?

Accepted Answer

Search scores are generated for full text search queries, based on the statistical properties of matching terms, and ordered high to low in the result set. Query types that aren't full text search (wildcard, prefix, regex) aren't ranked by a relevance score. This behavior is by design. A constant score allow matches found through query expansion to be included in the results, without affecting the ranking.

For example, suppose an input of "tour*" in a wildcard search produces matches on "tours", "tourettes", and "tourmaline". Given the nature of these results, there's no way to reasonably infer which terms are more valuable than others. For this reason, term frequencies are ignored when scoring results in queries of types wildcard, prefix, and regex. Search results based on a partial input are given a constant score to avoid bias towards potentially unexpected matches.

Question 27

Where does Azure AI Search store customer data?

Accepted Answer

It stores your data wherever your service is deployed. Azure AI Search doesn't store customer data outside of the deployment region.

Question 28

Does Azure AI Search send customer data to other services for processing?

Accepted Answer

Yes, if you use the built-in skills based on Azure AI services, the indexer sends requests to Azure AI services over the internal network. If you add a custom skill, the indexer sends content to the URI provided in the custom skill over the public network.

Question 29

Can I control access to search results based on user identity?

Accepted Answer

Not exactly. Typically, users who are authorized to run your application are also authorized to see all search results. Azure AI Search doesn't have built-in support for row-level or document-level permissions, but you can implement security filters as a workaround.

Question 30

Can I control access to operations based on user identity?

Accepted Answer

Yes, you can use role-based authorization for data plane operations over content.

Question 31

Can I use the Azure portal to view and manage search content if the search service is behind an IP firewall or a private endpoint?

Accepted Answer

You can use the Azure portal on a network-protected search service if you create a network exception that allows client and portal access. For more information, see connect through an IP firewall or connect through a private endpoint.

Azure AI Search Frequently Asked Questions

General