Limits for eDiscovery search
Microsoft 365 compliance is now called Microsoft Purview and the solutions within the compliance area have been rebranded. For more information about Microsoft Purview, see the blog announcement.
Various limits are applied to eDiscovery search tools in the Microsoft Purview compliance portal. This includes searches run on the Content search page and searches that are associated with an eDiscovery case on the eDiscovery (Standard) page. These limits help to maintain the health and quality of services provided to organizations. There are also limits related to the indexing of email messages in Exchange Online for search. You can't modify the limits for eDiscovery searches or email indexing, but you should be aware of them so that you can take these limits into consideration when planning, running, and troubleshooting eDiscovery searches.
For limits related to the Microsoft Purview eDiscovery (Premium) tool, see Limits in eDiscovery (Premium)
The following table lists the search limits when using the content search tool in the compliance portal and for searches that are associated with a Microsoft Purview eDiscovery (Standard) case.
|Description of limit||Limit|
|The maximum number of mailboxes or sites that can be searched in a single search||No limit 1|
|The maximum number of searches that can run at the same time in your organization.||30|
|The maximum number of organization-wide searches that can be run at the same time.||3|
|The maximum number of searches that a single user can start at the same time. This limit is most likely hit when the user tries to start multiple searches by using the Get-ComplianceSearch |Start-ComplianceSearch command in Security & Compliance Center PowerShell.||10|
|The maximum number of items per user mailbox that are displayed on the preview page when previewing Content Search results.||100|
|The maximum number of items found in all user mailboxes that can possibly be displayed on the preview page when previewing search results. The newest items are displayed.||1,000 2|
|The maximum number of user mailboxes that can be previewed for search results. If there are more than 1000 mailboxes that contain content that matches the search query, at most, only the top 1000 mailboxes with the most search results will be available for preview.||1,000|
|The maximum number of items found in SharePoint and OneDrive for Business sites that are displayed on the preview page when previewing search results. The newest items are displayed.||200|
|The maximum number of sites (in SharePoint and OneDrive for Business) that can be previewed for search results. If there are more than 200 total sites that contain content that matches the search query, only the top 200 sites with the most search results will be available for preview.||200|
|The maximum number of items per public folder mailbox that are displayed on the preview page when previewing content search results.||100|
|The maximum number of items found in all public folder mailboxes that are displayed on the preview page when previewing content search results.||200|
|The maximum number of public folder mailboxes that can be previewed for search results. If there are more than 500 public folder mailboxes that contain content that matches the search query, only the top 500 public folder mailboxes with the most search results will be available for preview.||500|
|The maximum size of an item that can be viewed on the preview page.||10,000,000 bytes (approximately 9.5 MB)|
|The maximum number of characters for the search query (including operators and conditions) for a search.
Note: This limit takes effect after the query is expanded and includes characters from the keyword query, any search permissions filters applied to the user, and the URLs of all site locations. This means the query will get expanded against each of the keywords. For example, if a search query has 15 keywords and additional parameters and conditions, the query gets expanded 15 times, each with the other parameters and conditions in the query. So even though the number of characters in the search query may be below the limit, it's the expanded query that may contribute to exceeding this limit.
Sites: 4,000 when searching all sites or 2,000 when searching up to 20 sites. 3
|The maximum number of variants returned when using a prefix wildcard to search for an exact phrase in a search query or when using a prefix wildcard and the NEAR Boolean operator.||10,000 4|
|The minimum number of alpha characters for prefix wildcards; for example,
|The maximum number of mailboxes in a search that you can delete items in by doing a "search and purge" action (by using the New-ComplianceSearchAction -Purge command). If the search that you're doing a purge action for has more source mailboxes than this limit, the purge action will fail. For more information about search and purge, see Search for and delete email messages in your organization.||50,000|
|The maximum number of locations in a search that you can export items from. If the search that you're exporting has more locations than this limit, the export will fail. For more information, see Export content search results.||100,000|
1 Although you can search an unlimited number of mailboxes in a single search, you can only download the exported search results from a maximum of 100,000 mailboxes using the eDiscovery Export Tool in the compliance portal.
2 The intent of the preview page is to show a limited sample of the results. Even for massive searches with thousands of results, the number of items that are shown on the preview page can, and often will, be much less than maximum possible value of 1000. To see the complete search results, you need to export the results.
3 When searching SharePoint and OneDrive for Business locations, the characters in the URLs of the sites being searched are counted against this limit.
4 For non-phrase queries (a keyword value that doesn't use double quotation marks) we use a special prefix index. This tells us that a word occurs in a document, but not where it occurs in the document. To do a phrase query (a keyword value with double quotation marks), we need to compare the position within the document for the words in the phrase. This means that we can't use the prefix index for phrase queries. In this case, we internally expand the query with all possible words that the prefix expands to; for example,
"time*" can expand to
"time OR timer OR times OR timex OR timeboxed OR ...". 10,000 is the maximum number of variants the word can expand to, not the number of documents matching the query. There is no upper limit for non-phrase terms.
Microsoft collects performance information for searches run by all organizations. While the complexity of the search query can impact search times, the biggest factor that affects how long searches take is the number of mailboxes searched. Although Microsoft doesn't provide a Service Level Agreement for search times, the following table lists average search times for collection searches based on the number of mailboxes included in the search.
|Number of mailboxes||Average search time|
The following table lists the limits when exporting the results of a content search. These limits also apply when you export content from a eDiscovery (Standard) case.
|Description of limit||Limit|
|Maximum amount of exportable data from a single search
Note: If the search results are larger than 2 TB, consider using date ranges or other types of filters to decrease the total size of the search results.
|Maximum an organization can export in a single day
Note: This limit is reset daily at 12:00AM UTC
|Maximum concurrent exports that can be ran at same time within your organization
Note: Running a Report Only export counts against total concurrent exports for your organization. If three users are performing 3 exports each, then only one other export can be performed. Whether it is exporting a report or search results, no other exports can be performed until one has completed.
|Maximum exports a single user can run at any one time||3|
|Maximum number of mailboxes for search results that can be downloaded using the eDiscovery Export Tool||100,000|
|Maximum size of PST file that can be exported
Note: If the search results from a user's mailbox are larger than 10 GB, the search results for the mailbox will be exported in two (or more) separate PST files. If you choose to export all search results in a single PST file, the PST file will be spilt into additional PST files if the total size of the search results is larger than 10 GB. If you want to change this default size, you can edit the Windows Registry on the computer that you use to export the search results. See Change the size of PST files when exporting eDiscovery search results. The search results from a specific mailbox won't be divided among multiple PST files unless the content from a single mailbox is more than 10 GB. If you chose to export the search results in one PST file for that contains all messages in a single folder and the search results are larger than 10 GB, the items are still organized in chronological order, so they will be spilt into additional PST files based on the sent date.
|Rate at which search results from mailboxes and sites are uploaded to a Microsoft-provided Azure Storage location.||Maximum of 2 GB per hour|
Indexing limits for email messages
The following table describes the indexing limits that might result in an email message being returned as an unindexed item or a partially indexed item in the results of a content search.
|Indexing limit||Maximum value||Description|
|Maximum attachment size||150 MB||The maximum size of an email attachment that will parse for indexing. Any attachment that's larger than this limit won't be parsed for indexing, and the message with the attachment will be marked as partially indexed.
Note: Parsing is the process where the indexing service extracts text from the attachment, removes unnecessary characters like punctuation and spaces, and then divides the text into words (in a process called tokenization), that are then stored in the index.
|Maximum number of attachments||250||The maximum number of files attached to an email message that will be parsed for indexing. If a message has more than 250 attachments, the first 250 attachments are parsed and indexed, and the message is marked as partially indexed because it had additional attachments that weren't parsed.|
|Maximum attachment depth||30||The maximum number of nested attachments that are parsed. For example, if an email message has another message attached to it and the attached message has an attached Word document, the Word document and the attached message will be indexed. This behavior will continue for up to 30 nested attachments.|
|Maximum number of attached images||0||An image that's attached to an email message is skipped by the parser and isn't indexed.|
|Maximum time spent parsing an item||30 seconds||A maximum of 30 seconds is spent parsing an item for indexing. If the parsing time exceeds 30 seconds, the item is marked as partially indexed.|
|Maximum parser output||2 million characters||The maximum amount of text output from the parser that's indexed. For example, if the parser extracted 8 million characters from a document, only the first 2 million characters are indexed.|
|Maximum annotation tokens||2 million||When an email message is indexed, each word is annotated with different processing instructions that specify how that word should be indexed. Each set of processing instructions is called an annotation token. To maintain the quality of service in Office 365, there is a limit of 2 million annotation tokens for an email message.|
|Maximum body size in index||67 million characters||The total number of characters in the body of an email message and all its attachments. When an email message is indexed, all text in the body of the message and in all attachments is concatenated into a single string. The maximum size of this string that is indexed is 67 million characters.|
|Maximum unique tokens in body||1 million||As previously explained, tokens are the result of extracting text from content, removing punctuation and spaces, and then dividing it into words (called tokens) that are stored in the index. For example, the phrase
There are additional limits related to different aspects of searching for content, such as content indexing. For more information about these limits, see the following topics:
For information about content searches, see:
For case limits related to eDiscovery (Standard) and eDiscovery (Premium), see:
Submit and view feedback for