Skip to content

Commit

Permalink
Update search-limits.md
Browse files Browse the repository at this point in the history
We recently did work and increased the crawling support from 150MB to 512 MB for PDF, PPTX, PPT, DOC, and DOCX file types.
  • Loading branch information
NeethaTuluri authored Nov 13, 2024
1 parent b22af13 commit 370d61d
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion SharePoint/SharePointOnline/search-limits.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ The following table lists the limits for SharePoint search.

|Limit|Maximum value|Limit type|Notes|
|:-----|:-----|:-----|:-----|
|Size of document that can be downloaded by the crawl components |150 MB|Boundary |Search downloads metadata of the document only for files >150 MB. The content of the document isn't downloaded. |
|Size of document that can be downloaded by the crawl components |All File Types: 150 MB <br/> PDF, PPTX, PPT, DOC, DOCX: 512 MB |Boundary | Crawl processing supports files up to 150 MB for all types, with an extended limit of 512 MB for PDF, PPTX, PPT, DOC, and DOCX formats. For files exceeding these limits, only document metadata is downloaded, while the full content is NOT downloaded and remains unavailable for search. |
|Parsed content size |2 million characters |Boundary |Search stops parsing an item after it has parsed up to 2 million characters of content from it, including the item's attachments. The actual number of parsed characters can be lower than this limit because search uses a maximum of 30 seconds on parsing a single item and its attachments. When search stops parsing an item, the item is marked as partially processed. Any unparsed content isn't processed and therefore isn't indexed. |
|Characters processed by the word breaker |1,000,000 |Boundary |Search breaks content into individual words (tokens). The word breaker produces tokens from the first 1,000,000 characters of a single item, including the item's attachments. <br/> The actual amount of tokens can be lower than this limit because search uses a maximum of 30 seconds on word breaking. Any remaining content isn't processed. |
|Indexed managed property size |512 KB per managed property that is set to either "searchable" or "queryable" |Boundary ||
Expand Down

0 comments on commit 370d61d

Please sign in to comment.