Problem: There are many Design documents uploaded to a SharePoint 2010 document library in Microsoft Word format in a Team site. Users reported these documents aren’t appearing in search result.
Same time found in crawl logs: “This item was partially parsed. The item has been truncated in the index because it exceeds the maximum size.” error.
Root cause: This is because SharePoint search crawler doesn’t index large files > 16 MB by default! (in SharePoint 2010. For SharePoint 2013 & 2016, its 64 MB) Confirmed that the documents uploaded are about 20MB each! So SharePoint crawls only the meta data associated with the document and skips the contents inside the file!
Solution: Increase SharePoint search index file size limit
Add-PSSnapin Microsoft.SharePoint.Powershell -ErrorAction SilentlyContinue $SSA = Get-SPEnterpriseSearchServiceApplication #Get the current size $SSA.GetProperty("MaxDownloadSize") #Increase maximum file size in Crawl - indexer $SSA.SetProperty("MaxDownloadSize", 100) $SSA.Update()
This will increase the maximum size of files to index to 100 MB. Restart Search Service for the changes to take effect and Run a Full Crawl! You can also set maximum size for specific file type. E.g. MaxDownloadSizeExcel.