Friday, March 25, 2011

Configuring iFilter for PDF Search in SharePoint 2010 - Step by Step

What is iFilter by the way?

IFilters are components that allow SharePoint to index content of specific file types, letting users to search for content in those files.
Using the SharePoint Search we can find documents based on their filename, metadata or content within a document. By default the content of Office documents is indexed by the SharePoint crawler, but PDF files are not crawled.
To add support for PDF files you have to add an I-Filter which the SharePoint crawler uses to read through PDF files and add the information to the search index.
There are two steps in configuring PDF Search IFilters for SharePoint
1.       Install the PDF iFilter
2.       Configure the iFilter

Installing IFilter

The Installation of the iFilter 9.0 has to be done on the Index Server(s)
How to install ifilter in SharePoint 2010? Download  and install the iFilter installation file for 64-bit machine from http://www.adobe.com/support/downloads/detail.jsp?ftpID=4025
Configuring iFilter for PDF Search in SharePoint 2010
Choose the location for the installation files to reside and complete the installation by clicking Install and finish the installation wizard.
install ifilter on sharepoint 2010

 Configuring iFilter

I. On WFE Server(s):

1.       Download PDF icon picture (17x17) from Adobe web site http://www.adobe.com/misc/linking.html and Copy to C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\14\TEMPLATE\IMAGES\

pdf ifilter on sharepoint 2010

2. Add an entry in docicon.xml

Backup the docicon.xml file, and Add an entry in docicon.xml for the PDF icon:

C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\14\TEMPLATE\XML


<Mapping Key="pdf" Value="pdficon17.gif">

Above Steps are done in order to have the PDF icon in SharePoint search results.

Add PDF file type on the File Type page under Search Service

· Open SharePoint Central Administration >> General Application Settings >> Farm Search Administration

add pdf search to sharepoint 2010·         Click on Search Service Application

sharepoint 2010 search service pdf
·         Click on File Types to add PDF extension for SharePoint 2010
sharepoint 2010 search file types pdf

 ·         Add new File Type extension “PDF” to setup IFilter for SharePoint 2010
sharepoint search server 2010 pdf

·        Enter file extension as pdf and click OK and get out from Central Administration
pdf search sharepoint 2010
II.    On Index Server(s)

·         Open Regedit and go to : HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\14.0\Search\Setup\Filters
·         Right-click the Filters folder and select New Key. Enter “.pdf” for the key value.

sharepoint 2010 pdf ifilter registry change
·         Add the following values
<REG_SZ> Default = <value not set>
<REG_SZ> Extension = pdf
<REG_DWORD> FileTypeBucket = 1
<REG_SZ> MimeTypes = application/pdf
sharepoint 2010 pdf ifilter registry

Associate the iFilter by following below steps:

·         Open Regedit
·         Navigate to the following location: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\14.0\Search\Setup\ContentIndexCommon\Filters\Extension
·         Right-click > Click New > Key to create a new key for .pdf
·         Add the following GUID in the default value {E8978DA6-047F-4E3D-9C78-CDBE46041603}
pdf search in sharepoint 2010
Restart all SharePoint services as well as IIS.
·          Restart the SharePoint Server Search 14
how to install ifilter sharepoint 2010
Or Execute the below commands:
Net Stop OSearch14
Net Start OSearch14

·         Reboot the SharePoint servers in Farm
·         Perform FULL Crawl to get search result.

To allow PDF’s to open in the browser

·         Navigate  to Central Administration >> Application Management
·          Select the appropriate web application (you may have to repeat these steps if you have multiple web apps) then click on General Settings in the Ribbon
·          Scroll down to “Browser File Handling” and set this to Permissive
·         Click the OK button.  (Repeat the last two steps for additional web apps)

Unit Testing:

Create a document library upload any sample PDF document(s). Once the full crawl is completed we will get search results.
pdf search on sharepoint 2010




You might also like:
SharePoint Usage Reports
Usage reports, collaboration and audit for SharePoint.
Document SharePoint Farm
Automatically generate SharePoint documentation.
*Sponsored


Check out these SharePoint products:

8 comments :

  1. Hi, i am unable to open a pdf file from document library in SP2010 the dialog box shows Save option but not the open option. I tried changing the Web Application settings from "Strict" to "Permissive".

    ReplyDelete
    Replies
    1. Try adding the PDF file type to Web Application's allowed MIME Type.
      Refer my post: Open PDF files in browser from SharePoint 2010

      Regards,
      Sal

      Delete
  2. Hello,

    I try this tutorial and the result I lost all my docx, xlsx icon.
    have you any idea ?

    ReplyDelete
    Replies
    1. Which means, Your DocIcon.xml file is not properly updated! Check the XML.

      Delete
  3. I tried this solution Salaudeen and I get object not found for PDF file types as the error in my Crawl log...any ideas?

    ReplyDelete
    Replies
    1. Try to trigger Index Reset and Do a Full crawl again.

      Delete
  4. Is this GUID E8978DA6-047F-4E3D-9C78-CDBE46041603} same form all.
    If different, how to get this?

    ReplyDelete
    Replies
    1. Well, that's the GUID for Adobe IFilter as per http://support.microsoft.com/kb/2518465. If you use some other IFilters, like Foxit - This will vary!

      Delete

Please Login and comment to get your questions answered!

You might also like:

Related Posts Plugin for WordPress, Blogger...