Search PDF title and content using MOSS 2007

Introduction

By default the SharePoint 2007 Search indexed only the meta data of a PDF document. By installing and configuring a PDF IFilter the Search will also index the contents of the PDF document. This allows users to find documents based on text inside the document. This process is called full text indexing.

[Indexing Server]: the server(s) in the SharePoint Farm that has/have the “Indexing” Role assigned. In a small farm this can be a single server for all roles.

[Web Front End Server]: the server(s) in the SharePoint Farm that has/have the “Web Front End” Role assigned. In a small farm this can be a single server for all roles.

[Indexing Server]

1.Install the PDF IFilter (see below for a list of available IFilters)
2.Add the .pdf file type to the index list:
1.Go to Central Administration, then to the Shared Services Administration Web of the current SSP, go to Search Settings and next to File Type
2.Add a new file type pdf
3.[This step only applies to 64 bit servers]
3.1.Go to
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf
NOTE: The first registry key entry will enable the title search and the second one will enable the content of the PDF.
3.2.Change the (Default) key value
3.2.1.Old value: {4C904448-74A9-11D0-AF6E-00C04FD8DC02}
3.2.2.(Foxit x64 PDF IFilter) New value: {987F8D1A-26E6-4554-B007-6B20E2680632}
3.2.3.(Adobe x64 PDF IFilter) New value: {E8978DA6-047F-4E3D-9C78-CDBE46041603}
NOTE: Add the curly bracket with the value.
4.Perform an iisreset
5.Perform a Full Update on the Search content indexes
5.1.Open a Command Prompt on the Indexing Server
5.2.net stop osearch
5.3.net start osearch
5.4.Go to Central Administration, then to the Shared Services Administration Web of the current SSP, go to Search Settings and start a full crawl of all locations containing PDF files

[Web Front End Server]

1.Copy the ICPDF.GIF file to “C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\12\Template\Images”
2.Edit the file C:\Program Files\Common Files\Microsoft Shared\Web server extensions\12\Template\Xml\DOCICON.XML
1.Add an entry for the .pdf extension entry just before </ByExtension>,
<Mapping Key="pdf" Value="icpdf.gif"/>

Reference:
http://jasoncoltrin.com
www.moss2007.be
www.alpesh.nakars.com
http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

Advertisements

Diganta Kumar has architected and developed software for more than a decade for a wide range of industries and development platforms and over the years has filled many roles including program manager, founder, developer, architect, team lead, mentor and project manager. Diganta is founder of two online IT businesses. He is a certified AWS Solutions Architect, certified Professional Scrum Master (PSM I), certified Professional Scrum Developer (PSD I) and ITIL Certified. He has presented at Microsoft Tech.Ed, Microsoft AppFest and Ark Group Intranet conference. He attends AWS Seattle Official Events, Seattle AWS Architects-Engineers, and AWS Cloud Commerce user groups. He likes to help, mentor and manage software development teams to improve and produce great software. He currently work as a Senior Technical Program Manager for Amazon Web Services.

Posted in SharePoint

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s