Attention

This document was written for an old version of Plone, Plone 3, and was last updated 1203 days ago.

To learn how to upgrade to the current version of Plone, read the upgrade manual.

Enable Indexing of pdf and word docs with Windows in Five steps:three minutes of your times without problems!

by Stefano Saltannecchi last modified Feb 04, 2009 03:04 AM
Very Simple Five istructions to index pdf and word documents in Plone with Windows

Purpose

Clear written and useful istructions for indexing pdf and word docs on windows.

 

Step by step:  only Five!

 

First: install OpenOffice.org on your system. It's very simple to use and replace very good Microsoft Office (c) at least for most users.

Secondly, take the Windows xpdf  package (http://www.foolabs.com/xpdf/download.html). You can download the Windows version, following this link: ftp://ftp.foolabs.com/pub/xpdf/xpdf-3.02pl1-win32.zip

Third: unpacking the files. Zip xpdf inside C: \ WINDOWS \ system32

Fourth launch Plone, check inside Plone/portal_transform if there is the transform    word_to_html

Fifth: click on Add Transform; Enter in ID:  pdf_to_text
Enter in Module: Products.PortalTransforms.transforms.pdf_to_text

Now you can post your word and pdf documents and will be automatically indexed.

To find out what has been indexed of incorporated  documents you can look at SearchableText inside Plone/portal_catalog/Catalog/   for documents tracked in the index.

 

Further information

For a POSIX guide, see http://plone.org/documentation/how-to/enable-full-text-indexing-of-word-documents-and-pdfs-in-plone-3-0-gnu-linux/?searchterm=index%20pdf

 

For an alternative "hard" Windows guide, see: http://plone.org/documentation/how-to/enable-full-text-indexing-of-word-documents-and-pdfs-in-plone-3-0-windows/?searchterm=indexing%20windows

 


Contribute

Something wrong or out of date? Anybody can edit or create a new article in the knowledge base. Simply create an account on this site, log in, and click the Edit button to contribute.