[Silva-general] indexing of .doc/.pdf files

Kit BLAKE kitblake at infrae.com
Wed May 20 08:51:27 CEST 2009


On 19 May 2009, at 18:04, Marc Petitmermet wrote:
> how can i tell silva which silva file (.doc/.pdf) to index and which
> not? is it possible to de-index a file afterwards? i know exactly
> which files should be indexed and which not (the final accepted
> version and not any other revisions). or is there some kind of filter
> which excludes all data from unwanted silva files from the search
> results (some metadata would be nice: index/don't index like the "hide
> from tables of content")? if there is no easy way i probably just copy
> the final version to a different directory and have silva reindex the
> catalogue and then restrict silva find to this folder.

All content gets indexed. If there are multiple versions of the same  
asset (.doc/.pdf) you should move the older versions somewhere else.  
There are various ways you can control the search. One is by setting a  
path and restricting the search to a folder or branch of the site.
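
As a rough illustration of what restricting a search to a folder or
branch means, here is a minimal Python sketch. It is a toy model, not
Silva's or Silva Find's actual API: the IndexedDoc class, the
under_path and search helpers, and the example paths are all
hypothetical names made up for this illustration.

```python
from dataclasses import dataclass


@dataclass
class IndexedDoc:
    """Toy stand-in for one catalogued asset (hypothetical, not a Silva class)."""
    path: str   # location of the asset in the site tree
    title: str  # text that the toy search matches against


def under_path(doc_path: str, root: str) -> bool:
    """Return True if doc_path is the root folder itself or lies below it."""
    root = root.rstrip("/")
    # Require a "/" after the root so "/site/final-old" does not match "/site/final".
    return doc_path == root or doc_path.startswith(root + "/")


def search(docs, term: str, root: str = "/"):
    """Case-insensitive title match, limited to documents below root."""
    term = term.lower()
    return [d for d in docs if under_path(d.path, root) and term in d.title.lower()]


# Example data: everything is indexed, including older revisions.
docs = [
    IndexedDoc("/site/final/report.pdf", "Annual report"),
    IndexedDoc("/site/drafts/report-v1.doc", "Annual report (draft)"),
    IndexedDoc("/site/final-old/report.pdf", "Annual report (superseded)"),
]

# Only the branch holding the accepted versions is searched.
hits = search(docs, "report", root="/site/final")
```

All three documents stay in the index; the path restriction only
narrows which of them a given query can return.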

You could also put the old versions in a (sub)folder and give it an
access restriction. Public visitors who are not logged in won't see
those files in the search results, but you will when you're logged in.
Then set the folder to not show up in navigation.

Interesting idea about the metadata, but it's better to control that  
with queries. Silva Find is not just for public use, it can also be  
really useful for authoring tasks.

Kit


-- 
Kit BLAKE · Infrae · http://infrae.com/ + 31 10 243 7051



