[Silva-general] indexing of .doc/.pdf files
kitblake at infrae.com
Wed May 20 08:51:27 CEST 2009
On 19 May 2009, at 18:04, Marc Petitmermet wrote:
> how can i tell silva which silva file (.doc/.pdf) to index and which
> not? is it possible to de-index a file afterwards? i know exactly
> which files should be indexed and which not (the final accepted
> version and not any other revisions). or is there some kind of filter
> which excludes all data from unwanted silva files from the search
> results (some metadata would be nice: index/don't index like the "hide
> from tables of content")? if there is no easy way i probably just copy
> the final version to a different directory and have silva reindex the
> catalogue and then restrict silva find to this folder.
All content gets indexed. If there are multiple versions of the same
asset (.doc/.pdf) you should move the older versions somewhere else.
There are various ways you can control the search. One is by setting a
path and restricting the search to a folder or branch of the site.
You could also put the old versions in a (sub)folder and give it an
access restriction. Public visitors who are not logged in won't see
the results, but if you're logged in you will. Then set the folder to
not show up in navigation.
Interesting idea about the metadata, but it's better to control that
with queries. Silva Find is not just for public use, it can also be
really useful for authoring tasks.
Kit BLAKE · Infrae · http://infrae.com/ + 31 10 243 7051
More information about the silva-general