Refreshing a Collection


You last indexed a reference site several weeks ago, and now you want to refresh the collection to reindex any documents which have been added or changed.

Warning! If you are refreshing a collection that was built by the Verity Spider using the -noindex option, and by another indexing tool such as mkvdk, you must perform a resync before a refresh. This is to ensure that the persistent store has the latest document information for the collection.

NOTE: The following command must be issued as a single line from the command-line. It is broken up here for readability.

vspider -collection icd.coll
-refresh -host www.company.com
-include http://www.company.com/reference/*

Case-specific Options

Option
Reason
-refresh
You just want to make sure you get the latest versions of any of the documents in the collection.
-host, -include
These options allow you to control what is indexed. When you use -refresh, you must use at least one of -host, -domain, -nofollow, or -unlimited.

Unnecessary Options for this Case

Option
Reason
-start, -restart
When you use the -refresh option, you cannot use the -start or -restart options.

Now say you want to refresh the collection again, only this time you want to exclude a directory because you know the documents have not changed.

NOTE: The following command must be issued as a single line from the command-line. It is broken up here for readability.

vspider -collection icd.coll -refresh -host www.company.com
-exclude "http://www.company.com/financial/*"




Copyright © 1998, Verity, Inc. All rights reserved.