Including Previously Dropped Documents


You indexed a site, but several hosts timed out during the job and could not provide the necessary documents. You have since learned that those hosts were undergoing maintenance and are now available.

NOTE: The following command must be issued as a single line at the command line. It is broken across several lines here for readability.

vspider -collection icd.coll
-restart
-host internal.verity.com -host marketing.verity.com
-timeout 90
-delay 30000 -retry
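
One way to keep the command readable in a script while still issuing it as a single line is to assemble the arguments piece by piece. The following is a minimal sketch for a POSIX shell; the function name build_vspider_cmd is hypothetical, and the function only prints the assembled line rather than running vspider.

```shell
#!/bin/sh
# Sketch: assemble the vspider invocation from readable pieces, then
# print it as one line. build_vspider_cmd is a hypothetical helper;
# nothing here actually runs vspider.

build_vspider_cmd() {
    # Each "set --" call appends to the positional parameters.
    set -- vspider -collection icd.coll
    set -- "$@" -restart
    set -- "$@" -host internal.verity.com -host marketing.verity.com
    set -- "$@" -timeout 90
    set -- "$@" -delay 30000 -retry
    # With the default IFS, "$*" joins the arguments with single spaces.
    printf '%s\n' "$*"
}

build_vspider_cmd
```

To execute the command instead of printing it, the printf line could be replaced with "$@". In a POSIX shell, ending each line with a backslash has a similar effect; other command interpreters may behave differently.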

Case-specific Options

Option      Reason
-restart    This option specifies an incremental indexing job.
-host       This option restricts indexing to the hosts you specify. In this case, they are the hosts you know were not available when the original job ran. When you use -restart, you must use at least one of -host, -domain, -nofollow, or -unlimited.
-timeout    By increasing the amount of time before a request times out, you increase the chances that vspider will be able to maintain the connection with the host and retrieve documents.
-delay      By specifying a delay between HTTP requests, you decrease the chances that the hosts will be overwhelmed.
-retry      By specifying -retry, you increase the chances that a document will be retrieved. The default value is 4.

Unnecessary Options for this Case

Option                      Reason
-start, -resync, -refresh   When you use the -restart option, you cannot use the -start, -resync, or -refresh options.
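
The two -restart rules stated in the tables above can be checked before a job is launched. The following POSIX shell sketch does so; check_restart_opts is a hypothetical helper, not part of vspider, and it matches option names only, without interpreting the values that follow them.

```shell
#!/bin/sh
# Sketch: check an argument list against the two -restart rules above.
# check_restart_opts is a hypothetical helper, not part of vspider.

check_restart_opts() {
    has_restart=0
    has_scope=0
    has_conflict=0
    for arg in "$@"; do
        case "$arg" in
            -restart) has_restart=1 ;;
            -host|-domain|-nofollow|-unlimited) has_scope=1 ;;
            -start|-resync|-refresh) has_conflict=1 ;;
        esac
    done
    # Both rules apply only when -restart is present.
    [ "$has_restart" -eq 1 ] || return 0
    if [ "$has_conflict" -eq 1 ]; then
        echo "error: -restart cannot be used with -start, -resync or -refresh" >&2
        return 1
    fi
    if [ "$has_scope" -eq 0 ]; then
        echo "error: -restart needs at least one of -host, -domain, -nofollow, -unlimited" >&2
        return 1
    fi
    return 0
}

# Usage: validate the options from this case before running the job.
if check_restart_opts -collection icd.coll -restart \
        -host internal.verity.com -host marketing.verity.com \
        -timeout 90 -delay 30000 -retry; then
    echo "options look consistent"
fi
```

A check like this only mirrors the rules as described on this page; vspider itself remains the authority on which combinations it accepts.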





Copyright © 1998, Verity, Inc. All rights reserved.