File Systems
You want to spider a network drive to index all
Microsoft Word and ASCII text documents, while skipping directories
named TEMP, CONFIDENTIAL, and ACCOUNTING, as well as any directory
more than 4 subdirectories deep.
- vspider -cmdfile c:\verity\vspider\files.cmd
where files.cmd consists of:
- -collection icd.coll
- -start f:\documents
- -indmimeinclude application/msword -indmimeinclude 'text/plain'
- -prunedir 'TEM*' 'CONFIDENTI*' 'ACCOUNT*' -pathlen 6
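The effect of the pruning patterns and the depth limit can be pictured with a small Python sketch. It is only an illustration of the rules described in the scenario, not the Verity Spider's own logic: the function name `is_crawled`, the case-insensitive comparison, and the use of shell-style wildcard matching are all assumptions, and the depth check models the "4 subdirectories deep" requirement from the prose rather than the exact semantics of -pathlen.

```python
from fnmatch import fnmatchcase
from pathlib import PureWindowsPath

# Patterns copied from the -prunedir option in files.cmd.
PRUNE_PATTERNS = ["TEM*", "CONFIDENTI*", "ACCOUNT*"]
# Assumed reading of the scenario: at most 4 directory levels below the start point.
MAX_DEPTH = 4


def is_crawled(path, start=r"f:\documents"):
    """Return True if a file path survives the pruning and depth rules (hypothetical helper)."""
    rel = PureWindowsPath(path).relative_to(PureWindowsPath(start))
    dirs = rel.parts[:-1]  # directory components between the start point and the file
    if len(dirs) > MAX_DEPTH:
        return False
    # Upper-casing is an assumption made here so that "Temp" and "TEMP" are treated alike.
    return not any(fnmatchcase(d.upper(), p) for d in dirs for p in PRUNE_PATTERNS)


print(is_crawled(r"f:\documents\reports\q1.doc"))
print(is_crawled(r"f:\documents\TEMP\scratch.doc"))
print(is_crawled(r"f:\documents\Templates\letter.doc"))
```

Under these assumptions, a wildcard such as 'TEM*' prunes more than the directory named TEMP: a directory called Templates matches it as well, so choose prefixes with care.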
If you find that MIME types are being dropped, or you know
that you will be indexing files whose extensions the Verity
Spider does not recognize by default, use the wildcard expression '*/*'
as your MIME criterion.
For example:
- -mimeinclude '*/*'
Remember, on UNIX you must enclose values that contain
wildcard characters in single quotes.
You should also use inclusion and
exclusion criteria to fine-tune what is indexed.
- If your list of file types to index is rather long, use one of the
  exclusion criteria (-exclude, -indexclude, -mimeexclude, or
  -indmimeexclude) to exclude extensions you know you do not want to
  index. For example:
  -exclude '*.exe' '*.com'
- If the list of file types you want to index is relatively small, use
  one of the inclusion criteria (-include, -indinclude, -mimeinclude, or
  -indmimeinclude) to specify them. For example:
  -indinclude '*.txt' '*.1st' '*.log'
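To compare the two approaches, the following Python sketch applies the example patterns above to a made-up list of file names. It assumes shell-style wildcard matching that ignores case, which may differ from how the Verity Spider evaluates its criteria; the `select` helper and the file names are hypothetical.

```python
from fnmatch import fnmatchcase


def select(names, include=None, exclude=None):
    """Keep names that match an include pattern (if any are given) and no exclude pattern."""
    kept = []
    for name in names:
        lower = name.lower()  # case-insensitive matching is an assumption of this sketch
        if include and not any(fnmatchcase(lower, p) for p in include):
            continue
        if exclude and any(fnmatchcase(lower, p) for p in exclude):
            continue
        kept.append(name)
    return kept


# Hypothetical directory listing used only for illustration.
files = ["setup.exe", "notes.txt", "README.1st", "run.com", "build.log", "report.doc"]

by_exclusion = select(files, exclude=["*.exe", "*.com"])
by_inclusion = select(files, include=["*.txt", "*.1st", "*.log"])

print(by_exclusion)
print(by_inclusion)
```

In this sketch the exclusion list lets report.doc through because nothing rules it out, while the inclusion list drops it because nothing names it. That is the practical difference between the two styles: exclusion admits unknown file types, inclusion rejects them.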
Copyright © 1998, Verity, Inc. All rights reserved.