Indexing Virtual Hosts


You want to index both www.mysite.com and search.mysite.com. Both are DNS aliases of webhost.mysite.com, which is the canonical name of the physical machine running the web server that hosts them. The web server is set up to serve http://www.mysite.com and http://search.mysite.com from different document roots. To index both sites, run:

vspider -cmdfile /verity/spider/vhosts.cmd

where vhosts.cmd consists of:

-collection icd.coll
-start http://www.mysite.com http://search.mysite.com
-virtualhost www.mysite.com search.mysite.com
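
The steps above can be sketched as a small shell script that writes the command file and then starts the spider. This is only an illustration under stated assumptions: CMDFILE is a hypothetical variable introduced here (the example above keeps the file at /verity/spider/vhosts.cmd), and the script assumes vspider is on your PATH. The three options are exactly the ones listed above.

```shell
#!/bin/sh
# Sketch: create the command file from this example and run vspider with it.
# CMDFILE is a hypothetical variable, not part of vspider; the original
# example stores the file at /verity/spider/vhosts.cmd.
CMDFILE="${CMDFILE:-./vhosts.cmd}"

# Write the three options, one per line, exactly as shown above.
cat > "$CMDFILE" <<'EOF'
-collection icd.coll
-start http://www.mysite.com http://search.mysite.com
-virtualhost www.mysite.com search.mysite.com
EOF

# Run the spider only if it is installed (assumed to be on PATH);
# otherwise report the command that would have been run.
if command -v vspider >/dev/null 2>&1; then
    vspider -cmdfile "$CMDFILE"
else
    echo "vspider not found; would run: vspider -cmdfile $CMDFILE"
fi
```

Keeping the options in a command file rather than on the command line makes the job easy to rerun and to review.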

Case-specific Options

Option          Reason
-virtualhost    You want to index multiple sites running on the same server. Without -virtualhost, whenever the same file name exists on both sites, only the document from www.mysite.com is indexed. This is because a DNS lookup resolves both sites to webhost.mysite.com, so the documents are treated as duplicates based on name.
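
The duplicate problem can be illustrated with a toy model in shell. This is not how Verity Spider is implemented. It only mimics the behavior described here, using a hypothetical file index.html that exists on both sites and a sed substitution as a stand-in for the DNS lookup.

```shell
#!/bin/sh
# Toy model only: the URLs and the canonical_key helper are hypothetical.
urls='http://www.mysite.com/index.html
http://search.mysite.com/index.html'

# Stand-in for a DNS lookup: both aliases map to the canonical host name.
canonical_key() {
    sed -e 's#^http://www\.mysite\.com#webhost.mysite.com#' \
        -e 's#^http://search\.mysite\.com#webhost.mysite.com#'
}

# Without -virtualhost: documents are keyed by canonical host plus path,
# so the two URLs collapse into a single key.
without_vh=$(printf '%s\n' "$urls" | canonical_key | sort -u | wc -l | tr -d ' ')

# With -virtualhost: the listed host names are kept as given,
# so the two keys stay distinct.
with_vh=$(printf '%s\n' "$urls" | sed -e 's#^http://##' | sort -u | wc -l | tr -d ' ')

echo "distinct documents without -virtualhost: $without_vh"
echo "distinct documents with -virtualhost: $with_vh"
```

In this model the first count is 1 and the second is 2, which matches the behavior described for -virtualhost: the second site's document is no longer discarded as a duplicate.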





Copyright © 1998, Verity, Inc. All rights reserved.