MSpiderCommand-Line Options

MSpider MUST be capable of being run from the command line in batch-mode and MUST NOT require a graphical user interface. Essentially, the user MUST be able to run the spider client with a command line similar to the following:

        java MSpider [options] >&! logfile &
(assuming a csh-like command shell and a UNIX-like environment) and then coming back some later time to view the results.

The spider client MUST support at least the following command-line options. The designer MAY provide additional options, but all such options MUST be fully described in the user documentation.

-c #
Set the CRAWL-MAX parameter to # for this run. Note that different CRAWL-MAXs may be used on different runs.
-i iPage
Set the initial page from which to begin the crawl.
-w moogFileName
Load/save WEB DATABASE to/from moogFileName. If no filename extension is provided, MSpider SHOULD assume an extension of .moog.

In addition, the system SHOULD provide the following option:

-h
Print a help message and then exit.

Finally, MSpiderMAY provide the following options, as well as other options at the designer's discretion:

-d
Enable debugging output.
-p #
Set delay between loading successive PAGEs. If this option is allowed, the user MUST NOT be able to set the delay to a value less than one second (Section 5.2).

Terran Lane 2005-08-23