Support Libraries

A number of library tools are available to help construct the MSpider program. Network access and document retrieval are provided by the java.net package, via the java.net.URL and java.net.HttpURLConnection classes. Elements of the java.io package will also be necessary to support network activities (for example, to read the streams that a connection returns).
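As a minimal sketch of the retrieval step described above, the following shows one way to open a java.net.URL, check the HTTP status when the connection is an HttpURLConnection, and read the body through java.io streams. The class name FetchSketch, the method name fetch, the five-second timeouts, and the UTF-8 decoding are illustrative assumptions and are not part of the MSpider specification.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; the name is not prescribed by the project.
public class FetchSketch {

    /** Retrieves the document at the given URL and returns its body as text. */
    public static String fetch(URL url) throws IOException {
        URLConnection conn = url.openConnection();
        // Assumed timeouts (milliseconds) so a dead server cannot hang the caller.
        conn.setConnectTimeout(5000);
        conn.setReadTimeout(5000);

        // http: and https: URLs yield an HttpURLConnection; other schemes do not.
        if (conn instanceof HttpURLConnection) {
            HttpURLConnection http = (HttpURLConnection) conn;
            int status = http.getResponseCode();
            if (status != HttpURLConnection.HTTP_OK) {
                http.disconnect();
                throw new IOException("HTTP status " + status + " for " + url);
            }
        }

        StringBuilder body = new StringBuilder();
        // Assumption: the document is decoded as UTF-8. A fuller version would
        // consult the Content-Type header for the declared character set.
        try (InputStream in = conn.getInputStream();
             BufferedReader reader = new BufferedReader(
                     new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                body.append(line).append('\n');
            }
        }
        return body.toString();
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java FetchSketch <url>");
            return;
        }
        System.out.print(fetch(new URL(args[0])));
    }
}
```

Because the method accepts any URL, it can be exercised against a local file: URL during development, before pointing it at a live web server.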

Parsing and analysis of HTML documents can be accomplished with the javax.swing.text.html tools. Specifically, the javax.swing.text.html.parser.ParserDelegator class provides the parsing infrastructure, though the designer will need to provide a specialization (subclass) of the javax.swing.text.html.HTMLEditorKit.ParserCallback class to implement the necessary functionality.
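To illustrate how a ParserCallback specialization plugs into ParserDelegator, here is a small sketch that collects the href attribute of every anchor tag in a document. The names LinkExtractorSketch, LinkCollector, and extractLinks are hypothetical, and collecting links is only one example of the functionality a callback might implement; the actual MSpider callback may need to handle other tags and text as well.

```java
import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical example class; the name is not prescribed by the project.
public class LinkExtractorSketch {

    /** Callback that records the href of each <a> start tag the parser reports. */
    static class LinkCollector extends HTMLEditorKit.ParserCallback {
        final List<String> links = new ArrayList<>();

        @Override
        public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
            if (tag == HTML.Tag.A) {
                Object href = attrs.getAttribute(HTML.Attribute.HREF);
                if (href != null) {
                    links.add(href.toString());
                }
            }
        }
    }

    /** Parses the HTML supplied by the reader and returns the hrefs in document order. */
    public static List<String> extractLinks(Reader html) throws IOException {
        LinkCollector collector = new LinkCollector();
        // The third argument (ignoreCharSet = true) tells the parser not to
        // abort when the document declares its own character set.
        new ParserDelegator().parse(html, collector, true);
        return collector.links;
    }
}
```

The callback class also offers handleText, handleEndTag, and handleSimpleTag methods, which can be overridden in the same way when text content or other tags are of interest.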

Other elements of the JDK library may also be useful (e.g., java.util). Most elements of the JDK library are available for use in the Moogle suite, but check Section 5 for restrictions.

The gnu.getopt.Getopt class is a useful tool for handling command-line options, though the designer MAY choose to handle options in a different way. Note, however, that students who have "rolled their own" command-line handling in the past have reported that using Getopt would ultimately have saved time, in spite of the learning curve.

While class lectures, exercises, and labs will describe some of these libraries, there is not enough time to convey all important information in the necessary detail. Therefore, it is the designer's responsibility to read and understand the necessary documentation. In addition to the class text, the online Java API specifications and a number of online tutorial documents will be useful in completing this project.

Terran Lane 2005-02-14