Information Analysis: Overview

The analysis of information is an area of Computer Science rapidly growing in importance. Information analysis is an umbrella term that applies to a multitude of techniques for extracting from massive quantities of information various types of important, interesting, or unexpected phenomena. Because the information of interest is of a wide variety of natures--including structured databases, unstructured text, real-valued sensor data, and digitized images--and because the type of phenomena which we seek varies and often is ill defined, many diverse technologies must be developed and applied in novel ways.

It often is convenient to view information analysis as involving three main steps: data acquisition, information extraction and representation, and analysis.

Data Acquisition

Data of a variety of natures is acquired from a possibly large number of diverse sources. Examples of data and their sources include:

Extraction and Representation

A crucial aspect of the analysis system is the representation, storage, and retrieval of the information under study. Rather than develop distinct analysis techniques for each type of data we might encounter, the best approach, we argue, is to represent and exploit the salient features of data within a common data model and to develop a uniform analysis methodology that operates upon this common model.

Once a suitable representation is chosen, extraction tools are defined for each type of data source to map data from the form gathered into the common representation and to store the resulting data in the underlying database.

The data model devised must be sufficiently rich and flexible to support the variety of data we expect, but it also must be capable of supporting efficiently sophisticated analyses against massively large data sets, including retrieval operations required for data mining. To address these issues, we are interested in customizing and adapting one or more data models well studied in computer science so as to be suitable for our problem. Adaptations may include support for statistical analyses, expert rule bases and axiom systems, complex hierarchical relationships such as and/or relationships, and the identification of data equivalence classes.

Information Analysis

Information analysis requires a suite of sophisticated support tools, including:
small IAL logo Return to
IAL Home

Last Modified: October 28, 1998 by veroff@cs.unm.edu