Next: Data Collection Up: A Research Agent Architecture Previous: Introduction

Overview


  
Figure 1: Main User Interface
\includegraphics[width=3.3in]{graphics/calvin2.eps}

Calvin was designed and implemented to support users as they access documents. As the user browses, Calvin records the documents that the user accesses. Using information retrieval techniques, Calvin creates an index for each document that reflects its subject matter. These documents are then retrieved and suggested to the user when similar documents are accessed in the future. A screenshot of Calvin's resource suggester is shown in Figure 1. Calvin's task and user interactions are described in detail in [8].
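
To make the index-and-suggest idea concrete, the following is a minimal sketch and not Calvin's actual implementation. This section does not say which information retrieval technique Calvin uses, so the sketch assumes a plain term-frequency index compared by cosine similarity. The class name `TermIndex` and its methods are hypothetical.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper: assumes a term-frequency index and cosine similarity,
// neither of which is confirmed as Calvin's actual technique.
final class TermIndex {

    // Builds a term-frequency index from lowercased alphanumeric tokens.
    static Map<String, Integer> index(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    // Cosine similarity between two indexes; 0.0 if either index is empty.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }
        double na = norm(a);
        double nb = norm(b);
        return (na == 0.0 || nb == 0.0) ? 0.0 : dot / (na * nb);
    }

    // Euclidean length of a term-frequency vector.
    private static double norm(Map<String, Integer> v) {
        double sum = 0.0;
        for (int x : v.values()) {
            sum += (double) x * x;
        }
        return Math.sqrt(sum);
    }
}
```

Under these assumptions, a previously recorded document would be suggested when the similarity between its index and the index of the document currently being viewed exceeds some threshold chosen by the researcher.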

Although Calvin was designed in the context of information retrieval, its architecture is intended to be general enough that its components can be used in multiple task domains, and that its data storage techniques can be applied in other research contexts where processing data must be collected and analyzed for system evaluation.

Calvin's architecture is shown in Figure 2. Calvin includes sub-agents as part of an extensible system that collects data from various sources and sends the information for analysis as an XML message. This information is analyzed by a researcher-defined ``Data Analysis'' component. The component conforms to a specific interface specification, so data analysis components can be swapped without changing the rest of the system. The kinds of data that can be passed among system components are specified in a registry and are customizable. This configuration lets researchers rely on Calvin for data collection and storage and focus on issues of analysis.
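
The swappable analysis component and the data registry can be sketched roughly as follows. This is an illustration only: the actual interface specification and registry format are not given in this section, so every name below (`DataAnalysisComponent`, `DataKindRegistry`, `Dispatcher`, `CountingAnalyzer`) is hypothetical, and the payload is shown as a plain string rather than a parsed XML message.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical interface; the real specification is not given in this section.
interface DataAnalysisComponent {
    // Receives one collected item: a registered data kind plus its payload.
    void analyze(String dataKind, String payload);
}

// Hypothetical registry of the kinds of data that components may exchange.
final class DataKindRegistry {
    private final Map<String, String> kinds = new HashMap<>();

    void register(String name, String description) {
        kinds.put(name, description);
    }

    boolean isRegistered(String name) {
        return kinds.containsKey(name);
    }
}

// Routes collected data to whichever analysis component is currently plugged in.
final class Dispatcher {
    private final DataKindRegistry registry;
    private DataAnalysisComponent analyzer;

    Dispatcher(DataKindRegistry registry, DataAnalysisComponent analyzer) {
        this.registry = registry;
        this.analyzer = analyzer;
    }

    // Swaps the analysis component without touching collection or routing.
    void setAnalyzer(DataAnalysisComponent analyzer) {
        this.analyzer = analyzer;
    }

    // Returns false, and forwards nothing, when the data kind is unregistered.
    boolean dispatch(String dataKind, String payload) {
        if (!registry.isRegistered(dataKind)) {
            return false;
        }
        analyzer.analyze(dataKind, payload);
        return true;
    }
}

// Illustrative component that only counts the items it receives per data kind.
final class CountingAnalyzer implements DataAnalysisComponent {
    private final Map<String, Integer> counts = new HashMap<>();

    @Override
    public void analyze(String dataKind, String payload) {
        counts.merge(dataKind, 1, Integer::sum);
    }

    int count(String dataKind) {
        return counts.getOrDefault(dataKind, 0);
    }
}
```

Because the dispatcher depends only on the interface, a researcher could replace `CountingAnalyzer` with a different analysis by calling `setAnalyzer`, leaving the collection side unchanged.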


  
Figure 2: Calvin's Architecture
\includegraphics[width=3.3in]{graphics/calvinarchitecture.eps}

Calvin's architecture has the following features:

1. Data Collection
   (a) Flexible data collection techniques for knowledge-engineering or machine-learning focused approaches to information analysis.
   (b) Information collection from a diverse, extensible set of resource types.
   (c) The ability to interoperate in a multi-agent/multi-platform data collection environment using XML.
2. User Interaction
   (a) An agent user interface for making suggestions to users and allowing user interaction.
   (b) A user interface for gathering data for controlled experiments.
3. Data Analysis
   (a) Creation/use of a test bed of document access behavior for standards creation.
   (b) Facilities for simulating/replaying user browsing.
   (c) Plug-and-play data analysis component interfaces for experimenting with multiple types of analysis without altering the infrastructure.
4. Implementation
   (a) Written in Java for cross-platform development/experimentation, using a number of freely available packages.
   (b) Uses standard protocols for exchanging information, so that data can easily be analyzed in third-party packages.

The following sections discuss each of these in turn.


Travis Bauer
2002-01-25