Next: Introduction
Bauer and Leake
WordSieve:
A Method for Real-Time Context
Extraction
Travis Bauer - David B. Leake
Computer Science Department
Lindley Hall, Indiana University
150 S. Woodlawn Avenue
Bloomington, IN 47405, U.S.A.
http://www.cs.indiana.edu/~{trbauer,leake}
{trbauer,leake}@cs.indiana.edu
Abstract:
In order to be useful, intelligent information retrieval agents must
provide their users with context-relevant information. This paper
presents WordSieve, an algorithm for automatically extracting
information about the context in which documents are consulted during
web browsing. Using information extracted from the stream of
documents consulted by the user, WordSieve automatically builds
context profiles which differentiate sets of documents that
users tend to access in groups. These profiles are used in a research-aiding
system to index documents consulted in the current context and
pro-actively suggest them to users in similar future
contexts. In initial experiments on the capability to match documents
to the task contexts in which they were consulted, WordSieve indexing
outperformed indexing based on Term Frequency/Inverse Document
Frequency, a common document indexing approach for intelligent agents in
information retrieval.
Travis Bauer
2002-01-25