GGrantIndex
← Search

CAREER: Updating Problems in Information Retrieval and a Mathematical Dissection Lab

$419,881FY2006CSENSF

College Of Charleston, Charleston SC

Investigators

Abstract

ABSTRACT 0546622 Amy Langville College of Charleston Updating Problems in Information Retrieval and a Mathematical Dissection Lab This proposal is about information retrieval, the field of organizing information in order to retrieve documents relating to a specific information need. Information retrieval is divided into two categories: re- trieval from web collections and retrieval from smaller, nonlinked document collections, called traditional collections. Both types of retrieval, web and traditional, share at least one common problem: the updating problem. Any time a document collection changes, which inevitably happens often as documents are mod- ified, added, or deleted, several items must be updated, including performance measures, storage files, and memory pointers. Of the many types of updating problems, this proposal deals with mathematical updating problems. A mathematical updating problem for web retrieval that has impact for any application, such as population modeling, webpage ranking, graph theoretic measures, and email analysis, that uses the domi- nant eigenvector of a large, sparse, nonnegative matrix will be studied. For traditional information retrieval, the project will study an updating problem regarding the so-called nonnegative matrix factorization (NMF), which has impact on the many other fields to which the NMF has been applied including image processing, fingerprint query processing, music processing, signal processing, movie animations, machine learning, data mining, and summary extraction for audio and video data. This research will be enhanced by a partnership with SAS Institute, Inc. The Education Phase of this project proposes the design and implementation of a Mathematical Device Dissection Lab. The proposed lab, which is similar in spirit to the mechanical dissection labs found in many engineering schools, aims to solve a common problem with students perception of mathematics: students think mathematics is merely about calculation, not concepts. To remedy this, the proposed lab allows students to dissect in a hands-on, interactive manner virtual mathematical devices that demonstrate the use of fundamental concepts. The lab requires self-study by groups of students, and as such, mimics the research skills needed in the real world. This lab relates to the Research Phase of this project as three of the labs nine modules will come from the above research in information retrieval. Each module describes a virtual mathematical device. Examples include least squares, the vector space model of information retrieval, and Googles method of ranking webpages. All modules must be able to be described in low dimensional space, so that students can visualize the ideas, after which point, they will understand the power of mathematics to enable extensions to higher dimensional space. Modules also contain a scientific computation component as the students extend the simple models to much larger problems. Each module will be posted on a website, making the modules available for worldwide dissemination and adaptation. In addition, the lab will be used in several outreach projects. One of the labs goals is to change students attitudes towards mathematics by demystifying the process of mathematical invention, showing them that mathematics is more than mere calculation, it is a creative process with broad applications.

View original record on NSF Award Search →