GGrantIndex
← Search

The Analysis of Signal Elements in Promoter Sequences

$0Z01FY2004LMNIH

National Library Of Medicine

Investigators

Linked publications & trials

Abstract

The signal elements in promoter sequences are not well characterized. We propose to develop statistical tests to find nucleotide words (generally of length 8) that appear localized relative to TSSs (transcription start site). These words will then form "seeds" for expansion to develop PSSMs (position-specific scoring matrices) characterizing systems of co-regulated genes. To this end, Dr. Marino-Ramirez has collected a database of about 4700 sequences around the TSS of human genes. The database is exceptionally well characterized, and ideal for our statistical study. We are using two statistics, the well respected KS (Kolmogorov-Smirnov) statistic, and a less known Poisson scan statistic, to determine whether occurrences of a given 8-letter DNA word are clustered unusually relative to the TSS. The KS statistic provides a standard for sensitivity, but it does not assign the statistical significance to a particular set of word occurrences. The Poisson scan statistic does, however, and we plan to use its list of significant occurrences as our "seeds" for developing our PSSMs.

View original record on NIH RePORTER →
The Analysis of Signal Elements in Promoter Sequences · GrantIndex