ACTIVE SITE SIGNATURES FOR AUTOMATIC UPDATES OF SFLD SUPERFAMILIES

$23,849 | P41 | FY2009 | RR | NIH

University Of California, San Francisco, San Francisco CA

Abstract

This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator.

A major unsolved problem for structure-function linkage using computational prediction is that, although we can accurately cluster protein sequences and structures with good statistical significance based on many types of similarity metrics, it is not clear how those clusters link to functional classes. Simple approaches such as ortholog prediction can achieve good results for sequences that are closely similar or that contain readily identifiable motifs that distinguish functional classes, but for many protein superfamilies successful prediction is far from trivial. This is the case for the functionally diverse superfamilies in the SFLD. These are homologous sets of enzymes that carry out different chemical transformations, using different substrates, but that all share a specific chemical functionality or partial reaction. The main purpose of the SFLD is to aid researchers in the curation of these types of superfamilies, to help in the identification of new members of these superfamilies, and to provide an explicit structure-function mapping for these enzymes. (For more information about mechanistically diverse enzyme superfamilies, see Gerlt & Babbitt, Annual Rev Biochem. 2001, pp. 209-46.) Because the different functional families in a given superfamily look similar but perform different specific reactions, they are difficult to annotate and easy to misannotate, showing levels of misannotation as high as 80% in the archival databases GenBank NR and TrEMBL (Schnoes, Dodevski, and Babbitt, submitted).
Because sequence information is still becoming available in large volumes, automated methods are required to update the SFLD superfamilies with newly determined sequences and to assign those sequences to the appropriate functional families. Improved methods for achieving these functional assignments are urgently needed. Developing such an approach has been a major focus of the Babbitt and Ferrin groups, in collaboration with the group of Prof. Jacquelyn Fetrow of Wake Forest University.
