Technology Development for a MolBio Knowledge-Base

$611,872R01FY2006LMNIH

University Of Colorado Denver, Aurora CO

Investigators

Linked publications & trials

Paper 38605048 Paper 37270143 Paper 37208468 Paper 36540992 Paper 33954284 Paper 33907758 Paper 33791733 Paper 33691008 Paper 33264411 Paper 33196056 Paper 32387679 Paper 30531622 Paper 30417177 Paper 30294517 Paper 29911205 Paper 29568822 Paper 29568821 Paper 29568820 Paper 29568819 Paper 29308065 Paper 29218915 Paper 29218876 Paper 28818042 Paper 28363736 Paper 28358134 Paper 28105587 Paper 27617858 Paper 27613112 Paper 27504010 Paper 26510531 Paper 26420780 Paper 25937701 Paper 25925016 Paper 25903923 Paper 25670730 Paper 25592601 Paper 24297559 Paper 24048470 Paper 23895341 Paper 23633944 Paper 23613763 Paper 23424141 Paper 22901054 Paper 22833496 Paper 22776079 Paper 22643061 Paper 22627919 Paper 22450367 Paper 22177292 Paper 22151901 Paper 21914784 Paper 21865542 Paper 21540299 Paper 21319786 Paper 21243075 Paper 20975846 Paper 20971216 Paper 20953344 Paper 20920264 Paper 20846932 Paper 20671318 Paper 20505005 Paper 20470899 Paper 20368141 Paper 20016822 Paper 19904832 Paper 19527520 Paper 19461996 Paper 19414535 Paper 19208187 Paper 19106086 Paper 19055758 Paper 18834500 Paper 18834494 Paper 18816389 Paper 18779866 Paper 18590549 Paper 18547432 Paper 18463117 Paper 18412966 Paper 18237434 Paper 18229722 Paper 18172927 Paper 17990508 Paper 17990498 Paper 17990495 Paper 17977867 Paper 17967189 Paper 17803817 Paper 17646325 Paper 17318087 Paper 17134478 Paper 17094227 Paper 17094225 Paper 17011833 Paper 16970551 Paper 16779021 Paper 16678038 Paper 16507357 Paper 16006358

Abstract

DESCRIPTION (provided by applicant): [unreadable] Since the introduction of the Mycin system more than 25 years ago, it has widely been hypothesized that extensive, well-represented computer knowledge-bases will facilitate a wide variety of scientific and clinical tasks. Driven by growing knowledge-management challenges arising from the proliferation of high throughput instrumentation, recently created knowledge-bases in areas related to genomics and related aspects of contemporary biology, such as the Gene Ontology, EcoCyc and PharmGKB, have begun to become integrated into the laboratory practices of a growing number of molecular biologists. However, these successful molecular biology knowledge-bases (MBKBs) have two drawbacks which impede their more general application: each has been narrowed to a particular special purpose, either in its domain of applicability or in the scope of knowledge represented, and each of these knowledge-bases was constructed largely on the basis of enormous human effort. Given the current state of molecular biology data and recent advances in database integration and information extraction technology, we proposed to test the following hypothesis: Current computational technology and existing human-curated knowledge resources are sufficient to build an extensive, high-quality computational knowledge-base of molecular biology. To test this hypothesis we propose to first create tools which can (a) automatically link incommensurate knowledge sources using semantic linking, and (b) use natural language processing techniques to extract new information from NCBrs GeneRIFs and from the GO definitions fields; and second, to evaluate the results of these methods by carefully quantifying the degree to which the induced linkages and extracted assertions are complete, consistent and correct. Although we propose to construct a broad and rich knowledge-base in order to develop and test the adequacy of largely automated methods to leverage existing human-curated collections, we do not propose to build a comprehensive MBKB. [unreadable] [unreadable] [unreadable]

View original record on NIH RePORTER →