Computational Biology Core

$231,853P42FY2008ESNIH

University Of California Berkeley, Berkeley CA

Investigators

Linked publications & trials

Paper 39463491 Paper 39333453 Paper 39326242 Paper 39251742 Paper 39120907 Paper 38964580 Paper 38946984 Paper 38906272 Paper 38904462 Paper 38898328 Paper 38861597 Paper 38752991 Paper 38371654 Paper 38283952 Paper 38110187 Paper 37921440 Paper 37741554 Paper 37694915 Paper 37651660 Paper 37640476 Paper 37409972 Paper 37398941 Paper 37398036 Paper 37295475 Paper 37138425 Paper 37064786 Paper 37008181 Paper 36926844 Paper 36746112 Paper 36696271 Paper 36635734 Paper 36592815 Paper 36573044 Paper 36493610 Paper 36346153 Paper 36201310 Paper 36178055 Paper 36067935 Paper 36017556 Paper 35861429 Paper 35809487 Paper 35769198 Paper 35717575 Paper 35661546 Paper 35588500 Paper 35432890 Paper 35383362 Paper 35162442 Paper 35093747 Paper 34936392 Paper 34750507 Paper 34660833 Paper 34560324 Paper 34558968 Paper 34485590 Paper 34450064 Paper 34435882 Paper 34243768 Paper 34139223 Paper 34105804 Paper 34078642 Paper 34020997 Paper 33870020 Paper 33801661 Paper 33784186 Paper 33622793 Paper 33565865 Paper 33531396 Paper 33511330 Paper 33400181 Paper 33370316 Paper 33189395 Paper 33085113 Paper 33084937 Paper 32989163 Paper 32966806 Paper 32938756 Paper 32861163 Paper 32858110 Paper 32833432 Paper 32766950 Paper 32764800 Paper 32572277 Paper 32519538 Paper 32253360 Paper 32152214 Paper 32123103 Paper 32045263 Paper 32029645 Paper 31960937 Paper 31913608 Paper 31904937 Paper 31898917 Paper 31760269 Paper 31746186 Paper 31719706 Paper 31641032 Paper 31634900 Paper 31631918 Paper 31605413

Abstract

The support provided under Core D reflect a growing trend in studies of[unreadable] environmental exposure from more traditional epidemiological studies and simple experimental designs to[unreadable] high-dimensional biology, with its emphasis on 'omic' technologies and complicated questions addressing[unreadable] the possible interaction of environmental exposures and high-dimensional measures of the genome,[unreadable] proteome, etc. These high-dimensional data sets are characterized by many (thousands) of measurements[unreadable] made on only a few independent units (e.g., people). Thus, the Core D reflects a parallel evolution in the[unreadable] field of biostatistics towards developing methodologies that can both find patterns in high dimensional data[unreadable] sets as well as providing proper statistical inference for these patterns. Besides offering consulting on[unreadable] traditional epidemiological experimental design and analysis questions, Core D will focus its efforts on[unreadable] providing the most relevant and rigorous statistical techniques to the Program's projects. With new 'omic'[unreadable] technologies, biology has entered a new more empirical phase where the goals of the research are[unreadable] ambitious (e.g., discovery of regulatory gene networks affected by particular environmental toxicants), but[unreadable] the sample sizes relatively small (biological replicates numbering in the tens). With these technologies,[unreadable] have come also a proliferation of proposed methods to find biologically meaningful patterns and typically[unreadable] little theory is provided to guide their relative worth. The goal of this Core is to provide the project[unreadable] researchers with the best techniques available, software to help implement them, a computational[unreadable] environment that can handle computer-intensive methods on large data sets and, most importantly,[unreadable] rigorous statistical inference for the parameters estimated by these procedures. A subset of the[unreadable] developments related to the proliferation of high-dimensional biological/epidemiological data particularly[unreadable] relevant to this proposal are 1) multiple testing, 2) machine-learning and loss-based estimation, 3) grouping[unreadable] algorithms methods, 4) causal inference and 5) biological metadata and systems biology. In addition, the[unreadable] Core will provide access to a computational environment that lends itself to the computationally intensive[unreadable] methods developed for data mining and re-sampling based inference.

View original record on NIH RePORTER →