Statistical Methods for Gene Mapping

$585,181R01FY2016GMNIH

University Of California Los Angeles, Los Angeles CA

Investigators

Linked publications & trials

Paper 37448596 Paper 36254789 Paper 34289008 Paper 34174829 Paper 34142722 Paper 34014947 Paper 33941078 Paper 33651796 Paper 33547645 Paper 32655193 Paper 32524061 Paper 32523233 Paper 32133245 Paper 31905256 Paper 31879980 Paper 31649491 Paper 31592195 Paper 30915546 Paper 30623484 Paper 30618485 Paper 30501857 Paper 30201759 Paper 29947758 Paper 29860323 Paper 29630901 Paper 29626666 Paper 29100087 Paper 28875524 Paper 28689109 Paper 28373601 Paper 28369161 Paper 28348500 Paper 28238358 Paper 28231077 Paper 28214848 Paper 27980643 Paper 27943406 Paper 27772736 Paper 27663501 Paper 27663074 Paper 27646141 Paper 27216439 Paper 27112634 Paper 27087770 Paper 26681992 Paper 26637429 Paper 26622074 Paper 26567478 Paper 26549920 Paper 26498930 Paper 26457621 Paper 26366044 Paper 26189819 Paper 26139633 Paper 26097641 Paper 26072484 Paper 25965340 Paper 25898925 Paper 25526526 Paper 25519348 Paper 25392563 Paper 25371484 Paper 25370538 Paper 25359894 Paper 25357204 Paper 25284823 Paper 25242858 Paper 25242816 Paper 25143662 Paper 25104515 Paper 25012181 Paper 24990607 Paper 24955378 Paper 24917141 Paper 24886709 Paper 24743331 Paper 24634545 Paper 24566108 Paper 24443689 Paper 24348518 Paper 24288159 Paper 24039382 Paper 23825370 Paper 23730305 Paper 23610370 Paper 23497424 Paper 23386649 Paper 23382428 Paper 23364324 Paper 23284608 Paper 23233546 Paper 23222517 Paper 22954633 Paper 22897923 Paper 22829776 Paper 22816662 Paper 22218271 Paper 22160768 Paper 22143921 Paper 22139419

Abstract

? DESCRIPTION (provided by applicant): For nearly two decades this grant has developed statistical and computational tools vital to gene mapping. During that period, technology and genomic data changed dramatically. Expression and genotyping chips became standard scientific tools; the full genomes from a host of organisms, including the human species, were sequenced; and low-cost sequencing transitioned from fantasy to reality. The last decade has also witnessed a shift from common to rare SNVs (single nucleotide variants) and from moderate-sized studies to large consortium studies. Simultaneously, computers have grown exponentially in speed and memory. These parallel advances have powered thousands of successful human gene mapping studies for both Mendelian and complex traits. Because these successes have shed light on only a fraction of the heritability of common traits, we have not yet reached the endgame of statistical genetics. There is still need for new ideas and better software. We plan to build on our previous successes, with particular stress on adapting modern methods of data mining to genetic applications. We and others have made great strides in applying penalized estimation and model selection in genomics. Genetic analysis via penalized regression easily handles non-genetic predictors, uncertainty in genotype and sequence calls, corrections for ethnic admixture, quantitative traits and disease dichotomies, gene-gene and gene-environment interactions, and both rare and common variants. Unfortunately, it is now apparent that penalized estimation is hampered by severe shrinkage and inflated false positive rates. Our recent development of the proximal distance algorithms and AIC (Akaike information criterion) guided regression show that severe shrinkage can be eliminated and false positive rates tamed. We are also convinced that haplotypes have been underexploited in genetic analysis. These flag local gene sharing, serve as surrogates for rare variants, capture intragenic interactions, and enable both fixed and random effects QTL (quantitative trait locus) mapping. Our extensive list of aims should not be interpreted as a lack of focus. Our track record shows that we can make progress on a number of fronts simultaneously. All of our efforts are directed toward sharpening the tools of genetic analysis. As our programs SIMWALK, MENDEL, and ADMIXTURE illustrate, we are committed to translating theoretical advances into user-friendly software. These programs are notable for their comprehensiveness, speed, reliability, small memory usage, and detailed documentation. The goal of this grant is to empower the very large genetic studies on the horizon. Collectively, our Specific Aims go a long way towards that goal.

View original record on NIH RePORTER →