Statistical methods for analyzing high-throughput genotype and sequencing data
Case Western Reserve University, Cleveland OH
Investigators
Linked publications & trials
Abstract
DESCRIPTION (provided by applicant): Statistical methods for analyzing high-throughput genotype and sequencing data. Recent advances in sequencing technologies make it possible to sequence a large number of subjects and test many low frequency or rare genetic variants. Many new statistical methods have been developed to analyze sequencing data and the majority of these methods focus on population based samples. It is well- known that rare variants segregate in families and population based methods will suffer lower power when studying rare variants. In addition, how to control population stratification has not been well studied for rare variant analysis. As such, this renewal application requests support to continue developing statistical methods for analyzing large scale genetic data sets. In particular, we propose to 1) Develop unified statistical methods to detect rare genetic variants from sequencing data using both families and unrelated subjects; 2) Develop statistical methods of joint modeling of local ancestry and association in rare variant analysis, at the same time controlling for population stratification; 3) Develop statistical association methods that can be applied to multiple traits using summary statistics from GWAS or whole sequencing data; 4) Develop corresponding software that will be made publicly available. We will collaborate with laboratory-based investigators to obtain appropriate data sets and apply our new analytic methods to this crucial practical problem in genetic epidemiology. In particular, we will be analyzing the GWAS and sequencing data on traits related to hypertension and sleep apnea.
View original record on NIH RePORTER →