GGrantIndex
← Search

Multi-allelic copy number variation of the human genome

$477,500R01FY2013HGNIH

Harvard Medical School, Boston MA

Investigators

Linked publications, trials & patents

Abstract

DESCRIPTION (provided by applicant): The human genome exhibits extensive copy number variation (CNV). We today understand only the simplest form of copy number variation (CNV) - simple deletions and duplications. A large, functionally important and still-uncharacterized form of genome structural variation is multi-allelic copy-number variation (mCNV), involving genes and other functional elements for which three or more segregating alleles give rise to a wide range of copy numbers (such as 2 to 10) per diploid human genome. mCNVs have been refractory to widely used analysis methods and are not assessed in the genome-scale molecular or statistical approaches used to study genetically complex phenotypes in humans. In this work, we will develop approaches and supporting data sets that enable mCNVs to be routinely and rigorously analyzed for relationship to variation in human phenotypes. We will accurately analyze mCNVs in reference populations, using two new approaches, one computational (based on analysis of available whole-genome sequence data) and one molecular (based on PCR in digitally counted microdroplets) for accurately analyzing mCNVs in cohorts (Aim 1). By analyzing these data in a statistical framework that incorporates information about genotypes, allele frequencies, inheritance, and haplotypes, we will place mCNV alleles onto the haplotype maps created by HapMap and 1000 Genomes, and render mCNVs accessible to genotype imputation to the fullest extent possible (Aim 2). We will deeply characterize mCNVs at ten biomedically important loci, to understand these polymorphisms at the levels of population genetics, mutational rates and histories, and relationships to clinical phenotypes (Aim 3). Finally, we will pilot inexpensive in silico genome-wide association studies for mCNVs based on statistical imputation into existing GWAS data sets (Aim 4). The successful completion of this work will lead to the discovery of relationships between disease risk and gene dosage, helping to reveal the molecular etiology of human disease.

View original record on NIH RePORTER →
Multi-allelic copy number variation of the human genome · GrantIndex