GGrantIndex
← Search

Data Analysis and Submission Core: Enabling rapid analysis and public sharing of human virome data

$419,190U54FY2025AGNIH

Broad Institute, Inc., Cambridge MA

Investigators

Abstract

Project Summary- Data Analysis and Submission Core Analyzing the human virome poses significant challenges in analyzing and sharing data. The diversity of viruses and their interactions with the host necessitate advanced bioinformatics tools and computational resources to process the high-throughput sequencing data. In consortium settings, standardizing protocols and data analysis pipelines are crucial to harmonize disparate datasets and ensure consistency and comparability. Overcoming these challenges is key to combining diverse areas of expertise across the consortium and achieving a deeper understanding of the human virome. Our Data Analysis and Submission Core (DASC) is ideally positioned to advance the mission of our Center in generating, analyzing, storing, and sharing large scale human virome data. The Data Analysis and Submission Core will work closely with the Biospecimen Collection Core (BCC) and Biospecimen Analysis Core (BAC) to ensure optimal processing and cataloging of generated data, as well as with the HVP CODCC, NIH, and NCBI to ensure timely and FAIR release of our Center’s data sets and analytic tools. The DASC will achieve this by pursuing four primary activities. 1) High throughput processing of data: the DASC will leverage the Broad Institute's existing infrastructure for data processing and storage to track samples and metadata and enable efficient data processing. 2) Analyzing, integrating, and interpreting of data: the DASC will integrate high-content imaging, omics, and other data types obtained from the Biospecimen Analysis Core, bridging the gap between different data modalities. 3) Constructing the HVP Virome Database: the DASC will assemble and validate data across VCCs, identify viral tropism and analyze variability in the human virome, and develop interactive visualization tools for multi-modal data exploration and tailor existing tools for the human virome. 4) Coordination across the HVP Consortium: the DASC will ensure interoperability and accessibility of tools and data within the VCC and the rest of the consortium.

View original record on NIH RePORTER →