Statistical Learning for Precision Medicine Based on Multi-Source Data

$617,103R01FY2025HLNIH

Stanford University, Stanford CA

Investigators

Linked publications & trials

Paper 38771658 Paper 38073026 Paper 37982071 Paper 37882364 Paper 37707829 Paper 37641619 Paper 37603326 Paper 37398349 Paper 37342044 Paper 37331495 Paper 37293026 Paper 37010873 Paper 36723915 Paper 36653933 Paper 36645553 Paper 36512353 Paper 36372072 Paper 36275859 Paper 35980612 Paper 35754309 Paper 35419255 Paper 35166342 Paper 35076930 Paper 34921121 Paper 34874550 Paper 34799398 Paper 34783094 Paper 34747010 Paper 34734980 Paper 33927801 Paper 33915597 Paper 33767517 Paper 33053155 Paper 32803245 Paper 32628533 Paper 32562264 Paper 32413171 Paper 31820458 Paper 31797368 Paper 30614483 Paper 30430540 Paper 30352486 Paper 30102227 Paper 30098159 Paper 30047148 Paper 29931103 Paper 29788308 Paper 29664143 Paper 29617717 Paper 29608649 Paper 29335682 Paper 29126253 Paper 28901017 Paper 28877311 Paper 28294286 Paper 28273695 Paper 28211943 Paper 27933612 Paper 27107008 Paper 27037494 Paper 26999054 Paper 26692376 Paper 26689167 Paper 26302239 Paper 26236061 Paper 26194988 Paper 26177343 Paper 25729117 Paper 25678939 Paper 25196727 Paper 25122189 Paper 24982461 Paper 24779731 Paper 24659838 Paper 24436503 Paper 24292992 Paper 24058223 Paper 23902636 Paper 23807695 Paper 23494768 Paper 23293405 Paper 23263882 Paper 23254468 Paper 22914867 Paper 22844171 Paper 22294672 Paper 21908541 Paper 21504421 Paper 21415016 Paper 20876663 Paper 20663850 Paper 20618311 Paper 18922759

Abstract

PROJECT SUMMARY/ABSTRACT The pursuit of tailored treatment strategies for individual patients remains crucial for enhancing cost- effectiveness in clinical practice. Despite advancements in statistical methodologies and machine learning, persistent challenges impede the progress and implementation of precision medicine in clinical practice. Limited sample sizes pose a significant hurdle in estimating individualized treatment effects on clinical outcomes, necessitating the utilization of information from multiple data sources. However, effective integration of such data requires appropriately addressing population heterogeneity, privacy constraint, and features alignment across datasets. Furthermore, even with a group of well-developed prediction models of different complexity in place, there is still a need to devise smart strategies for adaptively employing them in practice. Lastly, addressing treatment effect heterogeneity in clinical trials remains challenging, particularly in efficiently synthesizing information from both discovery and validation stages without introducing bias. Our proposal aims to develop innovative solutions to aforementioned problems. First, we will introduce a novel transfer learning approach to accommodate overlapping but non-identical prediction feature sets in source and target populations. Second, we will develop a latent class model leveraging knowledge graph information from multiple sources for flexible feature alignment. Third, an innovative dynamic prediction strategy will be created to optimize the sequence of acquiring prediction features, thereby enhancing prediction accuracy while minimizing measurement cost. Fourth, we will extend reinforcement learning at a single site to federated learning setting under privacy constraints so that adaptive strategy such as personalized dynamic treatment regimen can be better developed. Lastly, we will propose a comprehensive framework for integrating information from both discovery and validation stages in studying the treatment effect heterogeneity, enabling unbiased inference of treatment effects among a selected subgroup of responders. All methodological developments will undergo rigorous numerical studies and real-data applications, ensuring their effectiveness, and will be disseminated widely to benefit the clinical community.

View original record on NIH RePORTER →