RI: SMALL: Causal concept models for causal reasoning and concept discovery in generative AI

$545,359FY2025CSENSF

University Of Chicago, Chicago IL

Investigators

Abstract

Generative artificial intelligence (AI) has transformed information processing and has demonstrated remarkable capacities for creativity in language and imagery. However, despite these advancements, generative AI continues to struggle when it comes to reasoning about cause and effect. Causality is the ability to understand how and why things happen. For example, recognizing which factors bring about a particular outcome, as well as what might happen if conditions change. This lack of the deeper causal understanding in generative AI is an issue because it is often needed for more complex decision-making. This type of reasoning is essential for building AI systems that can generalize more reliably, predict the results of actions or changes, and offer deeper understanding of the systems they are meant to model. This project will address these shortcomings of generative AI systems by developing a new generation of AI models designed specifically to learn causal models. By integrating theoretical principles from causality with recent advances in deep learning, these models will enable robust causal reasoning and meaningful abstraction. Ultimately, this research aims to deliver generative AI systems that are interpretable, verifiable, and robust, significantly enhancing their applicability to reason across a broad range of real-world scenarios. Current generative AI models, which leverage end-to-end deep learning over large, unstructured datasets, demonstrate impressive scalability and expressivity, successfully capturing latent structures useful for various applications. However, these models typically produce representations that are difficult to interpret and reliant on spurious correlations rather than robust causal relationships. Such limitations hinder their effectiveness in mission-critical settings, where interpretable causal reasoning and reliable explanations are essential. This research will develop a new generation of models capable of both causal reasoning and abstraction through automated, data-driven learning of causal representations. This approach maintains expressivity while incorporating explicit statistical guarantees to ensure learned representations are causal, interpretable, and reproducible. Unlike existing approaches that rely heavily on manual specification, known causal graphs, or explicit supervision, our framework is built upon a theoretically rigorous method for discovering causal relationships directly from data. By clearly articulating underlying assumptions within causal graphical models and avoiding reliance on prior knowledge, the framework enables training generative models that intrinsically learn meaningful causal representations from scratch. Incorporating causal reasoning into generative models enhances their ability to make more informed decisions, benefiting a wide range of sectors by improving accuracy and effectiveness across various applications. In education, the impacts will be achieved through the integration of undergraduate and graduate students in research, fostering a deeper understanding of causal relationships and encouraging hands-on learning in cutting-edge areas of AI. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

View original record on NSF Award Search →